NOVEL COMPOSITIONS AND METHODS FOR CANCER

The present application is a continuing application of U.S.S.N.s 09/747,377, filed December 22, 2000,
and 09/798,586, filed March 2, 2001, both of which are expressly incorporated herein by reference.

FIELD OF THE INVENTION 

The present invention relates to novel sequences for use in diagnosis and treatment of cancer, 
especially carcinomas, as well as the use of the novel compositions in screening methods. 

BACKGROUND OF THE INVENTION 

Oncogenes are genes that can cause cancer. Carcinogenesis can occur by a wide variety of 
mechanisms, including infection of cells by viruses containing oncogenes, activation of 
protooncogenes in the host genome, and mutations of protooncogenes and tumor suppressor genes. 

There are a number of viruses known to be involved in human cancer as well as in animal cancer. Of 
particular interest here are viruses that do not contain oncogenes themselves; these are slow- 
transforming retroviruses. They induce tumors by integrating into the host genome and affecting 
neighboring protooncogenes in a variety of ways, including promoter insertion, enhancer insertion, 
and/or truncation of a protooncogene or tumor suppressor gene. The analysis of sequences at or 
near the insertion sites led to the identification of a number of new protooncogenes. 

With respect to lymphoma and leukemia, murine leukemia retrovirus (MuLV), such as SL3-3 or Akv, is 
a potent inducer of tumors when inoculated into susceptible newborn mice, or when carried in the 
germline. A number of sequences have been identified as relevant in the induction of lymphoma and 
leukemia by analyzing the insertion sites; see Sorensen et al., J. of Virology 74:2161 (2000); Hansen
et al., Genome Res. 10(2):237-43 (2000); Sorensen et al., J. Virology 70:4063 (1996); Sorensen et al.,
J. Virology 67:7118 (1993); Joosten et al., Virology 268:308 (2000); and Li et al., Nature Genetics
23:348 (1999); all of which are expressly incorporated by reference herein.

Accordingly, it is an object of the invention to provide sequences involved in cancer and in particular in 
oncogenesis. 

SUMMARY OF THE INVENTION

In accordance with the objects outlined above, the present invention provides methods for screening 
for compositions which modulate carcinomas, especially lymphoma and leukemia. Also provided 
herein are methods of inhibiting proliferation of a cell, preferably a lymphoma cell. Methods of
treatment of carcinomas, including diagnosis, are also provided herein.

In one aspect, a method of screening drug candidates comprises providing a cell that expresses a 
carcinoma associated (CA) gene or fragments thereof. Preferred embodiments of CA genes are 
genes which are differentially expressed in cancer cells, preferably lymphatic, breast, prostate or 
epithelial cells, compared to other cells. Preferred embodiments of CA genes used in the methods 
herein include, but are not limited to, the nucleic acids selected from Tables 1-10. The method further
includes adding a drug candidate to the cell and determining the effect of the drug candidate on the 
expression of the CA gene. 

In one embodiment, the method of screening drug candidates includes comparing the level of
expression in the absence of the drug candidate to the level of expression in the presence of the drug
candidate.

Also provided herein is a method of screening for a bioactive agent capable of binding to a CA protein
(CAP), the method comprising combining the CAP and a candidate bioactive agent, and determining
the binding of the candidate agent to the CAP.

Further provided herein is a method for screening for a bioactive agent capable of modulating the
activity of a CAP. In one embodiment, the method comprises combining the CAP and a candidate
bioactive agent, and determining the effect of the candidate agent on the bioactivity of the CAP.

Also provided is a method of evaluating the effect of a candidate carcinoma drug comprising
administering the drug to a patient and removing a cell sample from the patient. The expression
profile of the cell is then determined. This method may further comprise comparing the expression
profile of the patient to an expression profile of a healthy individual.

In a further aspect, a method for inhibiting the activity of a CA protein is provided. In one
embodiment, the method comprises administering to a patient an inhibitor of a CA protein preferably 
selected from the group consisting of the sequences outlined in Tables 1-10 or their complements. 

A method of neutralizing the effect of a CA protein, preferably a protein encoded by a nucleic acid 
selected from the group of sequences outlined in Tables 1-10, is also provided. Preferably, the
method comprises contacting an agent specific for said protein with said protein in an amount 
sufficient to effect neutralization. 

Moreover, provided herein is a biochip comprising a nucleic acid segment which encodes a CA 
protein, preferably selected from the sequences outlined in Tables 1-10. 

Also provided herein is a method for diagnosing or determining the propensity to carcinomas,
especially lymphoma or leukemia, by sequencing at least one carcinoma or lymphoma gene of an
individual. In yet another aspect of the invention, a method is provided for determining carcinoma
(including lymphoma and leukemia) gene copy number in an individual.

Novel sequences are also provided herein. Other aspects of the invention will become apparent to the 
skilled artisan by the following description of the invention.

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to a number of sequences associated with carcinomas, especially 
lymphoma, breast cancer or prostate cancer. The relatively tight linkage between clonally-integrated
proviruses and protooncogenes forms the basis of "provirus tagging", in which slow-transforming
retroviruses that act by an insertion mutation mechanism are used to isolate protooncogenes. In some
models, uninfected animals have low cancer rates, and infected animals have high cancer rates. It is
known that many of the retroviruses involved do not carry transduced host protooncogenes or
pathogenic trans-acting viral genes, and thus the cancer incidence must be a direct consequence of
proviral integration effects into host protooncogenes. Since proviral integration is random, rare
integrants will "activate" host protooncogenes that provide a selective growth advantage, and these
rare events result in new proviruses at clonal stoichiometries in tumors.

The use of oncogenic retroviruses, whose sequences insert into the genome of the host organism
resulting in carcinoma, allows the identification of host sequences involved in carcinoma. These
sequences may then be used in a number of different ways, including diagnosis, prognosis, screening
for modulators (including both agonists and antagonists), antibody generation (for immunotherapy and
imaging), etc. However, as will be appreciated by those in the art, oncogenes that are identified in one
type of cancer such as lymphoma or leukemia have a strong likelihood of being involved in other types
of cancers as well. Thus, while the sequences outlined herein are initially identified as correlated with
lymphoma, they can also be found in other types of cancers, as outlined below.

Accordingly, the present invention provides nucleic acid and protein sequences that are associated
with carcinoma, herein termed "carcinoma associated" or "CA" sequences. In a preferred
embodiment, the present invention provides nucleic acid and protein sequences that are associated
with carcinomas which originate in lymphatic tissue, herein termed "lymphoma associated", "leukemia
associated" or "LA" sequences.

Suitable cancers which can be diagnosed or screened for using the methods of the present invention
include cancers classified by site or by histological type. Cancers classified by site include cancer of 
the oral cavity and pharynx (lip, tongue, salivary gland, floor of mouth, gum and other mouth, 
nasopharynx, tonsil, oropharynx, hypopharynx, other oral/pharynx); cancers of the digestive system 
(esophagus; stomach; small intestine; colon and rectum; anus, anal canal, and anorectum; liver;
intrahepatic bile duct; gallbladder; other biliary; pancreas; retroperitoneum; peritoneum, omentum, and
mesentery; other digestive); cancers of the respiratory system (nasal cavity, middle ear, and sinuses;
larynx; lung and bronchus; pleura; trachea, mediastinum, and other respiratory); mesothelioma;
cancers of the bones and joints and of soft tissue, including heart; skin cancers, including melanomas
and other non-epithelial skin cancers; Kaposi's sarcoma and breast cancer; cancer of the female
genital system (cervix uteri; corpus uteri; uterus, nos; ovary; vagina; vulva; and other female genital);
cancers of the male genital system (prostate gland; testis; penis; and other male genital); cancers of 
the urinary system (urinary bladder; kidney and renal pelvis; ureter; and other urinary); cancers of the 
eye and orbit; cancers of the brain and nervous system (brain; and other nervous system); cancers of 
the endocrine system (thyroid gland and other endocrine, including thymus); lymphomas
(Hodgkin's disease and non-Hodgkin's lymphoma), multiple myeloma, and leukemias
(lymphocytic leukemia; myeloid leukemia; monocytic leukemia; and other leukemias).

Other cancers, classified by histological type, that may be associated with the sequences of the 
invention include, but are not limited to, Neoplasm, malignant; Carcinoma, NOS; Carcinoma, 
undifferentiated, NOS; Giant and spindle cell carcinoma; Small cell carcinoma, NOS; Papillary
carcinoma, NOS; Squamous cell carcinoma, NOS; Lymphoepithelial carcinoma; Basal cell carcinoma,
NOS; Pilomatrix carcinoma; Transitional cell carcinoma, NOS; Papillary transitional cell carcinoma; 
Adenocarcinoma, NOS; Gastrinoma, malignant; Cholangiocarcinoma; Hepatocellular carcinoma, 
NOS; Combined hepatocellular carcinoma and cholangiocarcinoma; Trabecular adenocarcinoma; 
Adenoid cystic carcinoma; Adenocarcinoma in adenomatous polyp; Adenocarcinoma, familial
polyposis coli; Solid carcinoma, NOS; Carcinoid tumor, malignant; Bronchiolo-alveolar
adenocarcinoma; Papillary adenocarcinoma, NOS; Chromophobe carcinoma; Acidophil carcinoma;
Oxyphilic adenocarcinoma; Basophil carcinoma; Clear cell adenocarcinoma, NOS; Granular cell
carcinoma; Follicular adenocarcinoma, NOS; Papillary and follicular adenocarcinoma;
Nonencapsulating sclerosing carcinoma; Adrenal cortical carcinoma; Endometrioid carcinoma; Skin
appendage carcinoma; Apocrine adenocarcinoma; Sebaceous adenocarcinoma; Ceruminous
adenocarcinoma; Mucoepidermoid carcinoma; Cystadenocarcinoma, NOS; Papillary
cystadenocarcinoma, NOS; Papillary serous cystadenocarcinoma; Mucinous cystadenocarcinoma,
NOS; Mucinous adenocarcinoma; Signet ring cell carcinoma; Infiltrating duct carcinoma; Medullary
carcinoma, NOS; Lobular carcinoma; Inflammatory carcinoma; Paget's disease, mammary; Acinar
cell carcinoma; Adenosquamous carcinoma; Adenocarcinoma w/ squamous metaplasia; Thymoma, 
malignant; Ovarian stromal tumor, malignant; Thecoma, malignant; Granulosa cell tumor, malignant; 
Androblastoma, malignant; Sertoli cell carcinoma; Leydig cell tumor, malignant; Lipid cell tumor, 
malignant; Paraganglioma, malignant; Extra-mammary paraganglioma, malignant; 
Pheochromocytoma; Glomangiosarcoma; Malignant melanoma, NOS; Amelanotic melanoma;
Superficial spreading melanoma; Malignant melanoma in giant pigmented nevus; Epithelioid cell
melanoma; Blue nevus, malignant; Sarcoma, NOS; Fibrosarcoma, NOS; Fibrous histiocytoma, 
malignant; Myxosarcoma; Liposarcoma, NOS; Leiomyosarcoma, NOS; Rhabdomyosarcoma, NOS; 
Embryonal rhabdomyosarcoma; Alveolar rhabdomyosarcoma; Stromal sarcoma, NOS; Mixed tumor, 
malignant, NOS; Mullerian mixed tumor; Nephroblastoma; Hepatoblastoma; Carcinosarcoma, NOS;
Mesenchymoma, malignant; Brenner tumor, malignant; Phyllodes tumor, malignant; Synovial 
sarcoma, NOS; Mesothelioma, malignant; Dysgerminoma; Embryonal carcinoma, NOS; Teratoma, 
malignant, NOS; Struma ovarii, malignant; Choriocarcinoma; Mesonephroma, malignant; 
Hemangiosarcoma; Hemangioendothelioma, malignant; Kaposi's sarcoma; Hemangiopericytoma, 
malignant; Lymphangiosarcoma; Osteosarcoma, NOS; Juxtacortical osteosarcoma; Chondrosarcoma,
NOS; Chondroblastoma, malignant; Mesenchymal chondrosarcoma; Giant cell tumor of bone; Ewing's 
sarcoma; Odontogenic tumor, malignant; Ameloblastic odontosarcoma; Ameloblastoma, malignant; 
Ameloblastic fibrosarcoma; Pinealoma, malignant; Chordoma; Glioma, malignant; Ependymoma, 
NOS; Astrocytoma, NOS; Protoplasmic astrocytoma; Fibrillary astrocytoma; Astroblastoma;
Glioblastoma, NOS; Oligodendroglioma, NOS; Oligodendroblastoma; Primitive neuroectodermal;
Cerebellar sarcoma, NOS; Ganglioneuroblastoma; Neuroblastoma, NOS; Retinoblastoma, NOS;
Olfactory neurogenic tumor; Meningioma, malignant; Neurofibrosarcoma; Neurilemmoma, malignant;
Granular cell tumor, malignant; Malignant lymphoma, NOS; Hodgkin's disease, NOS; Hodgkin's
paragranuloma, NOS; Malignant lymphoma, small lymphocytic; Malignant lymphoma, large cell,
diffuse; Malignant lymphoma, follicular, NOS; Mycosis fungoides; Other specified non-Hodgkin's 
lymphomas; Malignant histiocytosis; Multiple myeloma; Mast cell sarcoma; Immunoproliferative small 
intestinal disease; Leukemia, NOS; Lymphoid leukemia, NOS; Plasma cell leukemia; Erythroleukemia;
Lymphosarcoma cell leukemia; Myeloid leukemia, NOS; Basophilic leukemia; Eosinophilic leukemia;
Monocytic leukemia, NOS; Mast cell leukemia; Megakaryoblastic leukemia; Myeloid sarcoma; and
Hairy cell leukemia. 



Association in this context means that the nucleotide or protein sequences are either differentially 
expressed, activated, inactivated or altered in carcinomas as compared to normal tissue. As outlined 
below, CA sequences include those that are up-regulated (i.e. expressed at a higher level), as well as 
those that are down-regulated (i.e. expressed at a lower level), in carcinomas. CA sequences also
include sequences which have been altered (i.e., truncated sequences or sequences with
substitutions, deletions or insertions, including point mutations) and show either the same expression
profile or an altered profile. In a preferred embodiment, the CA sequences are from humans;
however, as will be appreciated by those in the art, CA sequences from other organisms may be
useful in animal models of disease and drug evaluation; thus, other CA sequences are provided, from
vertebrates, including mammals, including rodents (rats, mice, hamsters, guinea pigs, etc.), primates,
and farm animals (including sheep, goats, pigs, cows, horses, etc.). In some cases, prokaryotic CA
sequences may be useful. CA sequences from other organisms may be obtained using the
techniques outlined below.


CA sequences can include both nucleic acid and amino acid sequences. In a preferred embodiment,
the CA sequences are recombinant nucleic acids. By the term "recombinant nucleic acid" herein is
meant nucleic acid, originally formed in vitro, in general, by the manipulation of nucleic acid by
polymerases and endonucleases, in a form not normally found in nature. Thus an isolated nucleic
acid, in a linear form, or an expression vector formed in vitro by ligating DNA molecules that are not
normally joined, are both considered recombinant for the purposes of this invention. It is understood
that once a recombinant nucleic acid is made and reintroduced into a host cell or organism, it will
replicate non-recombinantly, i.e. using the in vivo cellular machinery of the host cell rather than in vitro 
manipulations; however, such nucleic acids, once produced recombinantly, although subsequently 
replicated non-recombinantly, are still considered recombinant for the purposes of the invention. 



Similarly, a "recombinant protein" is a protein made using recombinant techniques, i.e. through the
expression of a recombinant nucleic acid as depicted above. A recombinant protein is distinguished 
from naturally occurring protein by at least one or more characteristics. For example, the protein may 
be isolated or purified away from some or all of the proteins and compounds with which it is normally 
associated in its wild type host, and thus may be substantially pure. For example, an isolated protein
is unaccompanied by at least some of the material with which it is normally associated in its natural
state, preferably constituting at least about 0.5%, more preferably at least about 5% by weight of the 
total protein in a given sample. A substantially pure protein comprises at least about 75% by weight of 
the total protein, with at least about 80% being preferred, and at least about 90% being particularly
preferred. The definition includes the production of a CA protein from one organism in a different
organism or host cell. Alternatively, the protein may be made at a significantly higher concentration 
than is normally seen, through the use of an inducible promoter or high expression promoter, such 
that the protein is made at increased concentration levels. Alternatively, the protein may be in a form 
not normally found in nature, as in the addition of an epitope tag or amino acid substitutions, insertions 
and deletions, as discussed below. 

In a preferred embodiment, the CA sequences are nucleic acids. As will be appreciated by those in
the art and is more fully outlined below, CA sequences are useful in a variety of applications, including 
diagnostic applications, which will detect naturally occurring nucleic acids, as well as screening 
applications; for example, biochips comprising nucleic acid probes to the CA sequences can be 
generated. In the broadest sense, then, by "nucleic acid" or "oligonucleotide" or grammatical
equivalents herein is meant at least two nucleotides covalently linked together. A nucleic acid of the
present invention will generally contain phosphodiester bonds, although in some cases, as outlined 
below (for example in antisense applications or when a candidate agent is a nucleic acid), nucleic acid 
analogs may be used that have alternate backbones, comprising, for example, phosphoramidate 
(Beaucage et al., Tetrahedron 49(10):1925 (1993) and references therein; Letsinger, J. Org. Chem.
35:3800 (1970); Sprinzl et al., Eur. J. Biochem. 81:579 (1977); Letsinger et al., Nucl. Acids Res.
14:3487 (1986); Sawai et al., Chem. Lett. 805 (1984); Letsinger et al., J. Am. Chem. Soc. 110:4470
(1988); and Pauwels et al., Chemica Scripta 26:141 (1986)), phosphorothioate (Mag et al., Nucleic
Acids Res. 19:1437 (1991); and U.S. Patent No. 5,644,048), phosphorodithioate (Briu et al., J. Am.
Chem. Soc. 111:2321 (1989)), O-methylphosphoroamidite linkages (see Eckstein, Oligonucleotides and
Analogues: A Practical Approach, Oxford University Press), and peptide nucleic acid backbones and
linkages (see Egholm, J. Am. Chem. Soc. 114:1895 (1992); Meier et al., Chem. Int. Ed. Engl. 31:1008
(1992); Nielsen, Nature 365:566 (1993); Carlsson et al., Nature 380:207 (1996), all of which are
incorporated by reference). Other analog nucleic acids include those with positive backbones (Dempcy
et al., Proc. Natl. Acad. Sci. USA 92:6097 (1995)); non-ionic backbones (U.S. Patent Nos. 5,386,023,
5,637,684, 5,602,240, 5,216,141 and 4,469,863; Kiedrowski et al., Angew. Chem. Intl. Ed. English
30:423 (1991); Letsinger et al., J. Am. Chem. Soc. 110:4470 (1988); Letsinger et al., Nucleoside &
Nucleotide 13:1597 (1994); Chapters 2 and 3, ACS Symposium Series 580, "Carbohydrate
Modifications in Antisense Research", Ed. Y.S. Sanghvi and P. Dan Cook; Mesmaeker et al.,
Bioorganic & Medicinal Chem. Lett. 4:395 (1994); Jeffs et al., J. Biomolecular NMR 34:17 (1994);
Tetrahedron Lett. 37:743 (1996)) and non-ribose backbones, including those described in U.S. Patent
Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ACS Symposium Series 580, "Carbohydrate
Modifications in Antisense Research", Ed. Y.S. Sanghvi and P. Dan Cook. Nucleic acids containing
one or more carbocyclic sugars are also included within one definition of nucleic acids (see Jenkins et
al., Chem. Soc. Rev. (1995) pp. 169-176). Several nucleic acid analogs are described in Rawls, C & E
News June 2, 1997 page 35. All of these references are hereby expressly incorporated by reference.
These modifications of the ribose-phosphate backbone may be done for a variety of reasons, for 
example to increase the stability and half-life of such molecules in physiological environments for use 
in anti-sense applications or as probes on a biochip. 

As will be appreciated by those in the art, all of these nucleic acid analogs may find use in the present 
invention. In addition, mixtures of naturally occurring nucleic acids and analogs can be made;
alternatively, mixtures of different nucleic acid analogs may be made.



The nucleic acids may be single stranded or double stranded, as specified, or contain portions of both
double stranded and single stranded sequence. As will be appreciated by those in the art, the depiction
of a single strand ("Watson") also defines the sequence of the other strand ("Crick"); thus the sequences
described herein also include the complement of the sequence. The nucleic acid may be DNA, both
genomic and cDNA, RNA or a hybrid, where the nucleic acid contains any combination of deoxyribo-
and ribo-nucleotides, and any combination of bases, including uracil, adenine, thymine, cytosine,
guanine, inosine, xanthine, hypoxanthine, isocytosine, isoguanine, etc. As used herein, the term
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified nucleosides
such as amino modified nucleosides. In addition, "nucleoside" includes non-naturally occurring analog 
structures. Thus for example the individual units of a peptide nucleic acid, each containing a base, are 
referred to herein as a nucleoside. 
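
By way of illustration only, the relationship between a depicted strand and its complement can be sketched in Python as follows. The function name `reverse_complement` and the restriction to the four standard DNA bases are assumptions made for this sketch; base analogs such as inosine or isocytosine are not handled.

```python
# Illustrative sketch: derive the complementary ("Crick") strand from a
# depicted ("Watson") strand.
# Assumption: only the standard DNA bases A, C, G and T are present.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    unexpected = set(sequence) - set("ACGTacgt")
    if unexpected:
        raise ValueError(f"unsupported characters: {sorted(unexpected)}")
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return sequence.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGGCC"))
```

Applying the function twice returns the original strand, which reflects the statement that either strand defines the other.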

A CA sequence can be initially identified by substantial nucleic acid and/or amino acid sequence
homology to the CA sequences outlined herein. Such homology can be based upon the overall
nucleic acid or amino acid sequence, and is generally determined as outlined below, using either
homology programs or hybridization conditions.

The CA sequences of the invention were initially identified as described herein; basically, infection of
mice with murine leukemia viruses (MLV) resulted in lymphoma, although many of these sequences
will also be involved in other cancers as is generally outlined herein.




The CA sequences outlined herein comprise the insertion sites for the virus. In general, the retrovirus
can cause carcinomas in three basic ways: first, by inserting upstream of a normally silent host
gene and activating it (e.g. promoter insertion); second, by truncating a host gene in a manner that
leads to oncogenesis; or third, by enhancing the transcription of a neighboring gene. For example,
retrovirus enhancers, including SL3-3, are known to act on genes up to approximately 200 kilobases
from the insertion site.

In a preferred embodiment, CA sequences are those that are up-regulated in carcinomas; that is, the 
expression of these genes is higher in carcinoma tissue as compared to normal tissue of the same 
differentiation stage. "Up-regulation" as used herein means at least about 50%, more preferably at
least about 100%, more preferably at least about 150%, more preferably, at least about 200%, with
from 300 to at least 1000% being especially preferred.
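
The thresholds recited above can be illustrated with a short Python sketch. It assumes, for illustration only, that the percentages denote the percent increase of the expression level in carcinoma tissue relative to normal tissue; the text does not fix a particular formula, and the names `percent_increase` and `highest_tier_met` are hypothetical.

```python
# Illustrative sketch only. Assumption: "up-regulation" percentages are read as
# the percent increase of tumor expression over normal expression.
UP_REGULATION_TIERS = (50.0, 100.0, 150.0, 200.0, 300.0)


def percent_increase(tumor_level: float, normal_level: float) -> float:
    """Percent increase of the tumor expression level over the normal level."""
    if normal_level <= 0:
        raise ValueError("normal_level must be positive")
    return (tumor_level - normal_level) / normal_level * 100.0


def highest_tier_met(tumor_level: float, normal_level: float):
    """Return the largest recited threshold that is met, or None if none is met."""
    change = percent_increase(tumor_level, normal_level)
    met = [tier for tier in UP_REGULATION_TIERS if change >= tier]
    return max(met) if met else None


if __name__ == "__main__":
    # A three-fold expression level corresponds to a 200% increase.
    print(percent_increase(30.0, 10.0), highest_tier_met(30.0, 10.0))
```

Under this reading, an expression level three times that of normal tissue is a 200% increase and meets the "at least about 200%" tier.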

In a preferred embodiment, CA sequences are those that are down-regulated in carcinomas; that is,
the expression of these genes is lower in carcinoma tissue as compared to normal tissue of the
same differentiation stage. "Down-regulation" as used herein means at least about 50%, more
preferably at least about 100%, more preferably at least about 150%, more preferably, at least about
200%, with from 300 to at least 1000% being especially preferred.

In a preferred embodiment, CA sequences are those that are altered but show either the same 
expression profile or an altered profile as compared to normal lymphoid tissue of the same 
differentiation stage. "Altered CA sequences" as used herein refers to sequences which are
truncated, contain insertions or contain point mutations. 

CA proteins of the present invention may be classified as secreted proteins, transmembrane proteins 
or intracellular proteins. 

In a preferred embodiment the CA protein is an intracellular protein. Intracellular proteins may be
found in the cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of
cellular function and replication (including, for example, signaling pathways); aberrant expression of
such proteins results in unregulated or dysregulated cellular processes. For example, many
intracellular proteins have enzymatic activity such as protein kinase activity, protein phosphatase
activity, protease activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular
proteins also serve as docking proteins that are involved in organizing complexes of proteins, or 
targeting proteins to various subcellular localizations, and are involved in maintaining the structural 
integrity of organelles. 

An increasingly appreciated concept in characterizing intracellular proteins is the presence in the
proteins of one or more motifs for which defined functions have been attributed. In addition to the
highly conserved sequences found in the enzymatic domain of proteins, highly conserved sequences
have been identified in proteins that are involved in protein-protein interaction. For example, Src-
homology-2 (SH2) domains bind tyrosine-phosphorylated targets in a sequence dependent manner.
PTB domains, which are distinct from SH2 domains, also bind tyrosine phosphorylated targets. SH3
domains bind to proline-rich targets. In addition, PH domains, tetratricopeptide repeats and WD
domains, to name only a few, have been shown to mediate protein-protein interactions. Some of these
may also be involved in binding to phospholipids or other second messengers. As will be appreciated
by one of ordinary skill in the art, these motifs can be identified on the basis of primary sequence;
thus, an analysis of the sequence of proteins may provide insight into both the enzymatic potential of
the molecule and the molecules with which the protein may associate.

In a preferred embodiment, the CA sequences are transmembrane proteins. Transmembrane 
proteins are molecules that span the phospholipid bilayer of a cell. They may have an intracellular 
domain, an extracellular domain, or both. The intracellular domains of such proteins may have a 
number of functions including those already described for intracellular proteins. For example, the
intracellular domain may have enzymatic activity and/or may serve as a binding site for additional
proteins. Frequently the intracellular domain of transmembrane proteins serves both roles. For
example, certain receptor tyrosine kinases have both protein kinase activity and SH2 domains. In
addition, autophosphorylation of tyrosines on the receptor molecule itself creates binding sites for
additional SH2 domain containing proteins.

Transmembrane proteins may contain from one to many transmembrane domains. For example,
receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases and receptor 
serine/threonine protein kinases contain a single transmembrane domain. However, various other 
proteins including channels and adenylyl cyclases contain numerous transmembrane domains. Many 
important cell surface receptors are classified as "seven transmembrane domain" proteins, as they
contain 7 membrane spanning regions. Important transmembrane protein receptors include, but are
not limited to, insulin receptor, insulin-like growth factor receptor, human growth hormone receptor,
glucose transporters, transferrin receptor, epidermal growth factor receptor, low density lipoprotein
receptor, leptin receptor, interleukin receptors, e.g. IL-1 receptor, IL-2 receptor, etc.

Characteristics of transmembrane domains include approximately 20 consecutive hydrophobic amino 
acids that may be followed by charged amino acids. Therefore, upon analysis of the amino acid 
sequence of a particular protein, the localization and number of transmembrane domains within the 
protein may be predicted. 
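
As a simplified illustration of such a prediction, the following Python sketch scans a protein sequence for uninterrupted runs of hydrophobic residues. The set of residues treated as hydrophobic, the minimum run length of 18, and the function name `find_hydrophobic_stretches` are assumptions chosen for this sketch; practical tools generally rely on averaged hydropathy scales rather than strict runs.

```python
import re

# Assumption for this sketch: one-letter codes treated as hydrophobic.
HYDROPHOBIC = "AILMFVWC"


def find_hydrophobic_stretches(protein: str, min_length: int = 18):
    """Return (start, end) spans of candidate transmembrane segments.

    A candidate is an uninterrupted run of at least `min_length` hydrophobic
    residues; `end` is exclusive, as in Python slicing.
    """
    pattern = re.compile(f"[{HYDROPHOBIC}]{{{min_length},}}")
    return [(m.start(), m.end()) for m in pattern.finditer(protein.upper())]


if __name__ == "__main__":
    # Hypothetical sequence: a 20-residue hydrophobic run flanked by charged residues.
    example = "MKKR" + "LIVAL" * 4 + "KRDE"
    print(find_hydrophobic_stretches(example))
```

The number of spans returned gives a rough estimate of the number of transmembrane domains, and the span coordinates give their approximate location.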

The extracellular domains of transmembrane proteins are diverse; however, conserved motifs are 
found repeatedly among various extracellular domains. Conserved structure and/or functions have 
been ascribed to different extracellular motifs. For example, cytokine receptors are characterized by a 
cluster of cysteines and a WSXWS (W= tryptophan, S= serine, X=any amino acid) motif. 
Immunoglobulin-like domains are highly conserved. Mucin-like domains may be involved in cell 
adhesion and leucine-rich repeats participate in protein-protein interactions. 
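
For example, the WSXWS motif noted above can be located in a primary sequence with a simple pattern search. The Python sketch below is illustrative only; the function name `find_wsxws` and the example sequence are hypothetical, and a match on this short motif alone does not establish that a protein is a cytokine receptor.

```python
import re

# W = tryptophan, S = serine, X = any amino acid (any one-letter code here).
WSXWS = re.compile(r"WS[A-Z]WS")


def find_wsxws(protein: str):
    """Return the 0-based start positions of WSXWS motifs in a protein sequence."""
    return [m.start() for m in WSXWS.finditer(protein.upper())]


if __name__ == "__main__":
    print(find_wsxws("AAWSEWSAA"))
```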

Many extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are receptors. Factors that bind the receptor domain include circulating ligands, which may 
be peptides, proteins, or small molecules such as adenosine and the like. For example, growth 
factors such as EGF, FGF and PDGF are circulating growth factors that bind to their cognate 
receptors to initiate a variety of cellular responses. Other factors include cytokines, mitogenic factors, 
neurotrophic factors and the like. Extracellular domains also bind to cell-associated molecules. In this 
respect, they mediate cell-cell interactions. Cell-associated ligands can be tethered to the cell for 
example via a glycosylphosphatidylinositol (GPI) anchor, or may themselves be transmembrane
proteins. Extracellular domains also associate with the extracellular matrix and contribute to the 
maintenance of the cell structure. 

CA proteins that are transmembrane are particularly preferred in the present invention as they are
good targets for immunotherapeutics, as described herein. In addition, as outlined below,
transmembrane proteins can also be useful in imaging modalities.

It will also be appreciated by those in the art that a transmembrane protein can be made soluble by 
removing transmembrane sequences, for example through recombinant methods. Furthermore, 
transmembrane proteins that have been made soluble can be made to be secreted through 
recombinant means by adding an appropriate signal sequence. 

In a preferred embodiment, the CA proteins are secreted proteins, the secretion of which can be either
constitutive or regulated. These proteins have a signal peptide or signal sequence that targets the 
molecule to the secretory pathway. Secreted proteins are involved in numerous physiological events; 
by virtue of their circulating nature, they serve to transmit signals to various other cell types. The 
secreted protein may function in an autocrine manner (acting on the cell that secreted the factor), a 
paracrine manner (acting on cells in close proximity to the cell that secreted the factor) or an 
endocrine manner (acting on cells at a distance). Thus secreted molecules find use in modulating or 
altering numerous aspects of physiology. CA proteins that are secreted proteins are particularly 
preferred in the present invention as they serve as good targets for diagnostic markers, for example
for blood tests. 

A CA sequence is initially identified by substantial nucleic acid and/or amino acid sequence homology
to the CA sequences outlined herein. Such homology can be based upon the overall nucleic acid or
amino acid sequence, and is generally determined as outlined below, using either homology programs
or hybridization conditions.

As used herein, a nucleic acid is a "CA nucleic acid" if the overall homology of the nucleic acid
sequence to one of the nucleic acids of Tables 1-10 is preferably greater than about 75%, more
preferably greater than about 80%, even more preferably greater than about 85% and most preferably
greater than 90%. In some embodiments the homology will be as high as about 93 to 95 or 98%. In a
preferred embodiment, the sequences which are used to determine sequence identity or similarity are 
selected from those of the nucleic acids of Tables 1-10. In another embodiment, the sequences are 
naturally occurring allelic variants of the sequences of the nucleic acids of Tables 1-10. In another 
embodiment, the sequences are sequence variants as further described herein. 

Homology in this context means sequence similarity or identity, with identity being preferred. A
preferred comparison for homology purposes is to compare the sequence containing sequencing
errors to the correct sequence. This homology will be determined using standard techniques known in
the art, including, but not limited to, the local homology algorithm of Smith & Waterman, Adv. Appl.
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol.
48:443 (1970), by the search for similarity method of Pearson & Lipman, PNAS USA 85:2444 (1988),
by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the
Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Drive, Madison, WI),
the Best Fit sequence program described by Devereux et al., Nucl. Acid Res. 12:387-395 (1984),
preferably using the default settings, or by inspection.
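By way of illustration only, the local homology approach named above can be sketched in a few lines of Python. This is a minimal sketch and not the GAP or BESTFIT implementation: the function name, the scoring values (match +2, mismatch -1) and the linear gap penalty of -2 are assumptions chosen for the example, whereas the packaged programs use their own default scoring schemes.

```python
def smith_waterman_score(a: str, b: str, match: int = 2,
                         mismatch: int = -1, gap: int = -2) -> int:
    """Return the best local alignment score between sequences a and b.

    Scoring values are illustrative assumptions, not program defaults.
    """
    # prev holds the previous row of the dynamic-programming matrix.
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Score for aligning a[i-1] with b[j-1].
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, hence the floor of 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best
```

With these assumed scores, two sequences sharing only the segment ACGT score 8 regardless of the unrelated flanking residues, which is the property that distinguishes a local from a global alignment.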

One example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence alignment from a
group of related sequences using progressive, pairwise alignments. It can also plot a tree showing the
clustering relationships used to create the alignment. PILEUP uses a simplification of the progressive
alignment method of Feng & Doolittle, J. Mol. Evol. 35:351-360 (1987); the method is similar to that
described by Higgins & Sharp, CABIOS 5:151-153 (1989). Useful PILEUP parameters include a
default gap weight of 3.00, a default gap length weight of 0.10, and weighted end gaps.

Another example of a useful algorithm is the BLAST algorithm, described in Altschul et al., J. Mol. Biol.
215, 403-410, (1990) and Karlin et al., PNAS USA 90:5873-5877 (1993). A particularly useful BLAST
program is the WU-BLAST-2 program which was obtained from Altschul et al., Methods in
Enzymology, 266: 460-480 (1996); http://blast.wust!]. WU-BLAST-2 uses several search parameters,
most of which are set to the default values. The adjustable parameters are set with the following
values: overlap span = 1, overlap fraction = 0.125, word threshold (T) = 11. The HSP S and HSP S2
parameters are dynamic values and are established by the program itself depending upon the
composition of the particular sequence and composition of the particular database against which the
sequence of interest is being searched; however, the values may be adjusted to increase sensitivity.

A % amino acid sequence identity value is determined by the number of matching identical residues
divided by the total number of residues of the "longer" sequence in the aligned region. The "longer"
sequence is the one having the most actual residues in the aligned region (gaps introduced by
WU-BLAST-2 to maximize the alignment score are ignored).
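The identity calculation just described can be illustrated with a short Python sketch. It assumes the two sequences have already been aligned (for example by WU-BLAST-2) and are supplied as equal-length strings in which "-" marks an introduced gap; the function name and the gap character are assumptions made for the example.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over an aligned region, relative to the "longer" sequence.

    The "longer" sequence is the one with more actual residues in the
    aligned region; gap characters are not counted as residues.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Count columns where both sequences carry the same residue.
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap)
    residues_a = sum(1 for x in aligned_a if x != gap)
    residues_b = sum(1 for y in aligned_b if y != gap)
    longer = max(residues_a, residues_b)
    if longer == 0:
        raise ValueError("aligned region contains no residues")
    return 100.0 * matches / longer
```

For the aligned pair ACGT-A and ACCTTA there are 4 matching residues and the longer sequence has 6 residues, giving about 66.7% identity.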

Thus, "percent (%) nucleic acid sequence identity" is defined as the percentage of nucleotide residues
in a candidate sequence that are identical with the nucleotide residues of the nucleic acids of Tables
1-10. A preferred method utilizes the BLASTN module of WU-BLAST-2 set to the default parameters, 
with overlap span and overlap fraction set to 1 and 0.125, respectively. 

The alignment may include the introduction of gaps in the sequences to be aligned. In addition, for
sequences which contain either more or fewer nucleotides than those of the nucleic acids of Tables
1-10, it is understood that the percentage of homology will be determined based on the number of
homologous nucleosides in relation to the total number of nucleosides. Thus, for example, homology
of sequences shorter than those of the sequences identified herein and as discussed below, will be
determined using the number of nucleosides in the shorter sequence.

In one embodiment, the nucleic acid homology is determined through hybridization studies. Thus, for
example, nucleic acids which hybridize under high stringency to the nucleic acids identified in the
figures, or their complements, are considered CA sequences. High stringency conditions are known
in the art; see for example Maniatis et al., Molecular Cloning: A Laboratory Manual, 2d Edition, 1989,
and Short Protocols in Molecular Biology, ed. Ausubel, et al., both of which are hereby incorporated by
reference. Stringent conditions are sequence-dependent and will be different in different
circumstances. Longer sequences hybridize specifically at higher temperatures. An extensive guide
to the hybridization of nucleic acids is found in Tijssen, Techniques in Biochemistry and Molecular
Biology-Hybridization with Nucleic Acid Probes, "Overview of principles of hybridization and the
strategy of nucleic acid assays" (1993). Generally, stringent conditions are selected to be about
5-10°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength
and pH. The Tm is the temperature (under defined ionic strength, pH and nucleic acid concentration) at
which 50% of the probes complementary to the target hybridize to the target sequence at equilibrium
(as the target sequences are present in excess, at Tm, 50% of the probes are occupied at
equilibrium). Stringent conditions will be those in which the salt concentration is less than about 1.0 M
sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or other salts) at pH 7.0 to 8.3 and
the temperature is at least about 30°C for short probes (e.g. 10 to 50 nucleotides) and at least about
60°C for long probes (e.g. greater than 50 nucleotides). Stringent conditions may also be achieved
with the addition of destabilizing agents such as formamide.
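The numerical ranges recited above can be collected into a small Python sketch for illustration. The class and function names are hypothetical, and treating the recited "about" values as hard cutoffs is a simplifying assumption; the sketch does not model destabilizing agents such as formamide, and the Tm must be supplied by the user.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class HybridizationConditions:
    sodium_molar: float      # sodium ion concentration, in M
    ph: float
    temperature_c: float     # hybridization temperature, in degrees C
    probe_length_nt: int     # probe length, in nucleotides


def stringent_temperature_range(tm_c: float) -> Tuple[float, float]:
    """Temperature window about 5-10 degrees C below a known Tm."""
    return (tm_c - 10.0, tm_c - 5.0)


def is_stringent(c: HybridizationConditions) -> bool:
    """Check conditions against the recited ranges, read as hard cutoffs."""
    salt_ok = c.sodium_molar < 1.0            # less than about 1.0 M sodium ion
    ph_ok = 7.0 <= c.ph <= 8.3                # pH 7.0 to 8.3
    # Short probes (up to 50 nt) need at least 30 C; longer probes at least 60 C.
    min_temp = 30.0 if c.probe_length_nt <= 50 else 60.0
    return salt_ok and ph_ok and c.temperature_c >= min_temp
```

Under these assumptions, an 80-nucleotide probe hybridized at 45°C in 0.1 M sodium ion at pH 7.5 would not qualify as stringent, whereas a 25-nucleotide probe under the same conditions would.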

In another embodiment, less stringent hybridization conditions are used; for example, moderate or low 
stringency conditions may be used, as are known in the art; see Maniatis and Ausubel, supra, and 
Tijssen, supra.

In addition, the CA nucleic acid sequences of the invention are fragments of larger genes, i.e. they are 
nucleic acid segments. Alternatively, the CA nucleic acid sequences can serve as indicators of 
oncogene position, for example, the CA sequence may be an enhancer that activates a 
protooncogene. "Genes" in this context includes coding regions, non-coding regions, and mixtures of
coding and non-coding regions. Accordingly, as will be appreciated by those in the art, using the
sequences provided herein, additional sequences of the CA genes can be obtained, using techniques
well known in the art for cloning either longer sequences or the full length sequences; see Maniatis et 
al., and Ausubel, et al., supra, hereby expressly incorporated by reference. In general, this is done 
using PCR, for example, kinetic PCR. 

Once the CA nucleic acid is identified, it can be cloned and, if necessary, its constituent parts
recombined to form the entire CA nucleic acid. Once isolated from its natural source, e.g., contained
within a plasmid or other vector or excised therefrom as a linear nucleic acid segment, the
recombinant CA nucleic acid can be further used as a probe to identify and isolate other CA nucleic
acids, for example additional coding regions. It can also be used as a "precursor" nucleic acid to
make modified or variant CA nucleic acids and proteins.

The CA nucleic acids of the present invention are used in several ways. In a first embodiment, nucleic 
acid probes to the CA nucleic acids are made and attached to biochips to be used in screening and 
diagnostic methods, as outlined below, or for administration, for example for gene therapy and/or 
antisense applications. Alternatively, the CA nucleic acids that include coding regions of CA proteins 
can be put into expression vectors for the expression of CA proteins, again either for screening
purposes or for administration to a patient.

In a preferred embodiment, nucleic acid probes to CA nucleic acids (both the nucleic acid sequences
outlined in the figures and/or the complements thereof) are made. The nucleic acid probes attached
to the biochip are designed to be substantially complementary to the CA nucleic acids, i.e. the target
sequence (either the target sequence of the sample or to other probe sequences, for example in
sandwich assays), such that hybridization of the target sequence and the probes of the present
invention occurs. As outlined below, this complementarity need not be perfect; there may be any
number of base pair mismatches which will interfere with hybridization between the target sequence
and the single stranded nucleic acids of the present invention. However, if the number of mutations is
so great that no hybridization can occur under even the least stringent of hybridization conditions, the
sequence is not a complementary target sequence. Thus, by "substantially complementary" herein is
meant that the probes are sufficiently complementary to the target sequences to hybridize under
normal reaction conditions, particularly high stringency conditions, as outlined herein.

A nucleic acid probe is generally single stranded but can be partially single and partially double
stranded. The strandedness of the probe is dictated by the structure, composition, and properties of
the target sequence. In general, the nucleic acid probes range from about 8 to about 100 bases long,
with from about 10 to about 80 bases being preferred, and from about 30 to about 50 bases being 
particularly preferred. That is, generally whole genes are not used. In some embodiments, much 
longer nucleic acids can be used, up to hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with either overlapping
probes or probes to different sections of the target being used. That is, two, three, four or more 
probes, with three being preferred, are used to build in a redundancy for a particular target. The 
probes can be overlapping (i.e. have some sequence in common), or separate. 
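As an illustration of building redundancy with overlapping probes, the following Python sketch tiles a target sequence with a chosen number of complementary probes. The function names, the default probe length of 40 bases (within the particularly preferred 30 to 50 base range), the default of three probes and the 10-base overlap are assumptions made for the example; setting the overlap to zero yields separate, non-overlapping probes.

```python
from typing import List

# Translation table for complementing DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def design_probes(target: str, probe_len: int = 40,
                  n_probes: int = 3, overlap: int = 10) -> List[str]:
    """Tile the start of `target` with probes complementary to it.

    Consecutive probes cover windows that share `overlap` bases.
    """
    if n_probes < 1:
        raise ValueError("at least one probe is required")
    if not 0 <= overlap < probe_len:
        raise ValueError("overlap must be non-negative and shorter than the probe")
    step = probe_len - overlap
    needed = probe_len + step * (n_probes - 1)
    if len(target) < needed:
        raise ValueError("target is too short for the requested probes")
    # Each probe is the reverse complement of its window on the target strand.
    return [reverse_complement(target[k * step:k * step + probe_len])
            for k in range(n_probes)]
```

With the default values, three 40-base probes cover the first 100 bases of the target, and each adjacent pair of probes has 10 bases of target sequence in common.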

As will be appreciated by those in the art, nucleic acids can be attached or immobilized to a solid 
support in a wide variety of ways. By "immobilized" and grammatical equivalents herein is meant the
association or binding between the nucleic acid probe and the solid support is sufficient to be stable 
under the conditions of binding, washing, analysis, and removal as outlined below. The binding can be 
covalent or non-covalent. By "non-covalent binding" and grammatical equivalents herein is meant one
or more of electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent
binding is the covalent attachment of a molecule, such as streptavidin, to the support and the
non-covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" and grammatical
equivalents herein is meant that the two moieties, the solid support and the probe, are attached by at 
least one bond, including sigma bonds, pi bonds and coordination bonds. Covalent bonds can be 
formed directly between the probe and the solid support or can be formed by a cross linker or by 
inclusion of a specific reactive group on either the solid support or the probe or both molecules. 
Immobilization may also involve a combination of covalent and non-covalent interactions. 

In general, the probes are attached to the biochip in a wide variety of ways, as will be appreciated by 
those in the art. As described herein, the nucleic acids can either be synthesized first, with 
subsequent attachment to the biochip, or can be directly synthesized on the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or other 
grammatical equivalents herein is meant any material that can be modified to contain discrete 
individual sites appropriate for the attachment or association of the nucleic acid probes and is 
amenable to at least one detection method. As will be appreciated by those in the art, the number of
possible substrates is very large; they include, but are not limited to, glass and modified or
functionalized glass, plastics (including acrylics, polystyrene and copolymers of styrene and other 
materials, polypropylene, polyethylene, polybutylene, polyurethanes, Teflon™, etc.), polysaccharides, 
nylon or nitrocellulose, resins, silica or silica-based materials including silicon and modified silicon, 
carbon, metals, inorganic glasses, etc. In general, the substrates allow optical detection and do not 
appreciably fluoresce. 

In a preferred embodiment, the surface of the biochip and the probe may be derivatized with chemical 
functional groups for subsequent attachment of the two. Thus, for example, the biochip is derivatized 
with a chemical functional group including, but not limited to, amino groups, carboxy groups, oxo 
groups and thiol groups, with amino groups being particularly preferred. Using these functional 
groups, the probes can be attached using functional groups on the probes. For example, nucleic 
acids containing amino groups can be attached to surfaces comprising amino groups, for example 
using linkers as are known in the art; for example, homo- or hetero-bifunctional linkers as are well
known (see 1994 Pierce Chemical Company catalog, technical section on cross-linkers, pages 
155-200, incorporated herein by reference). In addition, in some cases, additional linkers, such as 
alkyl groups (including substituted and heteroalkyl groups) may be used. 

In this embodiment, the oligonucleotides are synthesized as is known in the art, and then attached to 
the surface of the solid support. As will be appreciated by those skilled in the art, either the 5' or 3' 
terminus may be attached to the solid support, or attachment may be via an internal nucleoside. 

In an additional embodiment, the immobilization to the solid support may be very strong, yet
non-covalent. For example, biotinylated oligonucleotides can be made, which bind to surfaces covalently
coated with streptavidin, resulting in attachment.

Alternatively, the oligonucleotides may be synthesized on the surface, as is known in the art. For 
example, photoactivation techniques utilizing photopolymerization compounds and techniques are 
used. In a preferred embodiment, the nucleic acids can be synthesized in situ, using well known 
photolithographic techniques, such as those described in WO 95/25116; WO 95/35505; U.S. Patent
Nos. 5,700,637 and 5,445,934; and references cited within, all of which are expressly incorporated by 
reference; these methods of attachment form the basis of the Affymetrix GeneChip technology. 

In addition to the solid-phase technology represented by biochip arrays, gene expression can also be
quantified using liquid-phase arrays. One such system is kinetic polymerase chain reaction (PCR).
Kinetic PCR allows for the simultaneous amplification and quantification of specific nucleic acid
sequences. The specificity is derived from synthetic oligonucleotide primers designed to preferentially
adhere to single-stranded nucleic acid sequences bracketing the target site. This pair of
oligonucleotide primers forms specific, non-covalently bound complexes on each strand of the target
sequence. These complexes facilitate in vitro transcription of double-stranded DNA in opposite
orientations. Temperature cycling of the reaction mixture creates a continuous cycle of primer
binding, transcription, and re-melting of the nucleic acid to individual strands. The result is an
exponential increase of the target dsDNA product. This product can be quantified in real time either
through the use of an intercalating dye or a sequence specific probe. SYBR® Green I is an example
of an intercalating dye that preferentially binds to dsDNA, resulting in a concomitant increase in the
fluorescent signal. Sequence specific probes, such as used with TaqMan® technology, consist of a
fluorochrome and a quenching molecule covalently bound to opposite ends of an oligonucleotide. The
probe is designed to selectively bind the target DNA sequence between the two primers. When the
DNA strands are synthesized during the PCR reaction, the fluorochrome is cleaved from the probe by
the exonuclease activity of the polymerase, resulting in signal dequenching. The probe signaling
method can be more specific than the intercalating dye method, but in each case, signal strength is
proportional to the dsDNA product produced. Each type of quantification method can be used in
multi-well liquid phase arrays with each well representing primers and/or probes specific to nucleic acid
sequences of interest. When used with messenger RNA preparations of tissues or cell lines, an
array of probe/primer reactions can simultaneously quantify the expression of multiple gene products
of interest. See Germer, S., et al., Genome Res. 10:258-266 (2000); Heid, C. A., et al., Genome Res.
6:986-994 (1996).
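The exponential increase of product and its use for quantification can be illustrated with a simplified Python model. The model is an assumption introduced for this example and is not taken from the cited references: it supposes that each cycle multiplies the product by (1 + E), where E is a constant per-cycle efficiency between 0 and 1, and that signal is read at a fixed threshold amount of product. All function and parameter names are hypothetical.

```python
import math


def product_after_cycles(n0: float, cycles: int, efficiency: float = 1.0) -> float:
    """Amount of dsDNA product after `cycles`, assuming constant efficiency."""
    if n0 <= 0 or not 0.0 < efficiency <= 1.0:
        raise ValueError("n0 must be positive and efficiency in (0, 1]")
    return n0 * (1.0 + efficiency) ** cycles


def threshold_cycle(n0: float, threshold: float, efficiency: float = 1.0) -> float:
    """Fractional cycle at which the product reaches `threshold`."""
    if n0 <= 0 or threshold <= 0 or not 0.0 < efficiency <= 1.0:
        raise ValueError("amounts must be positive and efficiency in (0, 1]")
    return math.log(threshold / n0) / math.log(1.0 + efficiency)


def fold_difference(ct_sample: float, ct_reference: float,
                    efficiency: float = 1.0) -> float:
    """Starting-template ratio (sample / reference) from two threshold cycles."""
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in (0, 1]")
    # A sample that crosses the threshold earlier started with more template.
    return (1.0 + efficiency) ** (ct_reference - ct_sample)
```

Under this model, with perfect doubling, a sample that crosses the threshold three cycles earlier than a reference began with about eight times as much template, which is how per-well signals in a multi-well array can be compared.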

In a preferred embodiment, CA nucleic acids encoding CA proteins are used to make a variety of 
expression vectors to express CA proteins which can then be used in screening assays, as described 
below. The expression vectors may be either self-replicating extrachromosomal vectors or vectors
which integrate into a host genome. Generally, these expression vectors include transcriptional and
translational regulatory nucleic acid operably linked to the nucleic acid encoding the CA protein. The
term "control sequences" refers to DNA sequences necessary for the expression of an operably linked
coding sequence in a particular host organism. The control sequences that are suitable for
prokaryotes, for example, include a promoter, optionally an operator sequence, and a ribosome
binding site. Eukaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers.

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic 
acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA
for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; 
a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the 
sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to 
facilitate translation. Generally, "operably linked" means that the DNA sequences being linked are 
contiguous, and, in the case of a secretory leader, contiguous and in reading phase. However, 
enhancers do not have to be contiguous. Linking is accomplished by ligation at convenient restriction 
sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are used in accordance 
with conventional practice. The transcriptional and translational regulatory nucleic acid will generally
be appropriate to the host cell used to express the CA protein; for example, transcriptional and
translational regulatory nucleic acid sequences from Bacillus are preferably used to express the CA
protein in Bacillus. Numerous types of appropriate expression vectors, and suitable regulatory 
sequences are known in the art for a variety of host cells. 

In general, the transcriptional and translational regulatory sequences may include, but are not limited
to, promoter sequences, ribosomal binding sites, transcriptional start and stop sequences,
translational start and stop sequences, and enhancer or activator sequences. In a preferred
embodiment, the regulatory sequences include a promoter and transcriptional start and stop 
sequences. 

Promoter sequences encode either constitutive or inducible promoters. The promoters may be either 
naturally occurring promoters or hybrid promoters. Hybrid promoters, which combine elements of 
more than one promoter, are also known in the art, and are useful in the present invention. 

In addition, the expression vector may comprise additional elements. For example, the expression 
vector may have two replication systems, thus allowing it to be maintained in two organisms, for 
example in mammalian or insect cells for expression and in a procaryotic host for cloning and 
amplification. Furthermore, for integrating expression vectors, the expression vector contains at least 
one sequence homologous to the host cell genome, and preferably two homologous sequences which 
flank the expression construct. The integrating vector may be directed to a specific locus in the host 
cell by selecting the appropriate homologous sequence for inclusion in the vector. Constructs for 
integrating vectors are well known in the art. 

In addition, in a preferred embodiment, the expression vector contains a selectable marker gene to 
allow the selection of transformed host cells. Selection genes are well known in the art and will vary 
with the host cell used. 

The CA proteins of the present invention are produced by culturing a host cell transformed with an 
expression vector containing nucleic acid encoding a CA protein, under the appropriate conditions to
induce or cause expression of the CA protein. The conditions appropriate for CA protein expression 
will vary with the choice of the expression vector and the host cell, and will be easily ascertained by 
one skilled in the art through routine experimentation. For example, the use of constitutive promoters 
in the expression vector will require optimizing the growth and proliferation of the host cell, while the 
use of an inducible promoter requires the appropriate growth conditions for induction. In addition, in 
some embodiments, the timing of the harvest is important. For example, the baculoviral systems used 
in insect cell expression are lytic viruses, and thus harvest time selection can be crucial for product
yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect, plant and animal cells,
including mammalian cells. Of particular interest are Drosophila melanogaster cells, Saccharomyces
cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, Neurospora,
BHK, CHO, COS, HeLa cells, THP1 cell line (a macrophage cell line) and human cells and cell lines.

In a preferred embodiment, the CA proteins are expressed in mammalian cells. Mammalian
expression systems are also known in the art, and include retroviral systems. A preferred expression
vector system is a retroviral vector system such as is generally described in PCT/US97/01019 and
PCT/US97/01048, both of which are hereby expressly incorporated by reference. Of particular use as
mammalian promoters are the promoters from mammalian viral genes, since the viral genes are often
highly expressed and have a broad host range. Examples include the SV40 early promoter, mouse
mammary tumor virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter,
and the CMV promoter. Typically, transcription termination and polyadenylation sequences
recognized by mammalian cells are regulatory regions located 3' to the translation stop codon and
thus, together with the promoter elements, flank the coding sequence. Examples of transcription
terminator and polyadenylation signals include those derived from SV40.

The methods of introducing exogenous nucleic acid into mammalian hosts, as well as other hosts, are
well known in the art, and will vary with the host cell used. Techniques include dextran-mediated
transfection, calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion,
electroporation, viral infection, encapsulation of the polynucleotide(s) in liposomes, and direct
microinjection of the DNA into nuclei.

In a preferred embodiment, CA proteins are expressed in bacterial systems. Bacterial expression
systems are well known in the art. Promoters from bacteriophage may also be used and are known in
the art. In addition, synthetic promoters and hybrid promoters are also useful; for example, the tac
promoter is a hybrid of the trp and lac promoter sequences. Furthermore, a bacterial promoter can
include naturally occurring promoters of non-bacterial origin that have the ability to bind bacterial RNA
polymerase and initiate transcription. In addition to a functioning promoter sequence, an efficient
ribosome binding site is desirable. The expression vector may also include a signal peptide sequence
that provides for secretion of the CA protein in bacteria. The protein is either secreted into the growth
media (gram-positive bacteria) or into the periplasmic space, located between the inner and outer
membrane of the cell (gram-negative bacteria). The bacterial expression vector may also include a
selectable marker gene to allow for the selection of bacterial strains that have been transformed.
Suitable selection genes include genes which render the bacteria resistant to drugs such as ampicillin,
chloramphenicol, erythromycin, kanamycin, neomycin and tetracycline. Selectable markers also
include biosynthetic genes, such as those in the histidine, tryptophan and leucine biosynthetic
pathways. These components are assembled into expression vectors. Expression vectors for bacteria
are well known in the art, and include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, and
Streptococcus lividans, among others. The bacterial expression vectors are transformed into bacterial
host cells using techniques well known in the art, such as calcium chloride treatment, electroporation,
and others.

In one embodiment, CA proteins are produced in insect cells. Expression vectors for the
transformation of insect cells, and in particular, baculovirus-based expression vectors, are well known 
in the art. 

In a preferred embodiment, CA protein is produced in yeast cells. Yeast expression systems are well 
known in the art, and include expression vectors for Saccharomyces cerevisiae, Candida albicans and 
C. maltosa, Hansenula polymorpha, Kluyveromyces fragilis and K. lactis, Pichia guilliermondii and P.
pastoris, Schizosaccharomyces pombe, and Yarrowia lipolytica. 

The CA protein may also be made as a fusion protein, using techniques well known in the art. Thus,
for example, for the creation of monoclonal antibodies, if the desired epitope is small, the CA protein
may be fused to a carrier protein to form an immunogen. Alternatively, the CA protein may be made
as a fusion protein to increase expression, or for other reasons. For example, when the CA protein is
a CA peptide, the nucleic acid encoding the peptide may be linked to other nucleic acid for
expression purposes.

In one embodiment, the CA nucleic acids, proteins and antibodies of the invention are labeled. By 
"labeled" herein is meant that a compound has at least one element, isotope or chemical compound 
attached to enable the detection of the compound. In general, labels fall into three classes: a) isotopic 
labels, which may be radioactive or heavy isotopes; b) immune labels, which may be antibodies or 
antigens; and c) colored or fluorescent dyes. The labels may be incorporated into the CA nucleic 
acids, proteins and antibodies at any position. The label should be capable of producing,
either directly or indirectly, a detectable signal. The detectable moiety may be a radioisotope, such as
3H, 14C, 32P, 35S, or 125I, a fluorescent or chemiluminescent compound, such as fluorescein
isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline phosphatase,
beta-galactosidase or horseradish peroxidase. Any method known in the art for conjugating the antibody to
the label may be employed, including those methods described by Hunter et al., Nature, 144:945
(1962); David et al., Biochemistry, 13:1014 (1974); Pain et al., J. Immunol. Meth., 40:219 (1981); and
Nygren, J. Histochem. and Cytochem., 30:407 (1982). 

Accordingly, the present invention also provides CA protein sequences. A CA protein of the present
invention may be identified in several ways. "Protein" in this sense includes proteins, polypeptides, 
and peptides. As will be appreciated by those in the art, the nucleic acid sequences of the invention 
can be used to generate protein sequences. There are a variety of ways to do this, including cloning 
the entire gene and verifying its frame and amino acid sequence, or by comparing it to known 
sequences to search for homology to provide a frame, assuming the CA protein has homology to 
some protein in the database being used. Generally, the nucleic acid sequences are input into a
program that will search all three frames for homology. This is done in a preferred embodiment using
the following NCBI Advanced BLAST parameters. The program is blastx or blastn. The database is
nr. The input data is as "Sequence in FASTA format". The organism list is "none". The "expect" is
10; the filter is default. The "descriptions" is 500, the "alignments" is 500, and the "alignment view" is
pairwise. The "query Genetic Codes" is standard (1). The matrix is BLOSUM62; gap existence cost is
11, per residue gap cost is 1; and the lambda ratio is 0.85 default. This results in the generation of a
putative protein sequence. 
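The translation step that underlies a frame search can be illustrated with the following Python sketch, which translates the three forward reading frames of a nucleotide sequence using the standard genetic code (code 1). It is a minimal sketch only: the function name is hypothetical, and the homology search against a database that actually selects the frame, as performed by blastx, is not reproduced here.

```python
from itertools import product
from typing import List

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate_frames(seq: str) -> List[str]:
    """Translate the three forward reading frames of a DNA sequence.

    Stop codons are shown as "*"; codons with unrecognized bases as "X".
    Trailing bases that do not complete a codon are dropped.
    """
    dna = seq.upper().replace("U", "T")
    frames = []
    for offset in range(3):
        codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
        frames.append("".join(CODON_TABLE.get(c, "X") for c in codons))
    return frames
```

For the input ATGGCCTAA, the three frames translate to MA*, WP and GL; each candidate translation could then be compared against known protein sequences to identify the frame showing homology.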

Also included within one embodiment of CA proteins are amino acid variants of the naturally occurring
sequences, as determined herein. Preferably, the variants are greater than about 75%
homologous to the wild-type sequence, more preferably greater than about 80%, even more preferably 
greater than about 85% and most preferably greater than 90%. In some embodiments the homology 
will be as high as about 93 to 95 or 98%. As for nucleic acids, homology in this context means 
sequence similarity or identity, with identity being preferred. This homology will be determined using 
standard techniques known in the art as are outlined above for the nucleic acid homologies. 
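
As a minimal sketch, and assuming that the two sequences have already been aligned to equal length by one of the standard techniques referred to above, the percent identity and the strictest of the stated thresholds that a variant exceeds may be computed as follows. The function names, and the choice to count gap columns as mismatches over the full alignment length, are assumptions of this example rather than a definition taken from the specification.

```python
# Thresholds recited above: greater than about 75%, 80%, 85% and 90%.
IDENTITY_THRESHOLDS = (75, 80, 85, 90)


def percent_identity(aligned_a, aligned_b):
    """Percent identity of two pre-aligned sequences ('-' marks a gap).

    Assumption of this sketch: gap columns count as mismatches and the
    denominator is the full alignment length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def highest_threshold_exceeded(identity):
    """Return the largest recited threshold that `identity` exceeds, or None."""
    passed = [t for t in IDENTITY_THRESHOLDS if identity > t]
    return max(passed) if passed else None
```

Other conventions for the denominator (for example, the length of the shorter sequence) are in common use and would give different values for gapped alignments.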

CA proteins of the present invention may be shorter or longer than the wild type amino acid 
sequences. Thus, in a preferred embodiment, included within the definition of CA proteins are 
portions or fragments of the wild type sequences herein. In addition, as outlined above, the CA 
nucleic acids of the invention may be used to obtain additional coding regions, and thus additional 
protein sequence, using techniques known in the art. 

In a preferred embodiment, the CA proteins are derivative or variant CA proteins as compared to the 
wild-type sequence. That is, as outlined more fully below, the derivative CA peptide will contain at 
least one amino acid substitution, deletion or insertion, with amino acid substitutions being particularly 
preferred. The amino acid substitution, insertion or deletion may occur at any residue within the CA 
peptide. 

Also included in an embodiment of CA proteins of the present invention are amino acid sequence 
variants. These variants fall into one or more of three classes: substitutional, insertional or deletional 
variants. These variants ordinarily are prepared by site specific mutagenesis of nucleotides in the 
DNA encoding the CA protein, using cassette or PCR mutagenesis or other techniques well known in 
the art, to produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell 
culture as outlined above. However, variant CA protein fragments having up to about 100-150 
residues may be prepared by in vitro synthesis using established techniques. Amino acid sequence 
variants are characterized by the predetermined nature of the variation, a feature that sets them apart 
from naturally occurring allelic or interspecies variation of the CA protein amino acid sequence. The 
variants typically exhibit the same qualitative biological activity as the naturally occurring analogue, 
although variants can also be selected which have modified characteristics as will be more fully 
outlined below. 

While the site or region for introducing an amino acid sequence variation is predetermined, the 
mutation per se need not be predetermined. For example, in order to optimize the performance of a 
mutation at a given site, random mutagenesis may be conducted at the target codon or region and the 
expressed CA variants screened for the optimal combination of desired activity. Techniques for 
making substitution mutations at predetermined sites in DNA having a known sequence are well 
known, for example, M13 primer mutagenesis and PCR mutagenesis. Screening of the mutants is 
done using assays of CA protein activities. 

Amino acid substitutions are typically of single residues; insertions usually will be on the order of from 
about 1 to 20 amino acids, although considerably larger insertions may be tolerated. Deletions range 
from about 1 to about 20 residues, although in some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to arrive at a final 
derivative. Generally these changes are done on a few amino acids to minimize the alteration of the 
molecule. However, larger changes may be tolerated in certain circumstances. When small 
alterations in the characteristics of the CA protein are desired, substitutions are generally made in 
accordance with the following chart: 

Substantial changes in function or immunological identity are made by selecting substitutions that are 
less conservative than those shown in Chart I. For example, substitutions may be made which more 
significantly affect: the structure of the polypeptide backbone in the area of the alteration, for example 
the alpha-helical or beta-sheet structure; the charge or hydrophobicity of the molecule at the target 
site; or the bulk of the side chain. The substitutions which in general are expected to produce the 
greatest changes in the polypeptide's properties are those in which (a) a hydrophilic residue, e.g. seryl 
or threonyl is substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or 
alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an 
electropositive side chain, e.g. lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative 
residue, e.g. glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g. phenylalanine, is 
substituted for (or by) one not having a side chain, e.g. glycine. 
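
The four classes (a) through (d) above can be expressed, purely as an illustrative sketch, as a lookup over one-letter amino acid codes. The residue sets below contain only the examples named in the text (each is introduced there by "e.g.", so the sets are not exhaustive), and the function name `disruptive_classes` is hypothetical.

```python
# Residue sets limited to the examples named in the text (one-letter codes).
HYDROPHILIC = {"S", "T"}                 # seryl, threonyl
HYDROPHOBIC = {"L", "I", "F", "V", "A"}  # leucyl, isoleucyl, phenylalanyl, valyl, alanyl
CYS_PRO = {"C", "P"}                     # cysteine, proline
ELECTROPOSITIVE = {"K", "R", "H"}        # lysyl, arginyl, histidyl
ELECTRONEGATIVE = {"E", "D"}             # glutamyl, aspartyl
BULKY = {"F"}                            # phenylalanine
NO_SIDE_CHAIN = {"G"}                    # glycine


def _between(a, b, set_1, set_2):
    """True if one residue is in set_1 and the other in set_2 ("for (or by)")."""
    return (a in set_1 and b in set_2) or (a in set_2 and b in set_1)


def disruptive_classes(wild_type, variant):
    """Return which of classes (a)-(d) a single substitution falls into."""
    if wild_type == variant:
        return []
    classes = []
    if _between(wild_type, variant, HYDROPHILIC, HYDROPHOBIC):
        classes.append("a")
    if wild_type in CYS_PRO or variant in CYS_PRO:
        classes.append("b")
    if _between(wild_type, variant, ELECTROPOSITIVE, ELECTRONEGATIVE):
        classes.append("c")
    if _between(wild_type, variant, BULKY, NO_SIDE_CHAIN):
        classes.append("d")
    return classes
```

An empty result indicates only that the substitution is not among the named examples; it does not by itself establish that the substitution is conservative.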

The variants typically exhibit the same qualitative biological activity and will elicit the same immune 
response as the naturally-occurring analogue, although variants also are selected to modify the 
characteristics of the CA proteins as needed. Alternatively, the variant may be designed such that the 
biological activity of the CA protein is altered. For example, glycosylation sites may be altered or 
removed, dominant negative mutations created, etc. 



Covalent modifications of CA polypeptides are included within the scope of this invention, for example 
for use in screening. One type of covalent modification includes reacting targeted amino acid residues 
of a CA polypeptide with an organic derivatizing agent that is capable of reacting with selected side 
chains or the N- or C-terminal residues of a CA polypeptide. Derivatization with bifunctional agents is 
useful, for instance, for crosslinking CA polypeptides to a water-insoluble support matrix or surface for 
use in the method for purifying anti-CA antibodies or screening assays, as is more fully described 
below. Commonly used crosslinking agents include, e.g., 1,1-bis(diazoacetyl)-2-phenylethane, 
glutaraldehyde, N-hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, 
homobifunctional imidoesters, including disuccinimidyl esters such as 
3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N-maleimido-1,8-octane and 
agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding 
glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of 
hydroxyl groups of seryl, threonyl or tyrosyl residues, methylation of the α-amino groups of lysine, 
arginine, and histidine side chains [T.E. Creighton, Proteins: Structure and Molecular Properties, W.H. 
Freeman & Co., San Francisco, pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation 
of any C-terminal carboxyl group. 

Another type of covalent modification of the CA polypeptide included within the scope of this invention 
comprises altering the native glycosylation pattern of the polypeptide. "Altering the native glycosylation 
pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties found in 
the native sequence CA polypeptide, and/or adding one or more glycosylation sites that are not present in 
the native sequence CA polypeptide. 

Addition of glycosylation sites to CA polypeptides may be accomplished by altering the amino acid 
sequence thereof. The alteration may be made, for example, by the addition of, or substitution by, one 
or more serine or threonine residues to the native sequence CA polypeptide (for O-linked glycosylation 
sites). The CA amino acid sequence may optionally be altered through changes at the DNA level, 
particularly by mutating the DNA encoding the CA polypeptide at preselected bases such that codons 
are generated that will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the CA polypeptide is by 
chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the 
art, e.g., in WO 87/05330 published 11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. 
Biochem., pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the CA polypeptide may be accomplished chemically or 
enzymatically or by mutational substitution of codons encoding for amino acid residues that serve as 
targets for glycosylation. Chemical deglycosylation techniques are known in the art and described, for 
instance, by Hakimuddin, et al., Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al., Anal. 
Biochem., 118:131 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be 
achieved by the use of a variety of endo- and exo-glycosidases as described by Thotakura et al., Meth. 
Enzymol., 138:350 (1987). 

Another type of covalent modification of CA comprises linking the CA polypeptide to one of a variety of 
nonproteinaceous polymers, e.g., polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the 
manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 
4,179,337. 

CA polypeptides of the present invention may also be modified in a way to form chimeric molecules 
comprising a CA polypeptide fused to another, heterologous polypeptide or amino acid sequence. In 
one embodiment, such a chimeric molecule comprises a fusion of a CA polypeptide with a tag 
polypeptide which provides an epitope to which an anti-tag antibody can selectively bind. The epitope 
tag is generally placed at the amino- or carboxyl-terminus of the CA polypeptide, although internal 
fusions may also be tolerated in some instances. The presence of such epitope-tagged forms of a 
CA polypeptide can be detected using an antibody against the tag polypeptide. Also, provision of the 
epitope tag enables the CA polypeptide to be readily purified by affinity purification using an anti-tag 
antibody or another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a CA polypeptide with an immunoglobulin or a 
particular region of an immunoglobulin. For a bivalent form of the chimeric molecule, such a fusion 
could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known in the art. Examples include 
poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and its 
antibody 12CA5 [Field et al., Mol. Cell. Biol., 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 
6E10, G4, B7 and 9E10 antibodies thereto [Evan et al., Molecular and Cellular Biology, 5:3610-3616 
(1985)]; and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al., 
Protein Engineering, 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp et 
al., BioTechnology, 6:1204-1210 (1988)]; the KT3 epitope peptide [Martin et al., Science, 255:192-194 
(1992)]; tubulin epitope peptide [Skinner et al., J. Biol. Chem., 266:15163-15166 (1991)]; and the T7 
gene 10 protein peptide tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA, 87:6393-6397 (1990)]. 

Also included within the definition of CA protein in one embodiment are other CA proteins of the CA 
family, and CA proteins from other organisms, which are cloned and expressed as outlined below. 
Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be used to find 
other related CA proteins from humans or other organisms. As will be appreciated by those in the art, 
particularly useful probe and/or PCR primer sequences include the unique areas of the CA nucleic 
acid sequence. As is generally known in the art, preferred PCR primers are from about 15 to about 35 
nucleotides in length, with from about 20 to about 30 being preferred, and may contain inosine as 
needed. The conditions for the PCR reaction are well known in the art. 
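
As a simple sketch of the length guidance above, a candidate primer may be checked against the recited ranges. The specification says "about", so the hard cutoffs used here are an assumption of this example, as are the function name `primer_length_class` and the use of the letter "I" to denote inosine.

```python
# Allowed symbols: the four DNA bases plus "I" for inosine (assumed notation).
ALLOWED_SYMBOLS = set("ACGTI")


def primer_length_class(primer):
    """Classify a primer by length against the ranges recited in the text.

    Returns "preferred" for 20-30 nucleotides, "acceptable" for 15-35,
    and "outside range" otherwise. Cutoffs are treated as exact here,
    although the text describes them as approximate.
    """
    seq = primer.upper()
    invalid = set(seq) - ALLOWED_SYMBOLS
    if invalid:
        raise ValueError(f"unexpected symbols in primer: {sorted(invalid)}")
    length = len(seq)
    if 20 <= length <= 30:
        return "preferred"
    if 15 <= length <= 35:
        return "acceptable"
    return "outside range"
```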

In addition, as is outlined herein, CA proteins can be made that are longer than those encoded by the 
nucleic acids of the figures, for example, by the elucidation of additional sequences, the addition of 
epitope or purification tags, the addition of other fusion sequences, etc. 

CA proteins may also be identified as being encoded by CA nucleic acids. Thus, CA proteins are 
encoded by nucleic acids that will hybridize to the sequences of the sequence listings, or their 
complements, as outlined herein. 



In a preferred embodiment, the invention provides CA antibodies. In a preferred embodiment, when 
the CA protein is to be used to generate antibodies, for example for immunotherapy, the CA protein 
should share at least one epitope or determinant with the full length protein. By "epitope" or 
"determinant" herein is meant a portion of a protein which will generate and/or bind an antibody or T- 
cell receptor in the context of MHC. Thus, in most instances, antibodies made to a smaller CA protein 
will be able to bind to the full length protein. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. 

In one embodiment, the term "antibody" includes antibody fragments, as are known in the art, 
including Fab, Fab2, single chain antibodies (Fv for example), chimeric antibodies, etc., either 
produced by the modification of whole antibodies or those synthesized de novo using recombinant 
DNA technologies. 

Methods of preparing polyclonal antibodies are known to the skilled artisan. Polyclonal antibodies can 
be raised in a mammal, for example, by one or more injections of an immunizing agent and, if desired, 
an adjuvant. Typically, the immunizing agent and/or adjuvant will be injected in the mammal by 
multiple subcutaneous or intraperitoneal injections. The immunizing agent may include a protein 
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It may be 
useful to conjugate the immunizing agent to a protein known to be immunogenic in the mammal being 
immunized. Examples of such immunogenic proteins include but are not limited to keyhole limpet 
hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Examples of 
adjuvants which may be employed include Freund's complete adjuvant and MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). The immunization protocol may be 
selected by one skilled in the art without undue experimentation. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may be prepared 
using hybridoma methods, such as those described by Kohler and Milstein, Nature, 256:495 (1975). 
In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized 
with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies 
that will specifically bind to the immunizing agent. Alternatively, the lymphocytes may be immunized in 
vitro. The immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1- 
10, or fragment thereof or a fusion protein thereof. Generally, either peripheral blood lymphocytes 
("PBLs") are used if cells of human origin are desired, or spleen cells or lymph node cells are used if 
non-human mammalian sources are desired. The lymphocytes are then fused with an immortalized 
cell line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell [Goding, 
Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) pp. 59-103]. Immortalized 
cell lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and 
human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may be 
cultured in a suitable culture medium that preferably contains one or more substances that inhibit the 
growth or survival of the unfused, immortalized cells. For example, if the parental cells lack the 
enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for 
the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT-deficient cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are monoclonal, 
preferably human or humanized, antibodies that have binding specificities for at least two different 
antigens. In the present case, one of the binding specificities is for a protein encoded by a nucleic 
acid of Tables 1, or a fragment thereof; the other one is for any other antigen, and preferably for a cell- 
surface protein or receptor or receptor subunit, preferably one that is tumor specific. 

In a preferred embodiment, the antibodies to CA are capable of reducing or eliminating the biological 
function of CA, as is described below. That is, the addition of anti-CA antibodies (either polyclonal or 
preferably monoclonal) to CA (or cells containing CA) may reduce or eliminate the CA activity. 
Generally, at least a 25% decrease in activity is preferred, with at least about 50% being particularly 
preferred and about a 95-100% decrease being especially preferred. 
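
The percentage decrease in activity referred to above may be computed, as a sketch, from activity measured without and with the antibody. The function names and the treatment of the "about" figures as exact cutoffs are assumptions of this example; the activity assay itself is outside its scope.

```python
def percent_decrease(baseline_activity, activity_with_antibody):
    """Percent decrease in CA activity relative to the antibody-free baseline."""
    if baseline_activity <= 0:
        raise ValueError("baseline activity must be positive")
    return 100.0 * (baseline_activity - activity_with_antibody) / baseline_activity


def preference_tier(decrease):
    """Map a percent decrease onto the tiers recited in the text.

    The cutoffs (25%, 50%, 95%) are treated as exact for illustration.
    """
    if decrease >= 95:
        return "especially preferred"
    if decrease >= 50:
        return "particularly preferred"
    if decrease >= 25:
        return "preferred"
    return "below threshold"
```

For example, an activity that falls from 200 units to 100 units in the presence of the antibody corresponds to a 50% decrease.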

In a preferred embodiment the antibodies to the CA proteins are humanized antibodies. Humanized 
forms of non-human (e.g., murine) antibodies are chimeric molecules of immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen binding 
subsequences of antibodies) which contain minimal sequence derived from non-human 
immunoglobulin. Humanized antibodies include human immunoglobulins (recipient antibody) in which 
residues from a complementarity determining region (CDR) of the recipient are replaced by residues 
from a CDR of a non-human species (donor antibody) such as mouse, rat or rabbit having the desired 
specificity, affinity and capacity. In some instances, Fv framework residues of the human 
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies may also 
comprise residues which are found neither in the recipient antibody nor in the imported CDR or 
framework sequences. In general, the humanized antibody will comprise substantially all of at least 
one, and typically two, variable domains, in which all or substantially all of the CDR regions correspond 
to those of a non-human immunoglobulin and all or substantially all of the framework (FR) 
regions are those of a human immunoglobulin consensus sequence. The humanized antibody 
optimally also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that 
of a human immunoglobulin [Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 
332:323-329 (1988); and Presta, Curr. Op. Struct. Biol., 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized 
antibody has one or more amino acid residues introduced into it from a source which is non-human. 
These non-human amino acid residues are often referred to as import residues, which are typically 
taken from an import variable domain. Humanization can be essentially performed following the 
method of Winter and co-workers [Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 
332:323-327 (1988); Verhoeyen et al., Science, 239:1534-1536 (1988)], by substituting rodent CDRs 
or CDR sequences for the corresponding sequences of a human antibody. Accordingly, such 
humanized antibodies are chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less 
than an intact human variable domain has been substituted by the corresponding sequence from a 
non-human species. In practice, humanized antibodies are typically human antibodies in which some 
CDR residues and possibly some FR residues are substituted by residues from analogous sites in 
rodent antibodies. 

Human antibodies can also be produced using various techniques known in the art, including phage 
display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991); Marks et al., J. Mol. Biol., 
222:581 (1991)]. The techniques of Cole et al. and Boerner et al. are also available for the preparation 
of human monoclonal antibodies [Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. 
Liss, p. 77 (1985) and Boerner et al., J. Immunol., 147(1):86-95 (1991)]. Similarly, human antibodies 
can be made by introducing human immunoglobulin loci into transgenic animals, e.g., mice in which 
the endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in humans in all 
respects, including gene rearrangement, assembly, and antibody repertoire. This approach is 
described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 
5,661,016, and in the following scientific publications: Marks et al., Bio/Technology 10, 779-783 
(1992); Lonberg et al., Nature 368, 856-859 (1994); Morrison, Nature 368, 812-13 (1994); Fishwild et 
al., Nature Biotechnology 14, 845-51 (1996); Neuberger, Nature Biotechnology 14, 826 (1996); 
Lonberg and Huszar, Intern. Rev. Immunol. 13, 65-93 (1995). 

By immunotherapy is meant treatment of a carcinoma with an antibody raised against a CA protein. 
As used herein, immunotherapy can be passive or active. Passive immunotherapy as defined herein 
is the passive transfer of antibody to a recipient (patient). Active immunization is the induction of 
antibody and/or T-cell responses in a recipient (patient). Induction of an immune response is the 
result of providing the recipient with an antigen to which antibodies are raised. As appreciated by one 
of ordinary skill in the art, the antigen may be provided by injecting a polypeptide against which 
antibodies are desired to be raised into a recipient, or contacting the recipient with a nucleic acid 
capable of expressing the antigen and under conditions for expression of the antigen. 

In a preferred embodiment, oncogenes which encode secreted growth factors may be inhibited by 
raising antibodies against CA proteins that are secreted proteins as described above. Without being 
bound by theory, antibodies used for treatment bind the secreted protein and prevent it from binding to 
its receptor, thereby inactivating the secreted CA protein. 

In another preferred embodiment, the CA protein to which antibodies are raised is a transmembrane 
protein. Without being bound by theory, antibodies used for treatment bind the extracellular domain 
of the CA protein and prevent it from binding to other proteins, such as circulating ligands or cell- 
associated molecules. The antibody may cause down-regulation of the transmembrane CA protein. 
As will be appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 
competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the CA protein. 
The antibody is also an antagonist of the CA protein. Further, the antibody prevents activation of the 
transmembrane CA protein. In one aspect, when the antibody prevents the binding of other molecules 
to the CA protein, the antibody prevents growth of the cell. The antibody may also sensitize the cell to 
cytotoxic agents, including, but not limited to TNF-α, TNF-β, IL-1, INF-γ and IL-2, or 
chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, methotrexate, and the 
like. In some instances the antibody belongs to a sub-type that activates serum complement when 
complexed with the transmembrane protein thereby mediating cytotoxicity. Thus, carcinomas may be 
treated by administering to a patient antibodies directed against the transmembrane CA protein. 

In another preferred embodiment, the antibody is conjugated to a therapeutic moiety. In one aspect 
the therapeutic moiety is a small molecule that modulates the activity of the CA protein. In another 
aspect the therapeutic moiety modulates the activity of molecules associated with or in close proximity 
to the CA protein. The therapeutic moiety may inhibit enzymatic activity such as protease or protein 
kinase activity associated with carcinoma. 

In a preferred embodiment, the therapeutic moiety may also be a cytotoxic agent. In this method, 
targeting the cytotoxic agent to tumor tissue or cells results in a reduction in the number of afflicted 
cells, thereby reducing symptoms associated with carcinomas, including lymphoma. Cytotoxic agents 
are numerous and varied and include, but are not limited to, cytotoxic drugs or toxins or active 
fragments of such toxins. Suitable toxins and their corresponding fragments include diphtheria A 
chain, exotoxin A chain, ricin A chain, abrin A chain, curcin, crotin, phenomycin, enomycin and the like. 
Cytotoxic agents also include radiochemicals made by conjugating radioisotopes to antibodies raised 
against CA proteins, or binding of a radionuclide to a chelating agent that has been covalently 
attached to the antibody. Targeting the therapeutic moiety to transmembrane CA proteins not only 
serves to increase the local concentration of therapeutic moiety in the carcinoma of interest, i.e., 
lymphoma, but also serves to reduce deleterious side effects that may be associated with the 
therapeutic moiety. 

In another preferred embodiment, the CA protein against which the antibodies are raised is an 
intracellular protein. In this case, the antibody may be conjugated to a protein which facilitates entry 
into the cell. In one case, the antibody enters the cell by endocytosis. In another embodiment, a 
nucleic acid encoding the antibody is administered to the individual or cell. Moreover, wherein the CA 
protein can be targeted within a cell, i.e., the nucleus, an antibody thereto contains a signal for that 
target localization, i.e., a nuclear localization signal. 

The CA antibodies of the invention specifically bind to CA proteins. By "specifically bind" herein is 
meant that the antibodies bind to the protein with a binding constant in the range of at least 10⁻⁴ - 10⁻⁶ 
M⁻¹, with a preferred range being 10⁻⁷ - 10⁻⁹ M⁻¹. 

In a preferred embodiment, the CA protein is purified or isolated after expression. CA proteins may be 
isolated or purified in a variety of ways known to those skilled in the art depending on what other 
components are present in the sample. Standard purification methods include electrophoretic, 
molecular, immunological and chromatographic techniques, including ion exchange, hydrophobic, 
affinity, and reverse-phase HPLC chromatography, and chromatofocusing. For example, the CA 
protein may be purified using a standard anti-CA antibody column. Ultrafiltration and diafiltration 
techniques, in conjunction with protein concentration, are also useful. For general guidance in suitable 
purification techniques, see Scopes, R., Protein Purification, Springer-Verlag, NY (1982). The degree 
of purification necessary will vary depending on the use of the CA protein. In some instances no 
purification will be necessary. 

Once expressed and purified if necessary, the CA proteins and nucleic acids are useful in a number of 
applications. 

In one aspect, the expression levels of genes are determined for different cellular states in the 
carcinoma phenotype; that is, the expression levels of genes in normal tissue and in carcinoma tissue 
(and in some cases, for varying severities of lymphoma that relate to prognosis, as outlined below) are 
evaluated to provide expression profiles. An expression profile of a particular cell state or point of 
development is essentially a "fingerprint" of the state; while two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the generation of a 
gene expression profile that is unique to the state of the cell. By comparing expression profiles of cells 
in different states, information regarding which genes are important (including both up- and down- 
regulation of genes) in each of these states is obtained. Then, diagnosis may be done or confirmed: 
does tissue from a particular patient have the gene expression profile of normal or carcinoma tissue? 
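
One way to picture the comparison of a patient profile with reference profiles is sketched below. Pearson correlation is used here only as an illustrative similarity measure; the specification does not prescribe any particular statistic, and the function names and the toy expression values in the usage note are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length expression vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("profiles must have the same length (at least 2 genes)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    covariance = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("a profile with no variation cannot be correlated")
    return covariance / (spread_x * spread_y)


def classify_profile(sample, references):
    """Return the name of the reference profile most correlated with `sample`.

    `references` maps a state name (e.g. "normal", "carcinoma") to the
    expression levels of the same genes, in the same order, as `sample`.
    """
    return max(references, key=lambda name: pearson(sample, references[name]))
```

For instance, with hypothetical four-gene references `{"normal": [1.0, 5.0, 2.0, 8.0], "carcinoma": [6.0, 1.0, 7.0, 2.0]}`, a sample of `[5.5, 1.2, 6.8, 2.5]` is assigned to the carcinoma profile.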

"Differential expression," or grammatical equivalents as used herein, refers to both qualitative as well 
as quantitative differences in the genes' temporal and/or cellular expression patterns within and among 
the cells. Thus, a differentially expressed gene can qualitatively have its expression altered, including 
an activation or inactivation, in, for example, normal versus carcinoma tissue. That is, genes may be 
turned on or turned off in a particular state, relative to another state. As is apparent to the skilled 
artisan, any comparison of two or more states can be made. Such a qualitatively regulated gene will 
exhibit an expression pattern within a state or cell type which is detectable by standard techniques in 
one such state or cell type, but is not detectable in both. Alternatively, the determination is quantitative 
in that expression is increased or decreased; that is, the expression of the gene is either upregulated, 
resulting in an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via standard 
characterization techniques as outlined below, such as by use of Affymetrix GeneChip® expression 
arrays, Lockhart, Nature Biotechnology, 14:1675-1680 (1996), hereby expressly incorporated by 
reference. Other techniques include, but are not limited to, quantitative reverse transcriptase PCR, 
Northern analysis and RNase protection. As outlined above, preferably the change in expression (i.e. 
upregulation or downregulation) is at least about 50%, more preferably at least about 100%, more 
preferably at least about 150%, more preferably, at least about 200%, with from 300 to at least 1000% 
being especially preferred. 
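
The percentage change in expression recited above can be illustrated with a short sketch. It assumes that the change is expressed relative to the level in normal tissue and that the "about" figures are treated as exact cutoffs; both choices, and the function names, are assumptions of this example.

```python
# Thresholds recited in the text, in percent.
CHANGE_THRESHOLDS = (50, 100, 150, 200, 300)


def percent_change(normal_level, carcinoma_level):
    """Percent change in expression relative to normal tissue.

    Positive values indicate upregulation, negative values downregulation.
    """
    if normal_level <= 0:
        raise ValueError("normal expression level must be positive")
    return 100.0 * (carcinoma_level - normal_level) / normal_level


def highest_threshold_met(normal_level, carcinoma_level):
    """Return the largest recited threshold met by the change, or None."""
    magnitude = abs(percent_change(normal_level, carcinoma_level))
    met = [t for t in CHANGE_THRESHOLDS if magnitude >= t]
    return max(met) if met else None
```

Under this definition a decrease can never exceed 100%, so the larger thresholds are reachable only by upregulation; expressing downregulation as a fold change relative to the carcinoma level would be an alternative convention.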

As will be appreciated by those in the art, this may be done by evaluation at either the gene transcript, 
or the protein level; that is, the amount of gene expression may be monitored using nucleic acid 
probes to the DNA or RNA equivalent of the gene transcript, and the quantification of gene expression 
levels, or, alternatively, the final gene product itself (protein) can be monitored, for example through 
the use of antibodies to the CA protein and standard immunoassays (ELISAs, etc.) or other 
techniques, including mass spectroscopy assays, 2D gel electrophoresis assays, etc. Thus, the 
proteins corresponding to CA genes, i.e. those identified as being important in a particular carcinoma 
phenotype, i.e., lymphoma, can be evaluated in a diagnostic test specific for that carcinoma. 

In a preferred embodiment, gene expression monitoring is done and a number of genes, i.e. an 
expression profile, is monitored simultaneously, although multiple protein expression monitoring can 
be done as well. Similarly, these assays may be done on an individual basis as well. 

In this embodiment, the CA nucleic acid probes may be attached to biochips as outlined herein for the 
detection and quantification of CA sequences in a particular cell. The assays are done as is known in 
the art. As will be appreciated by those in the art, any number of different CA sequences may be used 
as probes, with single sequence assays being used in some cases, and a plurality of the sequences 
described herein being used in other embodiments. In addition, while solid-phase assays are 
described, any number of solution based assays may be done as well. 



In a preferred embodiment, both solid and solution based assays may be used to detect CA 
sequences that are up-regulated or down-regulated in carcinomas as compared to normal tissue. In 
instances where the CA sequence has been altered but shows the same expression profile or an 
altered expression profile, the protein will be detected as outlined herein. 

In a preferred embodiment nucleic acids encoding the CA protein are detected. Although DNA or 
RNA encoding the CA protein may be detected, of particular interest are methods wherein the mRNA 
encoding a CA protein is detected. The presence of mRNA in a sample is an indication that the CA 
gene has been transcribed to form the mRNA, and suggests that the protein is expressed. Probes to 
detect the mRNA can be any nucleotide/deoxynucleotide probe that is complementary to and base 
pairs with the mRNA and includes but is not limited to oligonucleotides, cDNA or RNA. Probes also 
should contain a detectable label, as defined herein. In one method the mRNA is detected after 
immobilizing the nucleic acid to be examined on a solid support such as nylon membranes and 
hybridizing the probe with the sample. Following washing to remove the non-specifically bound probe, 
the label is detected. In another method detection of the mRNA is performed in situ. In this method 
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid probe for 
sufficient time to allow the probe to hybridize with the target mRNA. Following washing to remove the 
non-specifically bound probe, the label is detected. For example a digoxigenin labeled riboprobe 
(RNA probe) that is complementary to the mRNA encoding a CA protein is detected by binding the 
digoxigenin with an anti-digoxigenin secondary antibody and developed with nitro blue tetrazolium 
and 5-bromo-4-chloro-3-indolyl phosphate. 

In a preferred embodiment, any of the three classes of proteins as described herein (secreted, 
transmembrane or intracellular proteins) are used in diagnostic assays. The CA proteins, antibodies, 
nucleic acids, modified proteins and cells containing CA sequences are used in diagnostic assays. 
This can be done on an individual gene or corresponding polypeptide level, or as sets of assays. 

As described and defined herein, CA proteins find use as markers of carcinomas, including 
lymphomas such as, but not limited to, Hodgkin's and non-Hodgkin lymphoma. Detection of these 
proteins in putative carcinoma tissue or patients allows for a determination or diagnosis of the type of 
carcinoma. Numerous methods known to those of ordinary skill in the art find use in detecting 
carcinomas. In one embodiment, antibodies are used to detect CA proteins. A preferred method 
separates proteins from a sample or patient by electrophoresis on a gel (typically a denaturing and 
reducing protein gel, but may be any other type of gel including isoelectric focusing gels and the like). 
Following separation of proteins, the CA protein is detected by immunoblotting with antibodies raised 
against the CA protein. Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the CA protein find use in in situ imaging techniques. In 
this method cells are contacted with from one to many antibodies to the CA protein(s). Following 
washing to remove non-specific antibody binding, the presence of the antibody or antibodies is 
detected. In one embodiment the antibody is detected by incubating with a secondary antibody that 
contains a detectable label. In another method the primary antibody to the CA protein(s) contains a 
detectable label. In another preferred embodiment each one of multiple primary antibodies contains a 
distinct and detectable label. This method finds particular use in simultaneous screening for a plurality 
of CA proteins. As will be appreciated by one of ordinary skill in the art, numerous other histological 
imaging techniques are useful in the invention. 



In a preferred embodiment the label is detected in a fluorometer which has the ability to detect and 
distinguish emissions of different wavelengths. In addition, a fluorescence activated cell sorter (FACS) 
can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing carcinomas from blood samples. 
As previously described, certain CA proteins are secreted/circulating molecules. Blood samples, 
therefore, are useful as samples to be probed or tested for the presence of secreted CA proteins. 
Antibodies can be used to detect the CA proteins by any of the previously described immunoassay 
techniques including ELISA, immunoblotting (Western blotting), immunoprecipitation, BIACORE 
technology and the like, as will be appreciated by one of ordinary skill in the art. 

In a preferred embodiment, in situ hybridization of labeled CA nucleic acid probes to tissue arrays is 
done. For example, arrays of tissue samples, including CA tissue and/or normal tissue, are made. In 
situ hybridization as is known in the art can then be done. 

It is understood that when comparing the expression fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis as well as a prognosis. It is further understood that 
the genes which indicate the diagnosis may differ from those which indicate the prognosis. 

In a preferred embodiment, the CA proteins, antibodies, nucleic acids, modified proteins and cells 
containing CA sequences are used in prognosis assays. As above, gene expression profiles can be 
generated that correlate to carcinoma, especially lymphoma, severity, in terms of long term prognosis. 
Again, this may be done on either a protein or gene level, with the use of genes being preferred. As 
above, the CA probes are attached to biochips for the detection and quantification of CA sequences in 
a tissue or patient. The assays proceed as outlined for diagnosis. 

In a preferred embodiment, any of the CA sequences as described herein are used in drug screening 
assays. The CA proteins, antibodies, nucleic acids, modified proteins and cells containing CA 
sequences are used in drug screening assays or by evaluating the effect of drug candidates on a 
"gene expression profile" or expression profile of polypeptides. In one embodiment, the expression 
profiles are used, preferably in conjunction with high throughput screening techniques to allow 
monitoring for expression profile genes after treatment with a candidate agent; see Zlokarnik, et al., 
Science 279, 84-8 (1998), Heid, et al., Genome Res., 6:986-994 (1996). 

In a preferred embodiment, the CA proteins, antibodies, nucleic acids, modified proteins and cells 
containing the native or modified CA proteins are used in screening assays. That is, the present 
invention provides novel methods for screening for compositions which modulate the carcinoma 
phenotype. As above, this can be done by screening for modulators of gene expression or for 
modulators of protein activity. Similarly, this may be done on an individual gene or protein level or by 
evaluating the effect of drug candidates on a "gene expression profile". In a preferred embodiment, 
the expression profiles are used, preferably in conjunction with high throughput screening techniques 
to allow monitoring for expression profile genes after treatment with a candidate agent, see Zlokarnik, 
supra. 

Having identified the CA genes herein, a variety of assays to evaluate the effects of agents on gene 
expression may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene as aberrantly regulated in carcinoma, 
candidate bioactive agents may be screened to modulate the gene's response. "Modulation" thus 
includes both an increase and a decrease in gene expression or activity. The preferred amount of 
modulation will depend on the original change of the gene expression in normal versus tumor tissue, 
with changes of at least 10%, preferably 50%, more preferably 100-300%, and in some embodiments 
300-1000% or greater. Thus, if a gene exhibits a 4 fold increase in tumor compared to normal tissue, 
a decrease of about four fold is desired; if a gene exhibits a 10 fold decrease in tumor compared to 
normal tissue, a 10 fold increase in expression by a candidate agent is desired, etc. Alternatively, where the CA 
sequence has been altered but shows the same expression profile or an altered expression profile, 
the protein will be detected as outlined herein. 
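
The fold-change reasoning above can be sketched as follows. This is a minimal illustration, not a prescribed method: the function name is hypothetical, and the sketch assumes that expression levels are positive numbers on a common scale and that the goal is to bring the tumor level back to the normal level.

```python
def desired_agent_fold_change(tumor_level: float, normal_level: float) -> float:
    """Fold change an agent would need to produce in tumor tissue to restore
    the normal expression level (assumption: full restoration is the goal).

    A value below 1 means a decrease is desired; above 1, an increase.
    """
    if tumor_level <= 0 or normal_level <= 0:
        raise ValueError("expression levels must be positive")
    return normal_level / tumor_level


# A gene 4-fold increased in tumor calls for about a 4-fold decrease.
four_fold_up = desired_agent_fold_change(tumor_level=4.0, normal_level=1.0)

# A gene 10-fold decreased in tumor calls for about a 10-fold increase.
ten_fold_down = desired_agent_fold_change(tumor_level=0.1, normal_level=1.0)
```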

As will be appreciated by those in the art, this may be done by evaluation at either the gene or the 
protein level; that is, the amount of gene expression may be monitored using nucleic acid probes and 
the quantification of gene expression levels, or, alternatively, the level of the gene product itself can be 
monitored, for example through the use of antibodies to the CA protein and standard immunoassays. 
Alternatively, binding and bioactivity assays with the protein may be done as outlined below. 

In a preferred embodiment, gene expression monitoring is done and a number of genes, i.e. an 
expression profile, is monitored simultaneously, although multiple protein expression monitoring can 
be done as well. 

In this embodiment, the CA nucleic acid probes are attached to biochips as outlined herein for the 
detection and quantification of CA sequences in a particular cell. The assays are further described 
below. 

Generally, in a preferred embodiment, a candidate bioactive agent is added to the cells prior to 
analysis. Moreover, screens are provided to identify a candidate bioactive agent which modulates a 
particular type of carcinoma, modulates CA proteins, binds to a CA protein, or interferes between the 
binding of a CA protein and an antibody. 

The term "candidate bioactive agent" or "drug candidate" or grammatical equivalents as used herein 
describes any molecule, e.g., protein, oligopeptide, small organic or inorganic molecule, 
polysaccharide, polynucleotide, etc., to be tested for bioactive agents that are capable of directly or 
indirectly altering either the carcinoma phenotype, binding to and/or modulating the bioactivity of a 
CA protein, or the expression of a CA sequence, including both nucleic acid sequences and protein 
sequences. In a particularly preferred embodiment, the candidate agent suppresses a CA phenotype, 
for example to a normal tissue fingerprint. Similarly, the candidate agent preferably suppresses a 
severe CA phenotype. Generally a plurality of assay mixtures are run in parallel with different agent 
concentrations to obtain a differential response to the various concentrations. Typically, one of these 
concentrations serves as a negative control, i.e., at zero concentration or below the level of detection. 
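
One way to read out such a parallel concentration series is sketched below, assuming a single numeric signal per assay mixture and a well at zero concentration serving as the negative control. The function name and the example readings are hypothetical.

```python
from typing import Dict


def response_relative_to_control(readings: Dict[float, float]) -> Dict[float, float]:
    """Express each reading as a fraction of the zero-concentration control.

    `readings` maps agent concentration to measured signal; the entry at
    concentration 0.0 is treated as the negative control (an assumption).
    """
    if 0.0 not in readings:
        raise ValueError("a zero-concentration negative control is required")
    control = readings[0.0]
    if control == 0:
        raise ValueError("control signal must be non-zero")
    return {
        concentration: signal / control
        for concentration, signal in sorted(readings.items())
        if concentration != 0.0
    }


# Hypothetical signals at 0, 1 and 10 concentration units.
example = response_relative_to_control({0.0: 200.0, 1.0: 150.0, 10.0: 50.0})
```

A falling fraction with rising concentration is the kind of differential response the parallel mixtures are meant to reveal.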

In one aspect, a candidate agent will neutralize the effect of a CA protein. By "neutralize" is meant 
that the activity of a protein is either inhibited or counteracted so as to have substantially no effect 
on a cell. 



Candidate agents encompass numerous chemical classes, though typically they are organic or 
inorganic molecules, preferably small organic compounds having a molecular weight of more than 100 
and less than about 2,500 daltons. Preferred small molecules are less than 2000, or less than 1500 
or less than 1000 or less than 500 D. Candidate agents comprise functional groups necessary for 
structural interaction with proteins, particularly hydrogen bonding, and typically include at least an 
amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional chemical groups. 
The candidate agents often comprise cyclical carbon or heterocyclic structures and/or aromatic or 
polyaromatic structures substituted with one or more of the above functional groups. Candidate 
agents are also found among biomolecules including peptides, saccharides, fatty acids, steroids, 
purines, pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred are 
peptides. 

Candidate agents are obtained from a wide variety of sources including libraries of synthetic or natural 
compounds. For example, numerous means are available for random and directed synthesis of a 
wide variety of organic compounds and biomolecules, including expression of randomized 
oligonucleotides. Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant 
and animal extracts are available or readily produced. Additionally, natural or synthetically produced 
libraries and compounds are readily modified through conventional chemical, physical and 
biochemical means. Known pharmacological agents may be subjected to directed or random 
chemical modifications, such as acylation, alkylation, esterification, amidification to produce structural 
analogs. 

In a preferred embodiment, the candidate bioactive agents are proteins. By "protein" herein is meant 
at least two covalently attached amino acids, which includes proteins, polypeptides, oligopeptides and 
peptides. The protein may be made up of naturally occurring amino acids and peptide bonds, or 
synthetic peptidomimetic structures. Thus "amino acid", or "peptide residue", as used herein means 
both naturally occurring and synthetic amino acids. For example, homo-phenylalanine, citrulline and 
norleucine are considered amino acids for the purposes of the invention. "Amino acid" also includes 
imino acid residues such as proline and hydroxyproline. The side chains may be in either the (R) or 
the (S) configuration. In the preferred embodiment, the amino acids are in the (S) or L-configuration. 
If non-naturally occurring side chains are used, non-amino acid substituents may be used, for example 
to prevent or retard in vivo degradations. 

In a preferred embodiment, the candidate bioactive agents are naturally occurring proteins or 
fragments of naturally occurring proteins. Thus, for example, cellular extracts containing proteins, or 
random or directed digests of proteinaceous cellular extracts, may be used. In this way libraries of 
procaryotic and eucaryotic proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian 
proteins, with the latter being preferred, and human proteins being especially preferred. 

In a preferred embodiment, the candidate bioactive agents are peptides of from about 5 to about 30 
amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 to about 15 
being particularly preferred. The peptides may be digests of naturally occurring proteins as is outlined 
above, random peptides, or "biased" random peptides. By "randomized" or grammatical equivalents 
herein is meant that each nucleic acid and peptide consists of essentially random nucleotides and 
amino acids, respectively. Since generally these random peptides (or nucleic acids, discussed below) 
are chemically synthesized, they may incorporate any nucleotide or amino acid at any position. The 
synthetic process can be designed to generate randomized proteins or nucleic acids, to allow the 
formation of all or most of the possible combinations over the length of the sequence, thus forming a 
library of randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence preferences or constants at any 
position. In a preferred embodiment, the library is biased. That is, some positions within the 
sequence are either held constant, or are selected from a limited number of possibilities. For 
example, in a preferred embodiment, the nucleotides or amino acid residues are randomized within a 
defined class, for example, of hydrophobic amino acids, hydrophilic residues, sterically biased (either 
small or large) residues, towards the creation of nucleic acid binding domains, the creation of 
cysteines, for cross-linking, prolines for SH-3 domains, serines, threonines, tyrosines or histidines for 
phosphorylation sites, etc., or to purines, etc. 
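
The difference between a fully randomized and a biased library can be illustrated with a short sketch. The names below are hypothetical, and the hydrophobic residue class is one illustrative choice; the specification does not define which residues belong to each class.

```python
import random
from typing import Dict

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes
HYDROPHOBIC = "AVLIMFW"  # illustrative class only (assumption)


def random_peptide(length: int, rng: random.Random) -> str:
    """Fully randomized peptide: any residue may occur at any position."""
    return "".join(rng.choice(AMINO_ACIDS) for _ in range(length))


def biased_peptide(
    length: int,
    rng: random.Random,
    fixed: Dict[int, str],
    restricted: Dict[int, str],
) -> str:
    """Biased peptide: some positions are held constant (`fixed`), others are
    drawn from a limited set of residues (`restricted`), the rest are random."""
    residues = []
    for position in range(length):
        if position in fixed:
            residues.append(fixed[position])
        elif position in restricted:
            residues.append(rng.choice(restricted[position]))
        else:
            residues.append(rng.choice(AMINO_ACIDS))
    return "".join(residues)


rng = random.Random(0)  # seeded so the sketch is reproducible
# Ten-residue peptides with a constant cysteine at position 0 (e.g. for
# cross-linking) and a hydrophobic residue at position 3.
library = [biased_peptide(10, rng, {0: "C"}, {3: HYDROPHOBIC}) for _ in range(5)]
```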

In a preferred embodiment, the candidate bioactive agents are nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid candidate bioactive agents may be naturally 
occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. For example, digests 
of procaryotic or eucaryotic genomes may be used as is outlined above for proteins. 

In a preferred embodiment, the candidate bioactive agents are organic chemical moieties, a wide 
variety of which are available in the literature. 

In assays for altering the expression profile of one or more CA genes, after the candidate agent has 
been added and the cells allowed to incubate for some period of time, the sample containing the target 
sequences to be analyzed is added to the biochip. If required, the target sequence is prepared using 
known techniques. For example, the sample may be treated to lyse the cells, using known lysis 
buffers, electroporation, etc., with purification and/or amplification such as PCR occurring as needed, 
as will be appreciated by those in the art. For example, an in vitro transcription with labels covalently 
attached to the nucleosides is done. Generally, the nucleic acids are labeled with a label as defined 
herein, with biotin-FITC or PE, Cy3 and Cy5 being particularly preferred. 

In a preferred embodiment, the target sequence is labeled with, for example, a fluorescent, 
chemiluminescent, chemical, or radioactive signal, to provide a means of detecting the target 
sequence's specific binding to a probe. The label also can be an enzyme, such as, alkaline 
phosphatase or horseradish peroxidase, which when provided with an appropriate substrate produces 
a product that can be detected. Alternatively, the label can be a labeled compound or small molecule, 
such as an enzyme inhibitor, that binds but is not catalyzed or altered by the enzyme. The label also 
can be a moiety or compound, such as, an epitope tag or biotin which specifically binds to streptavidin. 
For the example of biotin, the streptavidin is labeled as described above, thereby, providing a 
detectable signal for the bound target sequence. As known in the art, unbound labeled streptavidin is 
removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct hybridization assays or can 
comprise "sandwich assays", which include the use of multiple probes, as is generally outlined in U.S. 
Patent Nos. 5,681,702, 5,597,909, 5,545,730, 5,594,117, 5,591,584, 5,571,670, 5,580,731, 
5,624,802, 5,635,352, 5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of which are 
hereby incorporated by reference. In this embodiment, in general, the target nucleic acid is prepared 
as outlined above, and then added to the biochip comprising a plurality of nucleic acid probes, under 
conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, including high, moderate 
and low stringency conditions as outlined above. The assays are generally run under stringency 
conditions which allow formation of the label probe hybridization complex only in the presence of 
target. Stringency can be controlled by altering a step parameter that is a thermodynamic variable, 
including, but not limited to, temperature, formamide concentration, salt concentration, chaotropic salt 
concentration, pH, organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally outlined in U.S. 
Patent No. 5,681,697. Thus it may be desirable to perform certain steps at higher stringency 
conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways, as will be appreciated by 
those in the art. Components of the reaction may be added simultaneously, or sequentially, in any 
order, with preferred embodiments outlined below. In addition, a variety of other reagents may be 
included in the assays. These include reagents like salts, buffers, neutral 
proteins, e.g. albumin, detergents, etc., which may be used to facilitate optimal hybridization and 
detection, and/or reduce non-specific or background interactions. Also reagents that otherwise 
improve the efficiency of the assay, such as protease inhibitors, nuclease inhibitors, anti-microbial 
agents, etc., may be used, depending on the sample preparation methods and purity of the target. In 
addition, either solid phase or solution based (i.e., kinetic PCR) assays may be used. 



Once the assay is run, the data is analyzed to determine the expression levels, and changes in 
expression levels as between states, of individual genes, forming a gene expression profile. 
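
A minimal sketch of this analysis step follows, assuming that each state is summarized as one numeric signal per gene. The log2 ratio and the pseudocount are common conventions chosen here for illustration rather than anything the specification requires, and the gene names are hypothetical.

```python
import math
from typing import Dict


def expression_profile(
    state_a: Dict[str, float],
    state_b: Dict[str, float],
    pseudocount: float = 1.0,
) -> Dict[str, float]:
    """Per-gene log2 ratio of state_b over state_a for genes measured in both.

    The pseudocount (an assumption of this sketch) avoids division by zero
    for genes with no detectable signal in one state.
    """
    shared_genes = sorted(set(state_a) & set(state_b))
    return {
        gene: math.log2(
            (state_b[gene] + pseudocount) / (state_a[gene] + pseudocount)
        )
        for gene in shared_genes
    }


# Hypothetical signals for two genes in normal and tumor tissue.
normal = {"geneA": 99.0, "geneB": 49.0}
tumor = {"geneA": 399.0, "geneB": 49.0}
profile = expression_profile(normal, tumor)
```

In this example geneA comes out at a log2 ratio of 2 (four-fold higher in tumor) and geneB at 0 (unchanged).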

In a preferred embodiment, as for the diagnosis and prognosis applications, having identified the 
differentially expressed gene(s) or mutated gene(s) important in any one state, screens can be run to 
alter the expression of the genes individually. That is, screening for modulation of regulation of 
expression of a single gene can be done. Thus, for example, particularly in the case of target genes 
whose presence or absence is unique between two states, screening is done for modulators of the 
target gene expression. 

In addition, screens can be done for novel genes that are induced in response to a candidate agent. 
After identifying a candidate agent based upon its ability to suppress a CA expression pattern leading 
to a normal expression pattern, or modulate a single CA gene expression profile so as to mimic the 
expression of the gene from normal tissue, a screen as described above can be performed to identify 
genes that are specifically modulated in response to the agent. Comparing expression profiles 
between normal tissue and agent treated CA tissue reveals genes that are not expressed in normal 
tissue or CA tissue, but are expressed in agent treated tissue. These agent specific sequences can 
be identified and used by any of the methods described herein for CA genes or proteins. In particular 
these sequences and the proteins they encode find use in marking or identifying agent treated cells. 
In addition, antibodies can be raised against the agent induced proteins and used to target novel 
therapeutics to the treated CA tissue sample. 

Thus, in one embodiment, a candidate agent is administered to a population of CA cells, that thus has 
an associated CA expression profile. By "administration" or "contacting" herein is meant that the 
candidate agent is added to the cells in such a manner as to allow the agent to act upon the cell, 
whether by uptake and intracellular action, or by action at the cell surface. In some embodiments, 
nucleic acid encoding a proteinaceous candidate agent (i.e. a peptide) may be put into a viral 
construct such as a retroviral construct and added to the cell, such that expression of the peptide 
agent is accomplished; see PCT US97/01019, hereby expressly incorporated by reference. 

Once the candidate agent has been administered to the cells, the cells can be washed if desired and 
are allowed to incubate under preferably physiological conditions for some period of time. The cells 
are then harvested and a new gene expression profile is generated, as outlined herein. 

Thus, for example, CA tissue may be screened for agents that reduce or suppress the CA phenotype. 
A change in at least one gene of the expression profile indicates that the agent has an effect on CA 
activity. By defining such a signature for the CA phenotype, screens for new drugs that alter the 
phenotype can be devised. With this approach, the drug target need not be known and need not be 
represented in the original expression screening platform, nor does the level of transcript for the target 
protein need to change. 
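
The criterion that a change in at least one gene of the profile indicates an effect can be sketched as below. The function names are hypothetical, and the 1.5-fold cutoff (a 50% change) is an assumed value for deciding what counts as a change in a gene.

```python
from typing import Dict, List


def changed_genes(
    before: Dict[str, float],
    after: Dict[str, float],
    min_fold: float = 1.5,
) -> List[str]:
    """Genes whose level differs by at least `min_fold` in either direction
    between the untreated (`before`) and agent-treated (`after`) profiles."""
    changed = []
    for gene in sorted(set(before) & set(after)):
        lower, higher = sorted((before[gene], after[gene]))
        if lower > 0 and higher / lower >= min_fold:
            changed.append(gene)
    return changed


def agent_has_effect(
    before: Dict[str, float],
    after: Dict[str, float],
    min_fold: float = 1.5,
) -> bool:
    """True if at least one gene of the profile changes after treatment."""
    return len(changed_genes(before, after, min_fold)) >= 1


# Hypothetical profile of CA cells before and after adding a candidate agent.
untreated = {"g1": 100.0, "g2": 100.0}
treated = {"g1": 40.0, "g2": 105.0}
hits = changed_genes(untreated, treated)
```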

In a preferred embodiment, as outlined above, screens may be done on individual genes and gene 
products (proteins). That is, having identified a particular differentially expressed gene as important in 
a particular state, screening of modulators of either the expression of the gene or the gene product 
itself can be done. The gene products of differentially expressed genes are sometimes referred to 
herein as "CA proteins" or a "CAP". The CAP may be a fragment, or alternatively, be the full length 
protein to the fragment encoded by the nucleic acids of Tables 1-10. Preferably, the CAP is a 
fragment. In another embodiment, the sequences are sequence variants as further described herein. 

Preferably, the CAP is a fragment of approximately 14 to 24 amino acids long. More preferably the 
fragment is a soluble fragment. Preferably, the fragment includes a non-transmembrane region. In a 
preferred embodiment, the fragment has an N-terminal Cys to aid in solubility. In one embodiment, 
the C-terminus of the fragment is kept as a free acid and the N-terminus is a free amine to aid in 
coupling, i.e., to cysteine. 

In one embodiment the CA proteins are conjugated to an immunogenic agent as discussed herein. In 
one embodiment the CA protein is conjugated to BSA. 

In a preferred embodiment, screening is done to alter the biological function of the expression product 
of the CA gene. Again, having identified the importance of a gene in a particular state, screening for 
agents that bind and/or modulate the biological activity of the gene product can be run as is more fully 
outlined below. 

In a preferred embodiment, screens are designed to first find candidate agents that can bind to CA 
proteins, and then these agents may be used in assays that evaluate the ability of the candidate agent 
to modulate the CAP activity and the carcinoma phenotype. Thus, as will be appreciated by those in 
the art, there are a number of different assays which may be run: binding assays and activity assays. 

In a preferred embodiment, binding assays are done. In general, purified or isolated gene product is 
used; that is, the gene products of one or more CA nucleic acids are made. In general, this is done as 
is known in the art. For example, antibodies are generated to the protein gene products, and standard 
immunoassays are run to determine the amount of protein present. Alternatively, cells comprising the 
CA proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a CA protein and a candidate 
bioactive agent, and determining the binding of the candidate agent to the CA protein. Preferred 
embodiments utilize the human or mouse CA protein, although other mammalian proteins may also be 
used, for example for the development of animal models of human disease. In some embodiments, as 
outlined herein, variant or derivative CA proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the CA protein or the candidate agent is 
non-diffusably bound to an insoluble support having isolated sample receiving areas (e.g. a microtiter 
plate, an array, etc.). The insoluble supports may be made of any composition to which the 
compositions can be bound, that is readily separated from soluble material, and that is otherwise compatible 
with the overall method of screening. The surface of such supports may be solid or porous and of any 
convenient shape. Examples of suitable insoluble supports include microtiter plates, arrays, 
membranes and beads. These are typically made of glass, plastic (e.g., polystyrene), 
polysaccharides, nylon or nitrocellulose, Teflon™, etc. Microtiter plates and arrays are especially 
convenient because a large number of assays can be carried out simultaneously, using small amounts 
of reagents and samples. The particular manner of binding of the composition is not crucial so long 
as it is compatible with the reagents and overall methods of the invention, maintains the activity of the 
composition and is nondiffusable. Preferred methods of binding include the use of antibodies (which 
do not sterically block either the ligand binding site or activation sequence when the protein is bound to 
the support), direct binding to "sticky" or ionic supports, chemical crosslinking, the synthesis of the 
protein or agent on the surface, etc. Following binding of the protein or agent, excess unbound 
material is removed by washing. The sample receiving areas may then be blocked through incubation 
with bovine serum albumin (BSA), casein or other innocuous protein or other moiety. 

In a preferred embodiment, the CA protein is bound to the support, and a candidate bioactive agent is 
added to the assay. Alternatively, the candidate agent is bound to the support and the CA protein is 
added. Novel binding agents include specific antibodies, non-natural binding agents identified in 
screens of chemical libraries, peptide analogs, etc. Of particular interest are screening assays for 
agents that have a low toxicity for human cells. A wide variety of assays may be used for this purpose, 
including labeled in vitro protein-protein binding assays, electrophoretic mobility shift assays, 
immunoassays for protein binding, functional assays (phosphorylation assays, etc.) and the like. 

The determination of the binding of the candidate bioactive agent to the CA protein may be done in a 
number of ways. In a preferred embodiment, the candidate bioactive agent is labeled, and binding 
determined directly. For example, this may be done by attaching all or a portion of the CA protein to a 
solid support, adding a labeled candidate agent (for example a fluorescent label), washing off excess 
reagent, and determining whether the label is present on the solid support. Various blocking and 
washing steps may be utilized as is known in the art. 

By "labeled" herein is meant that the compound is either directly or indirectly labeled with a label which 
provides a detectable signal, e.g. radioisotope, fluorescers, enzyme, antibodies, particles such as 
magnetic particles, chemiluminescers, or specific binding molecules, etc. Specific binding molecules 
include pairs, such as biotin and streptavidin, digoxin and antidigoxin etc. For the specific binding 
members, the complementary member would normally be labeled with a molecule which provides for 
detection, in accordance with known procedures, as outlined above. The label can directly or indirectly 
provide a detectable signal. 

In some embodiments, only one of the components is labeled. For example, the proteins (or 
proteinaceous candidate agents) may be labeled at tyrosine positions using 125I, or with fluorophores. 
Alternatively, more than one component may be labeled with different labels; using 125I for the proteins, 
for example, and a fluorophore for the candidate agents. 

In a preferred embodiment, the binding of the candidate bioactive agent is determined through the use 
of competitive binding assays. In this embodiment, the competitor is a binding moiety known to bind 
to the target molecule (i.e. CA protein), such as an antibody, peptide, binding partner, ligand, etc. 
Under certain circumstances, there may be competitive binding as between the bioactive agent and 
the binding moiety, with the binding moiety displacing the bioactive agent. 

In one embodiment, the candidate bioactive agent is labeled. Either the candidate bioactive agent, or 
the competitor, or both, is added first to the protein for a time sufficient to allow binding, if present. 
Incubations may be performed at any temperature which facilitates optimal activity, typically between 4 
and 40°C. Incubation periods are selected for optimum activity, but may also be optimized to facilitate 
rapid high-throughput screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent 
is generally removed or washed away. The second component is then added, and the presence or 
absence of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the candidate bioactive agent. 
Displacement of the competitor is an indication that the candidate bioactive agent is binding to the CA 
protein and thus is capable of binding to, and potentially modulating, the activity of the CA protein. In 
this embodiment, either component can be labeled. Thus, for example, if the competitor is labeled, 
the presence of label in the wash solution indicates displacement by the agent. Alternatively, if the 
candidate bioactive agent is labeled, the presence of the label on the support indicates displacement. 

In an alternative embodiment, the candidate bioactive agent is added first, with incubation and 
washing, followed by the competitor. The absence of binding by the competitor may indicate that the 
bioactive agent is bound to the CA protein with a higher affinity. Thus, if the candidate bioactive agent 
is labeled, the presence of the label on the support, coupled with a lack of competitor binding, may 
indicate that the candidate agent is capable of binding to the CA protein. 



In a preferred embodiment, the methods comprise differential screening to identify bioactive agents 
that are capable of modulating the activity of the CA proteins. In this embodiment, the methods 
comprise combining a CA protein and a competitor in a first sample. A second sample comprises a 
candidate bioactive agent, a CA protein and a competitor. The binding of the competitor is determined 
for both samples, and a change, or difference in binding between the two samples indicates the 
presence of an agent capable of binding to the CA protein and potentially modulating its activity. That 
is, if the binding of the competitor is different in the second sample relative to the first sample, the 
agent is capable of binding to the CA protein. 

Alternatively, a preferred embodiment utilizes differential screening to identify drug candidates that 
bind to the native CA protein, but cannot bind to modified CA proteins. The structure of the CA protein 
may be modeled, and used in rational drug design to synthesize agents that interact with that site. 
Drug candidates that affect CA bioactivity are also identified by screening drugs for the ability to either 
enhance or reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably all control and test 
samples are performed in at least triplicate to obtain statistically significant results. Incubation of all 
samples is for a time sufficient for the binding of the agent to the protein. Following incubation, all 
samples are washed free of non-specifically bound material and the amount of bound, generally 
labeled agent determined. For example, where a radiolabel is employed, the samples may be 
counted in a scintillation counter to determine the amount of bound compound. 
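
As an illustration of how replicate counts from such a differential screen might be compared, the following minimal Python sketch averages triplicate scintillation counts for a first sample (CA protein plus labeled competitor) and a second sample (CA protein, labeled competitor and candidate agent), subtracts a negative-control background, and reports the percent change in competitor binding. The function names, the example counts and the 50% cutoff are hypothetical and are not taken from this specification; the statistical test and threshold actually used would be chosen by the practitioner.

```python
from statistics import mean, stdev


def summarize(counts, background=0.0):
    """Return (mean, sample standard deviation) of background-corrected replicate counts."""
    corrected = [c - background for c in counts]
    return mean(corrected), stdev(corrected)


def percent_change_in_binding(control_counts, test_counts, background=0.0):
    """Percent change in bound labeled competitor in the test sample relative to the control."""
    control_mean, _ = summarize(control_counts, background)
    test_mean, _ = summarize(test_counts, background)
    if control_mean <= 0:
        raise ValueError("control signal must exceed background")
    return 100.0 * (test_mean - control_mean) / control_mean


# Hypothetical triplicate counts per minute (illustrative values only).
control = [1000.0, 1100.0, 900.0]   # CA protein + labeled competitor
test = [400.0, 500.0, 300.0]        # CA protein + labeled competitor + candidate agent
background = 100.0                  # negative control (no CA protein)

change = percent_change_in_binding(control, test, background)
is_hit = abs(change) >= 50.0        # hypothetical cutoff, an assumption for this sketch
```

A negative value of `change` corresponds to reduced competitor binding in the presence of the candidate agent, which is the difference between the two samples that the differential screen looks for.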

A variety of other reagents may be included in the screening assays. These include reagents like 
salts, neutral proteins, e.g. albumin, detergents, etc., which may be used to facilitate optimal 
protein-protein binding and/or reduce non-specific or background interactions. Also reagents that 
otherwise improve the efficiency of the assay, such as protease inhibitors, nuclease inhibitors, 
anti-microbial agents, etc., may be used. The mixture of components may be added in any order that 
provides for the requisite binding. 

Screening for agents that modulate the activity of CA proteins may also be done. In a preferred 
embodiment, methods for screening for a bioactive agent capable of modulating the activity of CA 
proteins comprise the steps of adding a candidate bioactive agent to a sample of CA proteins, as 
above, and determining an alteration in the biological activity of CA proteins. "Modulating the activity 
of a CA protein" includes an increase in activity, a decrease in activity, or a change in the type or kind 
of activity present. Thus, in this embodiment, the candidate agent should both bind to CA proteins 
(although this may not be necessary), and alter its biological or biochemical activity as defined herein. 
The methods include both in vitro screening methods, as are generally outlined above, and in vivo 
screening of cells for alterations in the presence, distribution, activity or amount of CA proteins. 

Thus, in this embodiment, the methods comprise combining a CA sample and a candidate bioactive 
agent, and evaluating the effect on CA activity. By "CA activity" or grammatical equivalents herein is 
meant one of the CA protein's biological activities, including, but not limited to, its role in 
tumorigenesis, including cell division, preferably in lymphatic tissue, cell proliferation, tumor growth 
and transformation of cells. In one embodiment, CA activity includes activation of or by a protein 
encoded by a nucleic acid of Tables 1-10. An inhibitor of CA activity inhibits any one or 
more CA activities. 

In a preferred embodiment, the activity of the CA protein is increased; in another preferred 
embodiment, the activity of the CA protein is decreased. Thus, bioactive agents that are antagonists 
are preferred in some embodiments, and bioactive agents that are agonists may be preferred in other 
embodiments. 

In a preferred embodiment, the invention provides methods for screening for bioactive agents capable 
of modulating the activity of a CA protein. The methods comprise adding a candidate bioactive agent, 
as defined above, to a cell comprising CA proteins. Preferred cell types include almost any cell. The 
cells contain a recombinant nucleic acid that encodes a CA protein. In a preferred embodiment, a 
library of candidate agents is tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence of, or with previous or subsequent 
exposure to, physiological signals, for example hormones, antibodies, peptides, antigens, cytokines, 
growth factors, action potentials, pharmacological agents including chemotherapeutics, radiation, 
carcinogens, or other cells (i.e. cell-cell contacts). In another example, the determinations are 
made at different stages of the cell cycle process. 

In this way, bioactive agents are identified. Compounds with pharmacological activity are able to 
enhance or interfere with the activity of the CA protein. 

In one embodiment, a method of inhibiting carcinoma cancer cell division is provided. The method 
comprises administration of a carcinoma cancer inhibitor. 

In a preferred embodiment, a method of inhibiting lymphoma carcinoma cell division is provided 
comprising administration of a lymphoma carcinoma inhibitor. 

In another embodiment, a method of inhibiting tumor growth is provided. The method comprises 
administration of a carcinoma cancer inhibitor. In a particularly preferred embodiment, a method of 
inhibiting tumor growth in lymphatic tissue is provided comprising administration of a lymphoma 
inhibitor. 

In a further embodiment, methods of treating cells or individuals with cancer are provided. The 
method comprises administration of a carcinoma cancer inhibitor. Preferably, the carcinoma is a 
lymphoma carcinoma. 

In one embodiment, a carcinoma cancer inhibitor is an antibody as discussed above. In another 
embodiment, the carcinoma cancer inhibitor is an antisense molecule. Antisense molecules as used 
herein include antisense or sense oligonucleotides comprising a single-stranded nucleic acid sequence 
(either RNA or DNA) capable of binding to target mRNA (sense) or DNA (antisense) sequences for 
carcinoma cancer molecules. Antisense or sense oligonucleotides, according to the present 
invention, comprise a fragment generally at least about 14 nucleotides, preferably from about 14 to 30 
nucleotides. The ability to derive an antisense or a sense oligonucleotide, based upon a cDNA 
sequence encoding a given protein is described in, for example, Stein and Cohen, Cancer Res. 
48:2659, (1988) and van der Krol et al., BioTechniques 6:958, (1988). 
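
By way of illustration only, the derivation of candidate antisense oligonucleotides from a cDNA sequence can be sketched in Python as follows. The sketch slides a window of a chosen length (within the range of about 14 to 30 nucleotides stated above) along a coding fragment and takes the reverse complement of each window as a DNA antisense candidate. The function names and the short example fragment are hypothetical; the sketch does not evaluate target accessibility, specificity or any other property relevant to selecting a working oligonucleotide.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence containing only A, C, G and T."""
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))


def antisense_candidates(cdna, length=20):
    """Yield (start, oligo) pairs; each oligo is the DNA antisense of cdna[start:start + length]."""
    if not 14 <= length <= 30:
        raise ValueError("length outside the range of about 14 to 30 nucleotides")
    for start in range(len(cdna) - length + 1):
        yield start, reverse_complement(cdna[start:start + length])


# Arbitrary 16-nucleotide fragment used only to illustrate the calculation.
fragment = "ATGGCTGGCTATTTCT"
oligos = list(antisense_candidates(fragment, length=14))
```

Each yielded oligonucleotide is complementary to the corresponding stretch of the sense strand, and therefore to the mRNA transcribed from it.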

Antisense molecules may be introduced into a cell containing the target nucleotide sequence by 
formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable 
ligand binding molecules include, but are not limited to, cell surface receptors, growth factors, other 
cytokines, or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand 
binding molecule does not substantially interfere with the ability of the ligand binding molecule to bind 
to its corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide or 
its conjugated version into the cell. Alternatively, a sense or an antisense oligonucleotide may be 
introduced into a cell containing the target nucleic acid sequence by formation of an oligonucleotide-lipid 
complex, as described in WO 90/10448. It is understood that the use of antisense molecules or 
knock out and knock in models may also be used in screening assays as discussed above, in addition 
to methods of treatment. 

The compounds having the desired pharmacological activity may be administered in a physiologically 
acceptable carrier to a host, as previously described. The agents may be administered in a variety of 
ways, orally, parenterally e.g., subcutaneously, intraperitoneally, intravascularly, etc. Depending upon 
the manner of introduction, the compounds may be formulated in a variety of ways. The concentration 
of therapeutically active compound in the formulation may vary from about 0.1-100% wgt/vol. The 
agents may be administered alone or in combination with other treatments, e.g., radiation. 

The pharmaceutical compositions can be prepared in various forms, such as granules, tablets, pills, 
suppositories, capsules, suspensions, salves, lotions and the like. Pharmaceutical grade organic or 
inorganic carriers and/or diluents suitable for oral and topical use can be used to make up 
compositions containing the therapeutically-active compounds. Diluents known to the art include 
aqueous media, vegetable and animal oils and fats. Stabilizing agents, wetting and emulsifying 
agents, salts for varying the osmotic pressure or buffers for securing an adequate pH value, and skin 
penetration enhancers can be used as auxiliary agents. 

Without being bound by theory, it appears that the various CA sequences are important in carcinomas. 
Accordingly, disorders based on mutant or variant CA genes may be determined. In one embodiment, 
the invention provides methods for identifying cells containing variant CA genes comprising 
determining all or part of the sequence of at least one endogenous CA gene in a cell. As will be 
appreciated by those in the art, this may be done using any number of sequencing techniques. In a 
preferred embodiment, the invention provides methods of identifying the CA genotype of an individual 
comprising determining all or part of the sequence of at least one CA gene of the individual. This is 
generally done in at least one tissue of the individual, and may include the evaluation of a number of 
tissues or different samples of the same tissue. The method may include comparing the sequence of 
the sequenced CA gene to a known CA gene, i.e., a wild-type gene. As will be appreciated by those in 
the art, alterations in the sequence of some oncogenes can be an indication of either the presence of 
the disease, or propensity to develop the disease, or prognosis evaluations. 

The sequence of all or part of the CA gene can then be compared to the sequence of a known CA 
gene to determine if any differences exist. This can be done using any number of known homology 
programs, such as Bestfit, etc. In a preferred embodiment, the presence of a difference in the 
sequence between the CA gene of the patient and the known CA gene is indicative of a disease state 
or a propensity for a disease state, as outlined herein. 
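
A minimal Python sketch of the comparison step is given below for illustration. It assumes that the patient sequence and the known sequence have already been aligned to equal length (for example by an alignment program such as those mentioned above), with "-" marking alignment gaps, and simply lists the positions at which they differ. The function name and the example fragments are hypothetical and are not sequences from Tables 1-10.

```python
def sequence_differences(reference, sample):
    """Return (position, reference_base, sample_base) tuples, 1-based, for pre-aligned sequences."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be pre-aligned to equal length")
    return [
        (i, ref, obs)
        for i, (ref, obs) in enumerate(zip(reference.upper(), sample.upper()), start=1)
        if ref != obs
    ]


# Hypothetical aligned fragments; "-" marks an alignment gap.
known = "ATGGCTGGCTAT"
patient = "ATGGCTAGC-AT"
differences = sequence_differences(known, patient)
```

An empty list indicates that no differences were found over the compared region; whether a given difference is clinically meaningful is outside the scope of this sketch.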

In a preferred embodiment, the CA genes are used as probes to determine the number of copies of 
the CA gene in the genome. For example, some cancers exhibit chromosomal deletions or insertions, 
resulting in an alteration in the copy number of a gene. 

In another preferred embodiment, CA genes are used as probes to determine the chromosomal 
location of the CA genes. Information such as chromosomal location finds use in providing a 
diagnosis or prognosis, in particular when chromosomal abnormalities such as translocations and the 
like are identified in CA gene loci. 

Thus, in one embodiment, methods of modulating CA in cells or organisms are provided. In one 
embodiment, the methods comprise administering to a cell an anti-CA antibody that reduces or 
eliminates the biological activity of an endogenous CA protein. Alternatively, the methods comprise 
administering to a cell or organism a recombinant nucleic acid encoding a CA protein. As will be 
appreciated by those in the art, this may be accomplished in any number of ways. In a preferred 
embodiment, for example when the CA sequence is down-regulated in carcinoma, the activity of the 
CA gene is increased by increasing the amount of CA in the cell, for example by overexpressing the 
endogenous CA or by administering a gene encoding the CA sequence, using known gene-therapy 
techniques, for example. In a preferred embodiment, the gene therapy techniques include the 
incorporation of the exogenous gene using enhanced homologous recombination (EHR), for example 
as described in PCT/US93/03868, hereby incorporated by reference in its entirety. Alternatively, for 
example when the CA sequence is up-regulated in carcinoma, the activity of the endogenous CA gene 
is decreased, for example by the administration of a CA antisense nucleic acid. 

In one embodiment, the CA proteins of the present invention may be used to generate polyclonal and 
monoclonal antibodies to CA proteins, which are useful as described herein. Similarly, the CA 
proteins can be coupled, using standard technology, to affinity chromatography columns. These 
columns may then be used to purify CA antibodies. In a preferred embodiment, the antibodies are 
generated to epitopes unique to a CA protein; that is, the antibodies show little or no cross-reactivity to 
other proteins. These antibodies find use in a number of applications. For example, the CA 
antibodies may be coupled to standard affinity chromatography columns and used to purify CA 
proteins. The antibodies may also be used as blocking polypeptides, as outlined above, since they will 
specifically bind to the CA protein. 

In one embodiment, a therapeutically effective dose of a CA or modulator thereof is administered to a 
patient. By "therapeutically effective dose" herein is meant a dose that produces the effects for which 
it is administered. The exact dose will depend on the purpose of the treatment, and will be 
ascertainable by one skilled in the art using known techniques. As is known in the art, adjustments for 
CA degradation, systemic versus localized delivery, and rate of new protease synthesis, as well as the 
age, body weight, general health, sex, diet, time of administration, drug interaction and the severity of 
the condition may be necessary, and will be ascertainable with routine experimentation by those 
skilled in the art. 

A "patient" for the purposes of the present invention includes both humans and other animals, 
particularly mammals, and organisms. Thus the methods are applicable to both human therapy and 
veterinary applications. In the preferred embodiment the patient is a mammal, and in the most 
preferred embodiment the patient is human. 

The administration of the CA proteins and modulators of the present invention can be done in a variety 
of ways as discussed above, including, but not limited to, orally, subcutaneously, intravenously, 
intranasally, transdermally, intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or 
intraocularly. In some instances, for example, in the treatment of wounds and inflammation, the CA 
proteins and modulators may be directly applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a CA protein in a form suitable for 
administration to a patient. In the preferred embodiment, the pharmaceutical compositions are in a 
water soluble form, such as being present as pharmaceutically acceptable salts, which is meant to 
include both acid and base addition salts. "Pharmaceutically acceptable acid addition salt" refers to 
those salts that retain the biological effectiveness of the free bases and that are not biologically or 
otherwise undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, propionic 
acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic acid, fumaric acid, 
tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, methanesulfonic acid, 
ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. "Pharmaceutically acceptable 
base addition salts" include those derived from inorganic bases such as sodium, potassium, lithium, 
ammonium, calcium, magnesium, iron, zinc, copper, manganese, aluminum salts and the like. 
Particularly preferred are the ammonium, potassium, sodium, calcium, and magnesium salts. Salts 
derived from pharmaceutically acceptable organic non-toxic bases include salts of primary, secondary, 
and tertiary amines, substituted amines including naturally occurring substituted amines, cyclic amines 
and basic ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: carrier proteins such 
as serum albumin; buffers; fillers such as microcrystalline cellulose, lactose, corn and other starches; 
binding agents; sweeteners and other flavoring agents; coloring agents; and polyethylene glycol. 
Additives are well known in the art, and are used in a variety of formulations. 

In a preferred embodiment, CA proteins and modulators are administered as therapeutic agents, and 
can be formulated as outlined above. Similarly, CA genes (including both the full-length sequence, 
partial sequences, or regulatory sequences of the CA coding regions) can be administered in gene 
therapy applications, as is known in the art. These CA genes can include antisense applications, 
either as gene therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art. 

In a preferred embodiment, CA genes are administered as DNA vaccines, either single genes or 
combinations of CA genes. Naked DNA vaccines are generally known in the art; see Brower, Nature 
Biotechnology, 16:1304-1305 (1998). 



In one embodiment, CA genes of the present invention are used as DNA vaccines. Methods for the 
use of genes as DNA vaccines are well known to one of ordinary skill in the art, and include placing a 
CA gene or portion of a CA gene under the control of a promoter for expression in a patient with 
carcinoma. The CA gene used for DNA vaccines can encode full-length CA proteins, but more 
preferably encodes portions of the CA proteins including peptides derived from the CA protein. In a 
preferred embodiment a patient is immunized with a DNA vaccine comprising a plurality of nucleotide 
sequences derived from a CA gene. Similarly, it is possible to immunize a patient with a plurality of 
CA genes or portions thereof as defined herein. Without being bound by theory, following expression of the 
polypeptide encoded by the DNA vaccine, cytotoxic T-cells, helper T-cells and antibodies are induced 
which recognize and destroy or eliminate cells expressing CA proteins. 

In a preferred embodiment, the DNA vaccines include a gene encoding an adjuvant molecule with the 
DNA vaccine. Such adjuvant molecules include cytokines that increase the immunogenic response to 
the CA polypeptide encoded by the DNA vaccine. Additional or alternative adjuvants are known to 
those of ordinary skill in the art and find use in the invention. 



In another preferred embodiment, CA genes find use in generating animal models of carcinomas, 
particularly lymphoma carcinomas. As is appreciated by one of ordinary skill in the art, when the CA 
gene identified is repressed or diminished in CA tissue, gene therapy technology wherein antisense 
RNA is directed to the CA gene will also diminish or repress expression of the gene. An animal 
generated as such serves as an animal model of CA that finds use in screening bioactive drug 
candidates. Similarly, gene knockout technology, for example as a result of homologous 
recombination with an appropriate gene targeting vector, will result in the absence of the CA protein. 
When desired, tissue-specific expression or knockout of the CA protein may be necessary. 

It is also possible that the CA protein is overexpressed in carcinoma. As such, transgenic animals can 
be generated that overexpress the CA protein. Depending on the desired expression level, promoters 
of various strengths can be employed to express the transgene. Also, the number of copies of the 
integrated transgene can be determined and compared for a determination of the expression level of 
the transgene. Animals generated by such methods find use as animal models of CA and are 
additionally useful in screening for bioactive molecules to treat carcinoma. 

The CA nucleic acid sequences of the invention are depicted in Tables 1-10. The sequences in each 
Table include genomic sequence, mRNA and coding sequences for both mouse and human. The 
different sequences are assigned the following SEQ ID NOs: 



Table I (mouse gene: Fscn1; human gene: SNL) 
Mouse genomic sequence (SEQ ID NO: 1) 
Mouse mRNA sequence (SEQ ID NO: 2) 
Mouse coding sequence (SEQ ID NO: 3) 
Human genomic sequence (SEQ ID NO: 4) 
Human mRNA sequence (SEQ ID NO: 5) 
Human coding sequence (SEQ ID NO: 6) 

Table II (mouse gene: Map3k6; human gene: MAP3K6) 
Mouse genomic sequence (SEQ ID NO: 7) 
Mouse mRNA sequence (SEQ ID NO: 8) 
Mouse coding sequence (SEQ ID NO: 9) 
Human genomic sequence (SEQ ID NO: 10) 
Human mRNA sequence (SEQ ID NO: 11) 
Human coding sequence (SEQ ID NO: 12) 

Table III (mouse gene: Fosb; human gene: FOSB) 
Mouse genomic sequence (SEQ ID NO: 13) 
Mouse mRNA sequence (SEQ ID NO: 14) 
Mouse coding sequence (SEQ ID NO: 15) 
Human genomic sequence (SEQ ID NO: 16) 
Human mRNA sequence (SEQ ID NO: 17) 
Human coding sequence (SEQ ID NO: 18) 

Table IV (mouse gene: Cmkbr7; human gene: CCR7) 
Mouse genomic sequence (SEQ ID NO: 19) 
Mouse mRNA sequence (SEQ ID NO: 20) 
Mouse coding sequence (SEQ ID NO: 21) 
Human genomic sequence (SEQ ID NO: 22) 
Human mRNA sequence (SEQ ID NO: 23) 
Human coding sequence (SEQ ID NO: 24) 

Table V (mouse gene: Ccnd1; human gene: CCND1) 
Mouse genomic sequence (SEQ ID NO: 25) 
Mouse mRNA sequence (SEQ ID NO: 26) 
Mouse coding sequence (SEQ ID NO: 27) 
Human genomic sequence (SEQ ID NO: 28) 
Human mRNA sequence (SEQ ID NO: 29) 
Human coding sequence (SEQ ID NO: 30) 

Table VI (mouse gene: Ccnd3; human gene: CCND3) 
Mouse genomic sequence (SEQ ID NO: 31) 
Mouse mRNA sequence (SEQ ID NO: 32) 
Mouse coding sequence (SEQ ID NO: 33) 
Human genomic sequence (SEQ ID NO: 34) 
Human mRNA sequence (SEQ ID NO: 35) 
Human coding sequence (SEQ ID NO: 36) 

Table VII (mouse gene: Wnt3; human gene: WNT3) 
Mouse genomic sequence (SEQ ID NO: 37) 
Mouse mRNA sequence (SEQ ID NO: 38) 
Mouse coding sequence (SEQ ID NO: 39) 
Human genomic sequence (SEQ ID NO: 40) 
Human mRNA sequence (SEQ ID NO: 41) 
Human coding sequence (SEQ ID NO: 42) 

Table VIII (mouse gene: Batf; human gene: BATF) 
Mouse genomic sequence (SEQ ID NO: 43) 
Mouse mRNA sequence (SEQ ID NO: 44) 
Mouse coding sequence (SEQ ID NO: 45) 
Human genomic sequence (SEQ ID NO: 46) 
Human mRNA sequence (SEQ ID NO: 47) 
Human coding sequence (SEQ ID NO: 48) 

Table IX (mouse gene: Irf4; human gene: IRF4) 
Mouse genomic sequence (SEQ ID NO: 49) 
Mouse mRNA sequence (SEQ ID NO: 50) 
Mouse coding sequence (SEQ ID NO: 51) 
Human genomic sequence (SEQ ID NO: 52) 
Human mRNA sequence (SEQ ID NO: 53) 
Human coding sequence (SEQ ID NO: 54) 

Table X (mouse gene: Notch1; human gene: NOTCH1) 
Mouse genomic sequence (SEQ ID NO: 55) 
Mouse mRNA sequence (SEQ ID NO: 56) 
Mouse coding sequence (SEQ ID NO: 57) 
Human genomic sequence (SEQ ID NO: 58) 
Human mRNA sequence (SEQ ID NO: 59) 
Human coding sequence (SEQ ID NO: 60) 
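
The listing above follows a regular pattern: each Table contributes six consecutive SEQ ID NOs, in the order mouse genomic, mouse mRNA, mouse coding, human genomic, human mRNA and human coding. As a convenience for the reader, that numbering can be expressed by the short Python sketch below. The function and constant names are hypothetical and are offered only as a lookup aid; they form no part of the sequence listing itself.

```python
SPECIES = ("mouse", "human")
SEQUENCE_TYPES = ("genomic", "mRNA", "coding")


def seq_id_no(table, species, sequence_type):
    """Return the SEQ ID NO for a Table number (1-10), species and sequence type."""
    if not 1 <= table <= 10:
        raise ValueError("table must be between 1 and 10")
    species_offset = 3 * SPECIES.index(species)               # mouse: 0, human: 3
    type_offset = SEQUENCE_TYPES.index(sequence_type) + 1     # genomic: 1, mRNA: 2, coding: 3
    return 6 * (table - 1) + species_offset + type_offset
```

For example, `seq_id_no(5, "human", "mRNA")` gives 29, matching the human mRNA sequence of Table V.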

TABLE I 



MOUSE NOMENCLATURE 
ICSGNM Fscn1 
Celera mCG23208 

HUMAN NOMENCLATURE 
HGNC SNL 
Celera hCG15970 

MOUSE SEQUENCE - GENOMIC 

CCTCTCAAACTCTTGGTAATTCTCCTGCTTCGGCACGCTGGGGACTGGGACTACACCTGTGTGCCACCATGGCTGGCTATTTCTCTCTCTTGAACAC 
TGGAAGAGTGCTCAGAGT TTACTGG ATC TGGGAGAAGCTCGAGGCTGGGT T ATGGAAGCCGGCTCTG CTTG CTCCAGCT CAGGGAGGAGTTGCCTG A 
GGGCCAGGCACTTCGAAGGACAGATGGGACCCAAGGCCAGACACTGGGCCCCAGCTCACCAGAAGTGGAGCCCTGACTGTCTCTGAGAGGGTAAGCT 
GGGGTGGCTGCCAGGCGGGCAAGGCCAAAGCCTGGCAGCAGCCGGTGGCCCCTCTTCTGGC^^ 

CACTAAAGTGCCCTCTTAATTAGCCAGGCTCTTGCCAAGCACTGGCCACGATTGCCTTGTTTCACAGAATTCACCTTGGACTGGCACAGATGGGGAG 
GGTGGATAGCCCGGTGTTTTGTCTTTCTTCTAGGGGAGCTGGGCTCAAGGGCAGGACTCCTGGGCCAGCCCAGGTTTTTCCCTCAGAAACCACAGCA 
TGAATACTGGCATGATGGAGCACACCTGTAATCAGCAGAAATTGAGACAGGAAGATTGTTGAGAATTTGAG^ 

CTATCTCTACACACTCCCCCTCCCCCCGCCCCTGGAAATGCTGGAGGGTCTGGGGAAGATGGCTCAGTGGTTAAGGGCACTTGCTGCTCTTGCAGGG 
GACCCAAGTTTGATTCCCAGCATCTATAACTTGGTGGCTCACAATTCCAGTTCCAGGCACTCTGACACCTTCCTGTGGTCTGCAAGGCACCTGCATT 
CATTCTCACGTGCATGCGCACACACATGCGCGCGCGCACACACACACACACACACACACACACATCTCATAAATACAAATAAAATAAATCTTGAAAA 
CAACAGTGA<^ACAACACATGCTGAACATGGTGGTGCTCCTGACATGGTGGTGCTC^ 

GGTGGTGCTCCTGACATGATGGTAAGCCTGACATGGTGGTGCTCCTGACATGGTGGTGCTCCTGACATGGTGATGCTCCTGACATGGTGGTGCTCCT 
GACATGGTGGTAAGCCTGACATGGTGGTAAGCCTGACATGGTGGTGCTCCTGACATGGTGGTGCTCCCTGCACTGACAGTTCTGAGGAGGTGGAGAT 
AGGAGGATCCAGACTTGGAAGCTTGCCTGAGATAGACCCTGTCTCAGAATAGGCTGGAGGCTGCCTCTGCCCTTTTGCTTCTATCTCTTTTTCCATA 
TGTGTGTGTGCGTG TGTGCATATGTGTATGC ATATTTGTAAG T AT ATGT AT ATAAAT AAAC ACAC AC AC ACACAAACTTAC TCTGTATACTAGGCTG 

GCCTTGAACTCAGAGACCTGCCTGTTTCTGCCTTCCCAGTGCTGGGATTAAAAGCGTGTGCCATGGGCTGGTGAGATGGCTCAGTGGGTAAGAGTAC 
CTGACTGCTCTTCCAAAGGTCCAGTGTTCAAGTCCCAGCAACCACATGGTGGCTCACAACCATCCGTAACAAGATCTGACGCCCTCTTCTGGTGTGT 

CTG AAG ACAGCTACAGTGTA C TTAAAT AAATAAATCTT AAAAAAAAAAAAAAAAAAAAAAAAAGG CGTGCACCA CCA CTG CCCGGCTAGTTAGC TT T 
ATCTTCCGTGGATGTTTTGTCTGCAGTATGTCTGAGTGAGGGTGTCAGATCCTCTGTAACTGGAGTTATAGACAGTTGTGAGCAGCCATGTAAGTGC 
TGGGAATTGAATCCGGGTTCTCTGGAAGAGCAGTTCATGCTCTTAAGCACTGCGCCATCTCTCCAGCTGTCCTTATTTATTTTTAAGGGTCTCTTGT 

AGCCATACAGGCCAGAAACCCACCATTAGCCACGGACCTTGAGCTCCAAGTGCTGGGTGACAGGCCCGCATCTCTCCTGGTTTGTGATCTGCTAGGG 
ATGGGGTCTAGGACCTCACACAGGCTGCATAAGCATTCCCCTCACCCACCCTCATCCCCACCCAGTCCTGGACAGGCTCTGGCTGTGTAGGCTGGTC 
TCCTCCTTTCTATCCCCCTGCCTC^GCCTCCTGAGTGCTGAGGNNNNNNNNNNNNNNN^ 

NNNNNNNNN2WNNNNNNNNNNNN)WNNNNNN^ 

GTTCAAGGCCAGCTTGGTTTACTTGAGACTCTGTCTCAAAAAGACCAAACCAGGG 
GAACCTGAGTTTCATCCTGGAACTCATACAAAAAGCTGGCTGTGGTCAGGTGGCGCATGCCTTTAATCCCATCACTCAGGAGACAGAGGCAGGCAGA 
TTCTTGAGTTCGAGGCCAGCCTGGTCTACAGACTGAGTTCCAGGATAGCCAGGTCTACACAGAGAAACC 

AAACTAGGTGTGGCTAGCCTGTAACCCCAGTGCTGGGTAAACAGGGGCAGGTGAATACATCTCCGGGATTCGCCAGCCTGTCCTATCTATGAGCTCT 
AGGTCTGGGGAGACAGCCTGTCAAATCAATGCAAGCAGGTGACAACCGAGGAAAGACACCCAAGGTTGA 

CCACAGCAAACAAAAACTAACAAAACCCAAACCATATTTTGCTGTGGTGGTGGTTGTTAAATTTTATTTATTCTGTGTGTATGGGTATTTCACCTGC 
ATGAATATCTACCTGTGTAGCATATACATGCCGGGTACTTTCGGAGGTCAGAAGAGGACATCAAATCCCGTAGAACTGGAGTTACAGAAAATGAACT 
TGTGGCTTCTGGGAATCTAACTTGGATCCTCTAAAGGACAGGCAGGACTCTGAATTCTTGAGCCACAGATACAGCTTTCCAAACCCATGTTTTGTAA 

GAGACAGCAGTGGTGGTAACAGCCTGGTCTACCCTCTTGGTCAGAGGGAGGCAGGCGGAGAGTAACTCAGGGCCTGGGGGCAACACCCAGCTGGGCC 
TGGGGCTAGTGGGGGCTCAGC^GGCAGGACGTGGGTCCTCCTACTTCCTAAGCTCTGATGGGCAAGTGGTGCCAGTGGTACCAGGCTGCCCTGAGG 
GGCCCTTAGAGCAGGTGACCACAGAGCCACAAAGAGGCTATTCATGGCCTCCGGTTCACGAGGCTGCCCCTTTATTGAGGGCTTGAACCACAGAAAG 
CTGTGTGGTCTGCAGGGAAGCTAGTTCTGAGCTGTCTGGCCAGCTACAGTAGTGTGTGTGTGTGTGTGTGTGTGTATACACATGGTACTTTTTTCTA 
TCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTTTGTGTCTATGTATCTAATAATAAATTGATCTATTTAACTATGTAG 
ATATCAATCCATTGATCTATGTATCTATCAACTGATGTGTATGTATGTATGGATGTACCTACATGCGTACTTGAGGCAGGACTTAAAGTAGCCCAGG 
CTAGCCTTACCACATTTCATAGCAGAGGATGGTCTTGAACTTCAGGTCCTCCTGTCTCTACCTCGTGAGTGCTGGGATTACAGGTTTGTGCCACCAC 
GCCTGGCTTAGGAGGTT CTGTGGTT AGAATC CAGGAT TTC AC TAT CAACTGGGCCACATCGTCAG CCATAT TCAATATGCTTTCTTTTAC ATT CTTA 
CTTATTTACTACAAATGGGTCCCAGGAACCAAGGTCGGCAAGCCTGATGGTGAGTGCTCTTCCCTGCTGGCTGACTTATTTGCTCTTTTTTTTTTTT 



TCTGTCCCCAACCCCACCCCCAGTACTGGGATTAAAGGCATGCGCCACACGCCCAGCTCTT^ 

GCTGACCGTTCATGCACAATACAGCTAAGGCTTGCTTTGAATTTCTCAGTTTCTTTCTTCTACCCAACAGTTTCTGAGATTTCAGGTCCGTCCTATG 

TC^TGCCCAGGTTTTTTGTTTGTTTGTTTGTTTGTTTTTTGGTTTTGTTTTGCTGTTTTTCAACACAGAGTTTCTCTGTGGAACCCTGGCTATCCTG 

GAACTCTGTATACCAGGCTGTCCTTGAACTCAGAGACCCTCTGCCTCCCTAGTGCTGGGTTAAAGGTCCCTGCCATCACCGTGAAACTTAAGCCCAG 

CTTCTCATTTGGTGGCTGGAAATTTGCCTGCCTCTGCCTCTCTCCAGCTGCCTAGCA 

AGGACAAGATCTCTmsCAACCGTGAACAGCCCTACTGAGCCATCCTCCTGGGACAG 

GTACAA AGTTGACAC AACCTTGC TCT TCCCAGCAGCT GGG AGTT C CTG AGAAGTGTAAATGAGGTTTCC AGGAAAG CCAAGAGGTGAG AGACTGAAA 
AC-TTGGAGGGATAAGGGGGTGCCCATCAAGCTGCAGTGGAGACTTGTTATCTGTTGCGCATCCCTGACATCTCAGCAGGAGGATCAGGAGTTCAAGG 
GCATCCTCAGCAACATAGTAAGTCTATGGAGCTAGCCTGGGCTACACAGATGCCATTACAAAAAAAAAAAAAAAAGTAAAAAAAAAATCTTGGGATA 
TGGGGGATGGGTGGGTGAACTGTGTAGCCCAGGGAGTCACTACTGGTTCCTTTGGAAGTCCCTC^ 

CTCCAGGAGTCCTCCTTCAGGGAACTAAGTTGGGGAGCAGGGCAGCCTATCCTGTGGCCTTATTTCCTGTCTGGTGCAAGCGCCAGGCAAGGCATCC 
AG C CAGGAGCG ACT C AG GC CTGAC CTC CG CTAACTGTT CTGTTTTGTGCTGCACCCTCATCTCAGCAGGG ATCAGGTGGCACTTGGGTC CC C AGGGG 
ATCCAGGGACACAGTGTCC^TGTCCCC^TCTTTCCCCACAGGGACCTGGGGCAGGGCCGAAGAGGAGACCTCGTGATGTGTACAGGTGTAGCTCATC 
CCTCCGAGGACTCAGGGGACACTTGGGGAAGGTGTGACACAGAGCAGGCTGGAGCTCTGTGCGGCTCTTTATTAAGTGTGCCTACAACAGGGGCAGA 
GACACAGCAGCGACAATGATAACAGCCCACTCCCAACCGAAGCTCAATGGTTACCAGACTCAGCCAACTGCAGAATTCCAGTCCATAACGGCACCTC 
CTTTCCTTTCTCAGGAGCTTACCAGAGAGGTCTCACCCCAAGCCTTCACCCCATCTAGAGCAAATGCTTATATCCCCCATGTTTGCTCAGAGCAGAG 
CATGGAGGGACCTTCCTGTCACCAGAGGCAGAGGCATTCTGTATACGTGCATACACGCCTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATTTGGT 
ACAGATCTAAGCATAGAAAATTGAAAAGCATTGAGGTTGGTGATTGAAAGGCTCACTGATGGGCAAGTCGGGTTCGAGTGCGCTAACTGTATGACTC 
TATGGATCCGAATATCATGCTGTTTTGGAAAGGAATGAGGGAGCTACTGTTTCTAAGGAAGAGAAGAATGGGCACCAAGACATAGGGAGGGAAGGTG
GGACAGTAAGCACTTACAGCCGAAGCCCAAGACCTGGAACCCAAGGGGAAAAACAGGTGTGGCCAGGCTTGGTGGCACGCGCCTTTAGTCTCTGAGT
TC^GTCCAGCCAGGATTACATAGCAAAACTTTCCCTCAAGAAAAAAGGTAGGGGCTGGAGAGATGGCTCAGCAGTTAAGAGCTCTTCCGAAGGTCC 
TGAGTT CAAAT C CC AGC AAC CAC ATGGTGGCTCACAACCATCTGT AATGAGATCTGA CGCCC T CTTCTGGTGTGT CTGG AG AC AGCT ACAGTGTACT 
TACATATAATAAATAAATAAATCCTAAAAAAAAAAAAAAAAGAAAAAGAAAAAGAAAAAAGG^ 

CAAAACTAACTACATAGCTGGGGGAGGTGGTGCTTGCCTATAATCCCAGCACTTGGAAGGCTGAAAGAGGAAATCAGGAGTTCC^ 

CTTATGTGAGCCTTTCTCATAAAACCAAGAGCAAAAAAGCAAAACATGAGGTGGTAACTCAGCTGGAGAGTGCTTGTCTAGATTCCCTCAGTGAGGG 

GCTGGGGTGTGACTCAGTGCCTAGAATCCCCCAGGGAGGGGCTGG 

GGGTGTGGCTCAGTAACAGAC TGCTCAC CTAAGATGTT CAAG GCC ATGACTTTC ATT GAAAACATC AC AAAAC AAAC AAAACC ATCAGTGAGGTTG A 
ATGTGTATCC&TGTTTGCTTGTCTCCATTTAAAGAACACCTGGA^ 

ATTTGA ACTGTGTGGCTGTGGGTGT ATT ATCTGT T CC CAAGGACTG CACAT GAAATG AG AAC AATGAAGGCTTCGGTTCTGCTGAG ATGAACCTCAA 
TTCTCCACAGATGATCT CAC C CAGAGCC AGATGG AAGT CCCC C AAC ATTTAAGC CTCTGTTCTGAGTGCTGGACCCAAACTCATCTTAAGGG AAAAC 
CC^IAAGAGCCTTTCTGTGGTTTCCTTCCACAACTGTCTTTCAATGTGTG 

ATGTGTGTGAGCATGTGTGTATATGTATGTGCATGAGTGTGTGTGTGTGTGTGTATACATGTGAGCATGTATGTATAGGCCAGAGGTTAGTGATGGG 
T AACTTTCT CC AGAGCT CTC C ACCTT AGTTTT ATTGTCTGTGTGTGAGCGGCTGTGGGCA CTGTACACCACAGCACATGTGTGGAGGT CAG AGAACA 
ACTT GTGGCTGTCACTTCTCT CCCT CTG TGTGGGT CTAAGACCTGGATCCAGCGCTTTCGCTTGCTTGAGACATCT CGT TGACTAGCT CCTCCGTTG 
TTAACACAGGGTCTCACTATGTAGGTCCAGTTATCTTGGAATTTGCTCTGTAGACCAG^ 

AGAGTACTGGGATTAAAGGTGTGCACCACTACCACCCAGTTTAAAATTCTAAAAAGATTTGTGGGTGCAGGGATTTGAGGAGGCCAGGAGAGACTGT 
TGGATCTCTGGGAACTGGAGTTAGAGGTGGTTGTGAGCCCCCACGGGTGCTGAGAACTGAACTAGGGTGCTTGAGAGGAGCTCACTCACTGCTGAGC 
C CC C ACTTT AG ATTTGAGTC CCTCGTAC TGAAGC TGA GTTCAGTG ATTTGG C TG AGCTG ACCGCT C AGGGTGGGCTTTT CGATGTGC ACC ACAGGCC 
TGGCTTTATAGGTGAGGTCATGCTTGCACAACTGGCCTTTTGGTGACTCAGCCAACTCCCC^GACCTTTTTTCTTTTGCATTTACGTGTGTGTGTGT 
GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTACATGAAGTGAGAGGGGGCAGTCTGGGAAGGGTCAGTGGACAACTTAAGGGAGTTGGTTTTGCCTTC 
CATCATGATGGTCTAGAGACTAAATTCAGCCCCTCAGCCTGGGAGCGTGCCTTTACCCTCCGAGCTCCCAGCTTTGGGTTTTTGAGGCAAGGTTTCA 
TATAGCTTAGGCTGGCCTGGCATTCAAGGTCATCTCAGAGGCCTAGGATGACCTTGTGTTGTCGGTTCTCCGGCCTTTACCTCTGGTGTGCTGGCGT 
TTGCCTTACCACACCCAGGCATCCCCCATAGCTTAAGCCTAAAGCTCTCGGATGGCCAGCACGCCCATGCCCTGACGTGAAGTTCAGTGACTAATCA 
ATACAGAGCGTTCACGGAGCCCTGACACATAGCAGCATTGACTTAACCTTAACAGCCTGGTGTGTAATGAGGCCAACTTGTCCGATTCTGCCCACGA 
TGCGATGCTCTGGGTTAAACAGGTTTGAACAGAGTACCGGATGGCAGGGAGGGCACCCTCAGTACAGGGTTTTGGCTGTTACTGCTTCCAGTCTCCT 
GACCTGCTCCCCCAAGGCTGGTCTTAAACTCAAAGCAATCCTCCTGTCCCTGCTTCTCAAGTTCTGGAATTACAAGTGTGACTCACTGTACCCAGGA 
ATATGTGTCTTTATTACAGAGGAAGGAGAGCTCAGAGAGTGAGAGTAACCTGTGCAAAGTCCCACAGCAGGAGAGGGTGGCTGGCTATCGGCCGTGA 
ACTTGGTGACCTTTCCTGTGACCTCTAAGGGAGGAACTAGATGTCCCAGCAAGGATTCTAGGGACTTGTCAGGACTGACCTTTGATGATGGGAAAGC 
CATGCCATCTCCCCTTCCCCTTTGATGGTCGCTGGAGGGGTTTGCCTGGGACAGAGATGTGTTTGGACACAGATCACCGTGAAGACTGGTCAGTTTC 
AGTACCCTGCCCAGCCTGTGGCTGGGAAGGGCCAGGGGCCCAGATCTTGTGGGCAGCCTGGGGTAGGGCCAGCATTCCAGGGGCCTGCCTGGCATGT 
GGGGAATGTCCCAGGAGAAGTCTCAGTCCTCCAGAAACTCAAAGGGAAGTGTTCTTACGTGGGGTGGTGTGTGGTGGGAGGTGGGAGGTGGTGTTTG 
TGGGAATGTCCCTCTGGTCTGGAACCCTGAGGGGCTGCAGTTAGGAGGTACATTGGGGCTGTGCCATTCTAGGGAGCTGACCTGCTTTTCTGATGTC 
TTCATAAACAGAGACCACTATTCCTGCCTTACCTACCCTGCTTCCTGGTAGACCAGCACCCCGGGGCTTGAGCATCACCACAGTAGGGACTTTTGTG 
GGCCACTGACTTGGGCCACAGTTACCACAATTTATTTAGAGTCAGCAAGGTGGCTGTACTCTATGCTGACCCGGGAATAGAAATCTCGGTTTCTGCT 
CAGGAGGCCCAGGCTACAAGGCCTTGAAAAACCCTGAGGCGGGCGGGGCAGGTAGGGCTCTGCATGCGCCTCTCTAGAAATTAACCTCCTTTCTACC 
CCAGACCCTTCTGGCGTCTCTGGCTTCTTCAAGGACCGGCTCTAAGGCTACTTCCTGGTACACCCTGGACCCTCCCTCCATCCTGAAGGGATCATTA 
TCTTAAACCTGGTCCTTCCCATCGACTGGGTTTGGGGGTAGGGGTGTTTCAGGATGGCCTCAGAGAGCAGGAGAAAGTTGGAGATCAGCCAACTACT 
GCAGAGCACGACTGTGGAGGAACAGTTAAATGGACTAGCAAACACGAACGGGCCAGAGGCAGAAGGACTGCTATGAGTTCGAGTCCAGCCTGAGCCA 
CACAGATACTTCAAGCCTAGTCTGAGTTACAGAGTGAGACCCGGTCTTAAACCCCAGAAAAACACTCCGGGGAAGCGCAGGGGACCGGGCGCGCGCT 
GCAGAGCCCCCTCCCCGGCAGGCCCGGGTAGGGGCGTGGCCACGGTGACGTCATCCTCCTATAAAACCCTGGGCGCCGCCGGGCTGGCTTTGTGGAG 
AACTGCAGCGGGCTAAGCCGTGTTGAACAAAGGAGGTCGGGCACAGCTATCCAAGCTCCCGGGGCCACCGGGCCGCCCTCCGCCACCATGACCGCCA 
ACGGCACGGCAGAGGCTGTGC AGAT T CAGTTCGGG CTC ATCAGCTGCGGCAA C AAGT ACC TGACAG CCG AGGCGT TCGGGTTCAAGGTGAACGC ATC 
CGCTAGTAGCTTGAAAAAGAAGCAGATCTGGACGCTGGAGCAACCTCCCGATGAGGCGGGCAGCGCGGCCGTGTGTCTGCGCAGCCACCTGGGTCGC 
TACCTGGCCGCCGACAAGGACGGCAACGTGACCTGCGAGCGCGAGGTGCCCGACGGCGACTGCCGCTTTCTCGTCGTGGCGCACGACGACGGCCGCT 
GGTCGCTGCAGTCCGAGGCTCACCGGCGCTACTTTGGCGGCACCGAGGACCGCCTGTCCTGCTTCGCGCAGAGCGTGTCGCCGGCCGAGAAGTGGAG 
CGTGCACATCGCCATGCACCCGCAGGTTAACATCTACAGCGTTACCCGCAAGCGCTACGCGCATCTGAGCGCGCGGCCGGCCGACGAGATCGCGGTA 
GACCGCGACGTGCCTTGGGGCGTCGACTCGCTCATCACCTTGGCCTTCCAGGACCAACGCTACAGTGTGCAGACGTCCGACCACCGCTTCCTGCGCC 
ACGACGGGCGCCTTGTGGCACGGCCGGAGCCCGCCACGGGCTTCACGCTGGAGTTCCGCTCCGGCAAGGTGGCCTTTCGCGACTGCGAAGGTCGCTA 
CCTGGCTCCGTCCGGGCCCAGCGGCACCCTCAAGGCTGGCAAGGCCACCAAGGTGGGCAAAGATGAGCTCTTCGCCCTGGAACAGAGCTGCGCTCAG 
GTGGTGCTGCAGGCGGCCAACGAGAGGAACGTGTCCACGCGCCAGGGTGAGTTGGGGACATGTCCCCTCCCTTGGGTCCTGACCCAGTGCACAAAAG 
CATCTCTGCACTCCTTCTATTTCCCTATTTCCTCCTGGGCTCCCTTTAGGCTCACAGGCCTCTGGGCTCCGCCATGGGGAGGTGGGCTTCTATTTTG 
AAAGACCTGATCTCACCAGATAGCCTTGGCTATCCTGGAACTCGCTATGTAGACTGGGCTGGCCCCGAACTCGCAGAAATCTTCCTTCCTGTCTCTT 
GTGCTGGGAATAAAGGCATGTACCACCCCTCTGTTCAGGACGTAGGCTTCTTCTGCTGGGAGAAGGTGACGTGGGGAAGTGGGTAAGGCCTAACACA 
TTCTAAAGCAATACTGTCCACCACCACCCCCCCACCCCCCCACCCCCCCGGCTGGGGGCGTGGGACTCCATAGACCTAGCTTAGATCTTCTCCGTTT 
CCTCTTCCTTCCCAGTCTGAGCCATCCCCATTCATGAGCCGATTCTTTAGGATAGGCTGTCCCAGGTCTTGGAGAGGAAGGGTTCCCCCTACACCCC 
CCCTTGTTCATTTCCTCGTGCGTCCTCTCCCGTTTCGCGGGCCTTTGGCTGGAGGGCCAGGGGACAGTCACCCCTGCCCTCCCCGCCTTAAATGGCG 
AGATAGAGGAGGTTGGAGGGGCTTTTGATCCCGGCGGGGTGTGCTGGGCGGGGTCGGCCCTGCGACTCTTGGTGGTGGTGGGGTTGTCCGTAGGGGC 
TTCCCCTCCCCCACTCCATCTCCCCTCGCTAAGGCCTGGCGCTCCGGGCCGCGGCCTTTGTGAGCAGGGGGGCGGGTCGCCACGGCTGGGCCCTCCC 
TCGTCCCCTTTGTCCTGCTAAGGCTCTGCAGATGGGAAAACCAGATGCGGCGGAGCT 

GCCACCCCCAACCCTCCAGGCGTTGTCCCTGGTCCAGGGTTCATTGAGAAGGCGCGGAGTGAAGCCGCCTGGCTCTTCCTTCGCACCCCGGCTGGCA 
GGCGGCGTGGCGCGGGTGGCTGCGCGCCAGGCGCCCTCCCTGCCCTCCTATGCACGCCTTGCGGTCCGGCACTCGGCGGAGTGTCCCTCCAAAAGCC 
TCTACTCTCGAGTGAACCGGTCCCCAAGGCCTCTCTGGCTGGGAACAACCGTTTG 

GGCCATTTCCGACCGGCATGGGCATGGTACAAGTAGGCGCCTCCTTTCTGGCGTACCCCCGACTCTTCAAAGGGTCTTTGCTGCCTGCTGGTAGGGG 
CTTGTGAGATCTCGCTTCCTTGGTTGGGAAACTGAACCCCAGGCTTTGGAGATAGCTGGATGGGAGCCAGTCTGTGGGGAGGAAGACCCCCCTCCCC 
CAATGCAGAGGGC^AAGCCTTGGGTGGGGCCTGCTTGAGTTTGGCTGAATAACGCTCAGAAAAGTGCGACCATTTGCTTACTGTTAC^ 
AGCACCATGCTGAACTGGGTGTACTTGATGCCAGAATGTTCTCTCATTATACACACACACACAC^ 

CCATGGCCTCTCTTGTTTCACAGTGCTGGGCGAGGGGGTCAGAGTTCCCACCTGTTTAGCAAACATTGAGATACGAGCTGCGGTAAACGTCTCTTG 
GAATGTGACTGGTATGTAAAGCAGAAGATTGGGGGTACAGGATCGGGGTAGTAGATTATAAAGATCTGGGGTCCCTTCTGAGGACCCAGGAGCTGCC 
CCAGGAGAGTGAATAGAAATGAGCATGCACCAAGTTTATGGACTGGTTTTGCAGGCAGGGGCCTCAGCAAAGGCCCGGAGGCAGGGAGTTGCAAATA 
GAGAAGCCCTGCATAGCTAGGAGGAAGGCTGGGATTGGAGGGAATCCTGCAGATGACGGGTACTCCAGAGCCTAGAGAGACCAGCTCTCTGCTTACA
GAGCTGTGGCCGA6GCTTGTGCCCTGGGGAAGGTTAGGCAAGGCACAGGTGACTTGGGCCCTTGGGGCCTGAAAGGAGCTAGAGGGTGTGTTTTGCG
AGCCTGGACAGGTACCCTTTTCTGAGGGTGCCATGATTGTGCATGATTCACACTGATGCGGCAGGAGGGGGCGGGAATTGAAGGGGTGAGAATCTGA 
CGGGAACTTGGGGTCACACACAGTCAGACCTCTTCTGCCCTCCTATGTCTGGGTCC^ 

TGAGGGTGAGCTGTTGGGTGGGTGTGCCGGGGAGGCCATTTTGGGCAGGCTTGCGTGGCAGGGAGGCGGGGCTGGGCTTCGGGGAAACACGTGTTGA 
ACTGCTAGCCAGGAGTAGGGAGAGGGAAGGCTCAATCTGGTCTGGGGTTGTGAATCTTCCATGCAGATTTGCTGCTTCCGGCTTCCTG 
TCCCCATCTGAAAGCACAGAGGTACATTGAGCCAGGATCTTCCTGGGAGGTGGAGATCCTATGAGAGTAGGATGGGGGGTAGTGGGTAGGGTCTTTT 
CCTCCTTGCTTCTCMGACCCTCCC^ 

TCT AATGCTTTCAAAGAATTTTGTCCCCATC ATT TCTAGGGC GGCTGG CTAAGAATGAGTGAC AGGTG AGGCT GGGCTTTGTAGGGTATCCTTTTGA 
CGGCTTCCCTCTTGGGTCGAGGAATGGCTTAGTTTTGAGCACTGTACGACTTTGGGCGGTC 

GTTGTGAGGATTGCTTATTTCGAACAACAACGCTAGGGAGTCTGGCAAACAGCTAAGCCCAGGAAGGGGGTTGCGGGTTAGAGGACCCTCTGAGTC^ 
CCGCAGAGACGCCTAGGAAGCTGGAAGACAGCCACGCCCCCTC 

AAAAAAAAATCCCAC^GAAAACCTGTTACCATAGCAACGCAGCTCCAATATGATTCCAAGGCCAAGTGGGGCCTTTCCCCCTGGCT 
GGCCCATTCTGGAAGGGGGCTAACTTCCCTTTCCGGCTGAGGTGGGAGGGCCCA^ 

ACAGAATTGAGAGGCGATTCTTTTTGGCAAGGGAGGGAAACTGAGGCAGAGAGCGGAGAAGACATTCCCCGAAGGGTGCCTGGAGTCCACCTTTGGT 
GCCTTGGTGGCCTCTTTATTAAAAAGTGGTTGCACAGGCCCCTTTGT^^ 

TCTGGACGTCTCATTTTTCAGAGGTACTGGGAGGTTGAAGGTTGGTGTATGTCCACAGCACTGTGCTGTCTTGAACTCAGTGCGCATGACATCTTGG 

GCCAGGGCCTCTCCTTTGTGTGAGGGAAGCTAGCATGCTGGGAAACTCCGACTCCTTTAGTCCACCCACTTTAGTTGTAGTTTTTGTCTTGTAGCCC 

ATCTGGCTTCCAACTCTCCATCCTCCTGCCTCAGTCTCCAGGGTGCGGCGATGACAGGCATAGGTCACCACAGGCTTCTGGCCTCCACTTGATGGTT 

GCCGGAGTGGCTGGGAGTGGCTTTTCTTTTCTTTTCTTTTCTTTTCTTTACTTTCAGGCAGGGTTTATCTGTGTAGTCCGGCTGTTCTGGAACTATG 

TNNNNNNNNIWNimNIWNNNl^NNNt^N^ 

NNNNNNNNNNNNNNNI^NNNNIWNN^ 



NNNNl^NNNNNNl^NNNNNNNNNI^ 

TCCCTGACTGTTCCATTCCCCGCAGTGGGCACGGTTAGATCAGGTGGGGTCACTTAGGCTCGCTAGCATTGGGTGTGGCTCTTTTCTGCCCAGTATG 
CTGTGCTAACTCAG C TGT CC CTG CACGTGGCAGGTCAATTACAGGTAGG CTGAGGGG AC AGTGGGTGACAC CCTGT ATATGTGTAGCCAATTG TTCT 
CTTAATATTTAGCCATCCCTAGTGTGGGCCAGGGGCaGGAGAGCCATTTOTGCCTGAGAAGGTTGAACAGACTACAGCCCAGAGCTGTGGCCTTGGG 
GTGGCTGTCTAGCCTGGGATGGCCCTCCTACTCTCCTCTGTGCTTCAGTGGTAGATCCTGTGGAGGCACTGGGCCAGGGTCCTTGGGGACCCATCAT 
GGAAGCCATGTCTGCTGATCCATATGTCTAGAAGGGGAGGCATCTGTCCATCCCCACTCCTGAAGTCAAGGGACCCCTTGGTCTTGCAGGGTGACCT
TGGGCCAAGGCCAGCCACCTCTGAGCTCCATCTTCTGCTTTGTGAGGTAGAGGTAAGAGCCTTTGGCATTGACAACACAGATAGAGACTCGGTTGGG 
TC^GGGTGGCATTGGCTGGCATACTCTTGGGTGTAGTGTCAACTAAGGATGAGTTCCTAGGGGCAAGGCTGAAGGTGGCATCTACCATCCCATACCA 
CGTTCTGCTGCCTAAGGACGCCTGAAGACAACTGTGAGACTGGCAGTGTACACTGCTTCCAAGCTGAGGACATTGTTAGAATCCCAGATCCTTGAAA 
CAATGAGTTTATCTTGTTAGGACCCAAGATCCCACTCCTAGGTGGAGTCAAAGCCTTGGGAGCGGCACCTTATTGGGGCTCAGTGAGGTCATGTGAC
ATGGAAACCATGTAGTGTTACTATGTAGCAAGAAGTTAGAAAGCCATGCAACAAACCTGTAATTCCAACTCCTGAGGCAGAGGCAGGTGGATCTCTG
ATTTCAAAGGTAGCTCCAGATACACTGTGAGACCTTCTCAAAACAAGACAAGCAGACTGGAGAGATAGCTCAGAGGTTAAGAGCACTGGCTACTCTT
CTAGAGGTCCTGAGTTCAATTCCCAGCAACCAGTTGGTGGTCACAACCATCTGTAATGGGATTCGATACCCTCTTCTGGGGTGTCTGAGGAGAATGA

CAGTGTAGTCACATACGTAAAATAAATAAATCTTTAAAAAAAGAAAAGAAAAATGAAAAGGGAGGGGGGTGATGGAGACATGGCTCAGACAGTTTTT

TGTTTGTTTTGGCTTTTTGGTGTCrTTTTCTTCTTTCTTTCTTTCTTTCTTTCT^

CTCTCTTTCTCTATCTTTTCTCTTCTTGTCTTGTCTTGTATCTGTGTAGCAGCCCTGG

CTCAGAGATCAGCCTGCCTTTGCCTTCCAATGCTGGGTTAAAGGTGTGCTCCAGCACTGTCCACTCTGGACC^CTTGTTCTTGTATTCACCTAGTTC
TCACAGGGTGGCTCATGACAGTTCCAGGCGACCTGACCTCTTAGGGCACCAGGCACATGTGCGCATATTCATACACACAGGCAAGACGTTCACACAC 
G AT ACAT AC AATAC AAGACG TATCT TTCTAAAAATTA AAGAAAAT AAATGAGAAGCG AG AGGGAAAGG AGGGAG AG CCACTGTTTGGACTGGA CCA C 
CACTGGCAGGGAAATGCCAGAAGGTTCAGACAGCATTCAGGGTTTGTGGCTGGTGGGGGCAGGGATAAAGGAGGAGTTTTCTAAAGGAACATGTGGT 
CTGTGCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTACTAGGGGTAGACTCCAGGGCCTTCCACCATGCTGGGCAAACAATTTA 
C ACTGAGCTAT ATTTTCGAC CCC CT CTTTATTTT ATTTTTATTTT ATT TTA TATATAGCTCTG ACTGG CCTAG AACTCATTATGT AAACT AAGGTGA 
CCTCAAACTCACCCATACTCAAATGAGATTCACCTGCCTCTGCCTCCTGA 

AAAAAAAAAGCCTATTGCTTTTGTTTTTATTTTTGTCTGTGGAGGGTGGTGCCCACACCCAGTGGTGCCAGTTTGGAATCAGTTCTCCTGCCCTATT 
CTACCTAGGGATCGAACGCAGGTGGGTAGGTTTGGCAGCAGCTGCTTTTACCCCACGAGCCATCTCGCTGGCCTGTAGCCCTTTGTGTGTCTGTCTG 
TCTGTCTGTCTGTCTGCATGGAGTGGAGTCTCATGTAGCTCAGGCTGGCCTGCGAGTTATGATCTTCCTGACCCTGGTATCCAAGTGCGGAGACTGC 
AGGCTCACTCTGCCACGCCCTTTTACACAGTTCTGGGATGGACTCTGCATGTCAAGTGAGTTATAGCCACGGCCCTATTTTATTTTATTATTTTTGA 
GATCGGGCCCCACTCATCATGGAGGCTAAGGTCGGCCTCCTGAGTGGTGGGATGACAGGCGTGGGT^ATGTGCCTGGCTCCTACTCTATTCTTTTA 
CTTTCACACATACC CACTAC AGG AAGCG AGTAGCGAGCCC CCT TTT AC C AGC AC CCCGCCATGTG ACATTCCG CAGAACC AGCGCCATCT GTGTGTT 
TATTCAATGGTTTTGGTTGTATTTAAAGCAAATTCCCCACACTGGTGCTTCTTA 

TTACATAACGGCAGCGTTGCCTAACTGGAACCGATCTTTAAGACCCAATCCAGGCCCACGGGAGAGTTCTGCTAGGCCTTAGAACATCTTTGTGGGT 
GGTTCGTTTGAAGTTTGATGGCTTCCTTCCTCCCCTGCTCCCCTTTCTCCTT^ 

CTAAATCCCCAGATTTTTTTTTTTTTAAGTTTTTTTTCGAGACAGGGTTTCTCTGTGTAGCCCTGGCTGTCCTGG^CTCACTCTGTAGACCAGGCT 

GGCCTCGAACTCAGAAATCCACCTGCCTCTGCCTCCCGAGTGCTGGGATTAATGTCTTATGTATCCCAGACTGGTCTTGAACTTGTGGTCATGCCTG 

GAGTATCCTATTTTGATTGTTTATGTGTATACCCCTAATATGGATGTCAGAGATTC^TATC^TATGTGCCGTCTTTGGTCATCCTCCATACrTTTGG 

GGGATGTATGGGGGCAGGGATGGAGTCTAGCCTGGGCTGTGGTAACACACGCTTTTCATTCCAGCATTAAAGCAGAGATAGGTCGATCTCTGAGTTC 

TAGGACAGCCTGGGCTACACAGGGAAATTCTGTCTCAAAAAAAAAAAAAAACAAAAAGAAAGAAAGAAAGGAAAAGAAATAAGAGTCT 

GCCCTGGCTGTCCTAGAACTTATTATGTAGACCACGCTGGTCTTGGACTCACAACAGAGATCCATCTGCCTCTGCCTCC 

TTGGAGACAAGAATCGCTC^GTGGCCCGGATTCAGGGATTCAGCTAAGCTGG 

ACAGGATTATCGGCTCATGCCACCGCGCCTGGCTCTTACACCATTCCTAGCGATCTGAATTTAATACTCTATGGTCAAGCCATCCTTCAATGCTCCC 



CCTGGCTAGCCTTGGCTGCTGATTTGGTAGCCTGGTCCCGCCCACTGCTTAGGATTATCACTGAGGAGTTTAAGCCACACTTCAGGAAGCCCTGTGG 
ATCCTGGAATCTGAGTGAGACAAGGGTCACACAGTGGCCTGACCGGCTCTGGTCTCTCTCTGCAGGAATGGACCTGTCAGCCAATCAGGATGAAGAG 
ACCGATCAGGAGACCTTCCAGCTGGAGATCGACCGCGACACAAGAAAGTGTGCCTTTCGCACCCACACGGGCAAGTACTGGACACTGACGGCGACCG 
GAGGTGTGCAATCCACTGCGTCCACCAAGTGAGTGACACCCTACACCCCTTCATCACCTGGCCTGGCTCTTCCCCCAGGACTGGGCAGCCTGCTCGA 
TGCCCCCACGTTGCCAGCCCCTCTTCTCTTCCCCAGGAACGCCAGCTGCTACTTTGACATCGAGTGGTGTGACCGCCGGATCACTCTGAGAGCCTCC
AACGGCAAGTTTGTGACCGCCAAGAAAAATGGCCAGCTGGCCGCCTCGGTGGAGACAGCAGGTAGTCACTTGGGGGCACGTACCCTAA6CCTGTTTC
CCTAGTACCCCGTGGTCAACCATCAGTCCCACCTGGACCTCTCTGTGTGTTCAGGGAATCCCTGTAAGCTGGTGTACCCTCAGGCCAGCAGCCTGTG 
ACCCTTAGCTTCTTGGTATCCCTCTCTGCTGAGCCATTCCCTGACTGGCCCATCTTTGTTGCTGTGAAGCTCACTGGCTCCCTTTCCCTGTGGCAGG 
GGACTCGGAACTCTTCCTCATGAAGCTGATTAACCGCCCCATCATCGTGTTCCGGGGGGAACACGGGTTCATTGGCTGCCGCAAGGTCACGGGCACT 
CTGGATGCCAACCGTTCCAGTTACGATGTCTTCCAGTTGGAATTC^TGACGGCGCCTACAACA^ 

GGC CATTCMAGCCGACATTAGGGAACGGGTGTCCTATGACCGCCTGGGGTAGCCCCTCCCCCCTTGTCTTAGAACTTTCCTGGCCCAACCTGGGCA 

GACAGCGAAGGTGGGAAGCAGCTAGGGAGAGTGGTCGTGGCTCCAATCTGGGAATCX3GAGACAGGA 

GACAGAACCCAGGGCTTTGAGCTTATTAGGCAAGCATTCTACTGCTGAGTTAAATCCCCAAC^ 

GAAAAAGAAGGTGGAGCCTGGTGGTGGTGGTGGCGGCGGCGGCGGCACACGCCTTTAGTCCCAGCACTCAAGAGG 

TTCTTGCCAGCCTGCTCTACAGAATACATTCTAAGACAGC 

GAAGATAGAAGATAAGGATATTCCTTCTTAGCTAGCTGTGGGACCAAAACCAGTGAGAGGACACAGCAATCCAGAGGTGTCAGTCAGAATCAGAGCC 

CTGTGCTGTGTGTGGCCTTAGCAAGGCAAACGGGACTCCTTTCTTACAGTGCTGAGGACTGGACTCA^ 

TCTAGGCAGGGGCTCTACCACTGAGCCACGCCCCCAGCCCCTCACTGGGGGATTCT^ 

CTGGGGGATTCTAGGCAGGGGCTCTACCACTGAGCCACACCCCCAGCCCCTCACTGGGGGATTCTAGGCAGGGGCTCTGCTGCTGAT^ 

TTTGTTTGTTTACTTGTTTGTTTGTTTGTCTTATTGTGGCCCTAGCTGTCCTAGAACTTGCTTTGTAGACCAGAGATATACCTGTCTCTGCCTCCCA 

AATGTTGGATTTAAAGGCATGTGCCACCATGCCCCACCCTAGCCTAGCCCTCTTATGTTTTGTAGTTTGAGATGGCATGTCAGTAGATTTCCCAGGA 

TGGCCTTGAATCTAGGTAATCCTCCTGCCTCAGTTTCTTGAGTATTTGAATTATGGGACTGTACCTTACATCCAGCTGAGCCCACTCAGTTCTGGAA 

GCT GAC AGCGGGAGGGAGTG TTAAT CCT TTGGGG ACTGACAAGGT GGGGTTAACATGTAGTCTCCCTCCTCAGACT CCACGGGCAAGT ACTGG ACGG 

TGGGTAGTGATTCCTCGGTCACCAGCAGCAGCGACACCCCTGTGGATTTCTTCCTTGAGTTCTGTGACTACAATAAGGTGGCTCTCAAGGTGGGCGG 

CCGCTACCTGAAGGGGGACCACGCTGGGGTCCTGAAGGCCTGCGCGGAGACTATCGACCCCGCCTCACTCTGGGAGTACTAGGGCCACCTGCCCTCT 

GCAGGCCGCTCTCGTCAGTCCCTCCTGTTATCCTTACTCATCGGGTGGCCCTGCAGCAGGTGGCAAACCCCTTGCCTTTCAAACTGGAAACCCAAGA 

GAAAACGGTGCCCTTGCTGTCACCCTCTGTGGACCCCTTTTCCCTAACTCACTGCTCCCCATGGGTCGGTGGCTGCAGACTGTCCCCAGGAGGGACT 

CTGGTTCCCTCTGTCCGCTTCTTTCCATGGGGAACTCTGGCACCTTTCTTCTGACCTCAGTCAACTCTGAGCCTTATTTCCCCCCAGGAAGTGGCCT 

AGGAGAAGCTACAGGGCCTAGGGACTTACCCTGAGCTTGTAACTGGAAGACCCCGTCCCTATCCCCGCTCCCGCCCCCACCCCACCCCACCCCTGCT 

CTGGCCCCAGCCTCTGGAGGCCAGCCTTTTGGCGGGACTGAAGCCGGGCATGGCCAACCTTGCCCACAAGTGTTTTTCTGGATCTTGGCTGGAAGGC 

AGTCTGTCCCATCCTGCAGTGTTTGGGCCTGGCTCTTTGACTCAAAGCTAGCTAGGTGGCACTCCGTGTCGCTCCTGCACATTCTGGAAGGGGCGGG 

TGTTGAAAGGCCCTAGGCAAGCCTGCCCTGTTGGTTGTAGTCGTGAGTGGTCTTGGGGGGAGATGCTTGGCTCCTGTCCCTGCCTCCCCAGCGGGTT 
CCCTCCCTCCCTCCTGCCTGACCACCCCAGCTCTGGCTCTGTGATTGGTGCTCCACGTCTTCCCAGACACCTCGGGGCTCCTGGGCGGGAGAAAGCC 
GGATGTGCCCCTCCCTGGGAGCCCTGGAGTAAACCTCAGGGGGCCCTTTCCCAATCACCCCCTTCCACCGACCCCTCAACACCATGCATCTCACTCT 
GGGTGTCTCGCTCCTTTATTTTTTTGTAACTGTCATTTCTATAACTCTGAAGACCCATGATAGTAAGCTTTGAACTGGAAAATAAAGTAAAATCAAG 
TCTGCGGCCCGTGTGTGTCTCAGGGGAAGTGGATGCTGGAGTGGGCAGAGGGCCGGCTGGGAGGGAGGCAGCGCTGATTGATTGGCAGCCCGCAGAC 
TCTTGTTTCGGTCTCGGACCCTAGCGGCTTCAGCATATGCCTTAGTAACTCTGGACTGAGAAATGGCTGACACGAGGTTCCCGAGCCTTACAAGCTG 
GGCAGGCTAGAGGGTTCAGCAGGTAAAGTTGTGTGCTGTCAAGCTTGATCAGACTGAGTCCCCAGCACCGTGTGAAAAGTTGGGCACAACGGTGTAC 
ACCCACCCCAGGGCCTCTGGCAAGCACAGAGACAAGAGGATCCTTGGAGCCTGCTGGCGGCCCAACCTTGCCCAATCTGTGATTCCCATTGAGTGAG 
CCATAAGGTGGAGATGGGTTTTGGAAGATACCCCACATAGGCCTGCGGCCCCCCACACCCGGAAATAAAAACAAGCCAGGCTTGTCATCCCAGTTAT 
TCGGGGAGCTGAGGCAGGAGGATCACAGATTTAAGACTGCCCAGGTTGAAGAGTAAGTTCAAGGGCAGCCTGGACATCTTGCTAGATTGTTTTTTTG 
TTGGTTGGTTGGTTTTTGTTTTGTTCTTTTTTTCGAGACAGGGTTTCTCTGTGTAGCCCTGGCTGTCCTGGAACTCACTCTGTTGACCAGGCTGGCC 
TCG AACTCAGAAATCCG CCTGCCTTTGC CTC CCG AGTGCT AG GATT AA AGGTGTG CGCCACCATGC CAGGCTT AGATGTTTT AAATGAAG GGCTGTG 
GAGTGTGGTTCACGGGTAGCCTCGCTCAGACAGGCTGGGGCAGGAGAAATGGATGGCTCAGCATTGGCTGCTGCCTTAGGGTTTCCATTGCTGTGAA 
CAGACACCATGACCTGGCAGCTCTTCTGAAGGACATTGGACTGGGGCTGGCTTACAGGTTCAT 

AGGGGCTCAAGTCTACGACCCGGGTCTCAAAGCCCACCCCCACAGTGACAGACTTTCTCCAACAAGGCCACACCCACTCCATCAAGGCCACACCTCC 

TAGTAGTGCCACTCCTTGAGCCAAGCTTATTCAAACCACCAATCACAACTGCTTTTCTAGAGGACCCAGGTTCAATTCCCAGCATCCACATGACAGC 

TTACAATTGTAACTACAATTCCAGGGGATCCAACACCCTTACAC^GGCACACTTACAGGAAAAGCACCAATCAGTGCACATAAAAATACA 

TTAGAGAGAAGAGACCATTAGCTACCTTCACCCTTAACAGGCAAGCTGAGGTCCTGGGAGTGTGTTAGAGGTCCGCAGCTTGGAGTCTATAGGGTAG 

TGG C AT CGT CT ATACGTTCCTTC CT TGACAG GTAGGT ACTTAGCT GAG ACCTCTGGGGTATCCCAGAGACCCATTTTAC AG AGGGAGGC ACTG AGGC 

GGAGGGAAAAGACAACCTTGTTTGCTGGCTCAGAAGAGTGCCAGTGGGGTACCCACCCACAAGGCTCAGCATCCCGAGGCACTGTGGAGGCAGTTAC 

AAAGCCGCATTGATTGGCTGCTGATGACGAAATGAAGAGCCCCGCGGTGTGATTGGCGGTCCCAGAGGGAAGGCATGTGGGGAGATTGACAGGGGCA 

GATGGCCACAAAGGGTCCAGTGACTTCCCGCAGGGTGGAAGCTCTGGGCCCAAGCCAGTAAAGTGTCAATGACATGGAGAGACAGATGAATCCTTCC 

CAGTTGAATTCTCAGGCTCTGGTGGGGCCGCGCTGGGGAGAAAAGACACCCTGTCCGTTACAACCACCAAGCGCAGCCAGGCTACGTGATGACTGGG 

GGAAGTCCCAGACGTGACTTCCTGCAGCACTGGAGGGGAGGGTCCGACGGAGGAGGCTGAGAGTCTGTTGGAGGCAGCCGAGGATCGGAGGGAAGAG 

GCACAGGTCTTACATGGGAGGCATAAAGTCTGGTGGGAGTCCAAGGGAGGACGGATGTAGGGTGATGGGAAGAAAGGACCTAATCCAGAAAAAGGAA 

CAGAAGAGCCACCTACXrTGGTTAGGGGACGCTGGCCCTCCGCAGGGAAAGCTGTCAAAGAAAGGCCTGGCTAATGAGACCATGTCGTGGGGGGCCCA 

GGC AACTGTGG AGG CATCAGACACC TTGGGG CAAGGAGGGCA CCC ATG AG CTAGTACAC ACCACTAAGTATCTCAG AGTCCTGCACACAC ACT CATT 

GACACAGGTGGGTGAGCAGCACCCACGGGCAGCTGAAGCTGAGTATCCTGCCC 

ATGAATATGGCTTTTTTTGTTTGTTTGTTTGGTTGGTTGGGTTTTTGACTTGGTTTGGTTTTTCCA 

CCAGCTTTGTAAATCTGGTTTTTGTGTGTGAACCTATGAATTCCACTCCTGTTGCACTGATGGGAAATTGAGGO^GTCTCAAAGAAGCTGACAGC 
ACAGGCTAGTGTGTTCATTTTCTCTCTACAAAAAGAGGAAAAATTGGGGCTGGTGAGAT^ 

AGGT CC CGAGTT CAAAT CCC AGC AACCAC ATGGTGGCTCACAACC ATCTATAATGAGAT CTGAGACCCT CTT CTGGGGTATCTGAAG AC AGCTACAG 
TGTACTTAGATATAATAAATAAATAAATCTTTAAAAAAAGGAAAAATTATTTATTATTTTATTATTAAATTAATTGTTGTATTTATTTATTTTATTA 
ATCTATATTTAATTTATTTTTTTATTTTGTGTGCATGAGTGTTTTGCCTGCATGTGTGTGCAC^ 

AAGGCGTTGGGTCCGGGAACTGGGGTTGTATGGTTGTTAGCCGCCGCGTTCATGCTGGTTCTCTGCGAGGCGGCACTTAGTCCTGAGCCATCTTCTG 
TCAGCTCCTCTACTTTCTTTTAGACAGGGTCTCCTGTAGCTCAGGCTGGGCTTCAGCTTGCCTGGTGGTTAAGGGTGACTAAGTTAAATCATCTTAC 
ATTTCTGCCTCACTTCCACCTCCTGAGTGCTGGGCTTGCAGGCATGTGCCACTGGGATGGGTTTTGTTTCCTGCTGGCTTCTCACCCCGTAGCTCCG 
TGTGTTGGGTCAGGGCCAGCCCTGTGCCCACTGAGCCATTAACCCTACTGAGCCCACTGTAGGGAAAAACCATAGAGGAGCTTCTATGACCCTCATG 
GGTCCCTCCAGGTCTGTCAGAGTTCAGGAGGCATGGATCTCTCTGGC^GG^^ 

GCACCCTGTCGACGACCTGCTTCTGACTCCTCCTGTGAGGCTCTTTTGGAGTGATCTTAGTCCCCACAGGGCCCCCATCTTCTCCAGAGCTGAGTGG 
CCTCTCCTACTGGACAGAGAAGAGCTGTCCTGACTTGGTTAAAATGTTGAATGCAAGGAGCGGAGGAAACGGCCCAGCTGGTGAAGTGTTTGGCTCG 
ATC C CC AGAGC TGTGGAGTTCCC AACAGGATGGGGCTCACTGGCTGCC CTG ATC AGCAAGCT CAG AAACCAAG ATGG ATAGCTCCT AAGG ACT GGAA 
TTGGAGGATGACCCCTGGTGACAGACACACTTGCATATGCATCTGTCTGCACACATACACATATGCATGTAAAACCCTGATCCACGTGGTAGTGCCT 
GACTGTAATCCCAGTACTTGGGAGACAGGAACAGAGAATCACCTTCAGTTTGAGGCCAACCTCGTCTACATATCCAGCACCTTAGGAGACTCTGTTT
CCAACTCTTCAGCan'GGCTCACATCTCTAA^

GCCTCCAGAGGCAGAGGCAAGCAGATTTCTGAGTTCX3AGGCCAGCCTCTACAGAGTGAGTTCCAGGACAGCCAGGCAGCCAGGGCTATACAGAGCAA 
C CCTGTCTCAAACAACAA CAATAAC AAC AAC AAC ACAACAAG AAAACT TGAGTGTGCGTGTG CGTGTGCGTGT GCGTGT GTGTGTGTGTGTGTGTGT 
GTG TGTGTTTACTATTTACAGTAAACGAGTCTTTGGGG AGGCAGCCAAAGTAGGCGAGGAA CTC CTTGTC C CCTGCTGGGCTCTCTGGGTTG AGATC 
TTGTCTTGATGGGGAGATAGGAAGGGATTCTCTGTCCCTCAGGGTGCTATCAGCAGTGGGAACAGATGGGGTCGGGAGAGAGACTGGGGGAGGGGAG 
AATCAGGGTGGAGGAGGGCTGTGATTTAGGTTTACCIATGAAACCGGAGTCATC^ 

TTCCCTAGGACCCAGGCCTGATCCGGTGGGGAGGGGAATGATTGGAAAATGCTGCCTTCTAGAGTCCCTAGCGAGAGCCACACCCCAGGCCTCAGCC 
ATCAATGACAACCTTTCTGTTCTCACTGAAAGCAATTAAAGATCTG 

CCCTGGGACCCACATGGTGGAAGGAAAGAATGGACTGATGCCCTTGGACCCCCACGTGTGTGCTGTGGCACACACACACACGCATGCACACAAAAAT 

AAGTAATGTAGTTCAAAATTTAGAATAAATCAGGCGTGGTGGTGCACACCTTTAATCCCAGTACTCGGGAGGCAGAAGCAAGTAGATCTCTGTGAGT 

TTGAGACCAGCCTGGTCTACAGAGTGAGTTTCAAGCCAGCCATAGATAGCTACATAGCCAGATCTTGTGTTAGCCTCCTCCTTCTACCAACAGAAAT 

TAAGATCACAAATAAACAGGAAAGAGGGTGGAAGAGACAACTCAGTGGCTATGAACATTTGCTGTTCTTGCAGAGAAACCAGGATTGAGTCCCGGCA 

CCCAC^TAGTGGGTAAACATTATCTCTAGCTCAAATTCCAGAGGACCTGGCTTCTGAGAGCaCGGTACACATGGGGTGTTCTCTCTCTCT 

TCTCACAC&CACACACACACAC^CACACACAC 

CCAAGGGGATT CAACG ATGTCATGAGTCCACAGAAATC C TATCCTCC CCTTTTTCGGG GTGTTACACGGGCGGATCAAACCCAGGGNNNNNNNNNNN 

N^NNNNNNNNN^Nl^NNNNNNNNmWNNNNNNN^ 

NIOTNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNIWNNN^ 
NJ^NIWNNNNNNNNNNNNNNNNNNNNNN^

NIMNNNNNNNNNNNNNN^NNNNNN^
NNNNIWNNNNNNNNNNNNNNNNNNNNIWN
NNNNNNNNNNNNNNNNNNNNNNNI^NNNN^
NNNNNNNNNNNNlINNNNNNNNlSrN
Nl^NNNNNNNNNNNNNNNNNNlTOl^NNNNNNN

KNNNNNNNNNNNNNNl^NNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNN2JNNNIWNNNNNNN]OTNNN^

NNNNNNNNNJWNIWNNNNNNN1WNNNNNN

NI^NNNNNNNNNNNNNJWNNNNNNNNNNNN^

NNNTSNNNl^NNNNNNNNNNNNN^

NNNNNNNNNNNNNNN^NNNNNNNNNNNN

NI^NNNNNNNI^NNNNNNI^NNNNNNNN^

NWNNNNl^NNNNNNNNNNNNNNNNNNNN^
NNNN^NNN^NITONNNNNNNimNNNNNNNNNNN^
NNNNNNNNNNNNNNNNNNNITONNNNNNNNNNNt^
NNNNNNNNNNNN2WNIWNmN2WNNN^NNNNNN^
NNNNNNNNNNNNNNNNNNMJNNN^NNNNNN^
NNNNNNNNNNNNNNNNNNNNNNNNNNIWN1W
NliHtfNNNNNNNNNNNNNNNNNNNN

NJSn^NNNNl^NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGACTGCTCTTCCAAGGTCTTGAGTTCAATCCCAGCAACCACATGGTGGCTCAC
AACCATCCGTAATGAGATCTGACGCCCTCTTCTGGTGTGTCCGAAGACAGCTACAGTGTACTTACATATAATAAATAAATAAATCTTAAAAAAAAAA 
GAA AGAAAG AAAGAAAG AAAGAG AGAGAG AAAGA AAG AAAGAGGTGAGGG C TGGGTGTGGTGGCG C AC ACCTTT AATCC C AG CACTTGGG AGGC AGA 
GGCAGGTGGATCTCTGTGAGTCTGAGGCCAGCTTGGTCTACAGAGTGAGTTCCAGGACAGCCAGGGCTACACAGAGAAACCCTGTCTTGAAAGAAAG 
AAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGGAAGGAAGGAAGGAAGGAAGAGGTGAATAG 
AAAATGAGGTCCAATAGGTTGGGGGCATGGCTCAGTGTTAGAGGGCTTGCCTAGCAAGTGAGAGGCCTGGGTTCAACCTCTGGCACTGAAGAGGGAT 
GGTAGGTAGGTAGGTAGGTAGGTAGGTAGGTATCTAGATAGATAGATGATAAATAGGTAGATGGATGGTAGATATATAGATAATAGATGTGGGATAG 
ATAGATGATAGAGATATACATAGATGATAGATTATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGACAGACAAATGGTAGG 
TATATAGATGATAGAAAGATGAGGGATAAATAGATGGTAGATAAGTAGATGATAGTAGGTAGATGAGAGGGAGAGAGAGAGAGAGCGAGAGAGAGCG 
AGAAAGGATATAATAAGGTCATTAGGCTAATTTGATCCAAAGACAGATTGGGGTTCTTATAAGACAAGTGATCAAGCAGGCCAGATATGGGGGGAAC 
TGATCTGTAATTCTAGCACCTGGGGTGGGAGATGGAGGCAGGAGGAAGGGGAGGTCGAGGGTATCCGCAGATACTTAGTGAGTTTGAGGCTATCCCC 
AGCCACGTGAGACTCTGGCAAATCCTTCCGCACTAACAACAAAAATTCTGAAAGGCAAACCTCCCTTAACTCATAAAGCATGAATTGGGGCGTATGC 
GGGCCTTCAGAGCCATGTGGTGTGTTTGGGAACAGACGACAGGAAACACTGTTGAGAGGACTCAGACCCAGAGAGGAAGGGACAGCAGAGTCACCAT 
GTGGACAGGCGTTGGGGCTTGCTGGGCCGCTCACAGGAAGCAGCGC^CACCAGAGCCATCCGCCTAGGAGCTGGGCCACAGTTTCCTGCAGAGTGGG 
CCCTGACACACCCGGCCAAAGCAAGCACTGGCCTGTGGACAGAAAGCAGACACAGGACGGAGCCCGTTCCGGGGTCAGCTCACTGTACTTGAAGATG 
TGAGCCCAAAGAAGTGGGGGGTGCCACGGCGTCTTTTGCAGATGTTACTATGCCAGTTGGTGTTAGACACTTAGATATATTTTTAAGAAGAGCAGTC 
GGTACTCTTAACTACTGAACCATCTCTCCAGCCCCACAATGACATTCTTCAAAGAGCAGAGCTTAGAGGCACAGGCACTTGTCACCAAGCAGC^ 
TGAGTCAGATCCCTGAAAATCC^C^GGCAGAAGGAGAGAGCTGACTCCTGTCCGTTGACCTTCATGTGAATACAGTGGTGTGTGGTGGTGTGTGTGT 
GTGTGTGTGTGTGTGTGTGTGTGTGTGTGCATTTATGCCATGTGTGTCAAGAGTGTGAGGAGGACCCAAGAGAGTTCCTAGACTTAAAGCTGGTTNN 

NNNNNNNNN^NNNNNNNNNNNNN^ 
NIINNNNNNNNNNNNNZWNNNNNNIOTNKN^ 

MOUSE SEQUENCE - mRNA 

GAGCTAAGCOSTGTTGAACAAAGGAGGTCGGGCACAGCTATCCAAGCTCCCGGGGCC^^ 

CAGAGGCTGTGCAGATTCAGTTCGGGCTC^TCAGCTGCGGCAACAAGTACCTGACAGCCGAGGCGTTCGGGTTCAAGGTGAACGCATC 

CTTGAAAAAGAAGCAGATCTGGACGCTGGAGCAACCTCCCGATGAGGCGGGCAGCGCGGCCGTGTGTCTGCGCACGCACCTGGGTCGCTACCTGGCC 

GCCGACAAGGACGGCAACGTGACCTGCGAGCGCGAGGTGCCCGACGGCGACTGCCGCTTTCTCGTCGTGGCGCACGACGACGGCCGCTGGTCGCTGC 

AGTCCGAGGCTCACCGGCGCTACTTTGGCGGCACCGAGGACCGCCTGTCCTGCTTCGCGCAGAGCGTGTCGCCGGCCGAGAAGTGGAGCGTGCACAT 

CGCCATGCACCCGCAGGTTAACATCTACAGCGTTACCCGCAAGCGCTACGCGCATCTGAGCGCGCGGCCGGCCGACGAGATCGCGGTAGACCGCGAC
GTGCCTTGGGGCGTCGACTCGCTCATCACCTTGGCCTTCCAGGACCAACGCTACAGTGTGCAGACGTCCGACCACCGCTTCCTGCGCCACGACGGGC
GCCTTGTGGCACGGCCGGAGCCCGCCACGGGCTTCACGCTGGAGTTCCGCTCCGGCAAGGTGGCCTTTCGCGACTGCGAAGGTCGCTACCTGGCTCC 
GTCCGGGCCCAGCGGCACCCTCAAGGCTGGCAAGGCCACCAAGGTGGGCAAAGATGAGCTCTTCGCCCTGGAACAGAGCTGCGCTGAGGTGGTGCTG 
CAGGCGGCCAACGAGGGGAACGTGTCCACGCGCCAGGGAATGGACCTGTCAGCCAATCAGGATGAAGAGACCGATCAGGAGACCTTCCAGCTGGAGA 
TCGACCGCGACACAAGAAAGTGTGCCTTTCGCACCCACACGG 

GAACGCCAGCTGCTACTTTGACATCGAGTGGTGTGACCGCCGGATCACTCTGAGAGCCTCCAACGGCAAGTTTGTGACCGCCAAGAAAAATGGCCAC 
GTGGCCGCCTCGGTGGAGACAGCAGGGGACTCGGAACTCTTCCTCATGAAGCTGATTAACCGCCCCATCATTGCGTTCCGGGGGGAACACGGGTTCA 
TTGCGTGCCGCAAGGTCACGGGCACTCTGGATGCCAACCGTTCCAGTTACGATGTCTTCCAGTTGGAATTCAATGACGGCGCCTAC^CATCAAAGA 
CTCCACGGGCAAGTACTGGACGGTGGGTAGTGATTCCTCGGTCACCAGCAGCAGCGACACCCC^ 

AAGGTGGCTCTCAAGGTGGGCGGCCGCTACCTGAAGGGGGACCACGCTGGGGTCCTGAAGGCCTGCGCGGAGACTATCGACCCCGCCTCACTCTGGG 
AGTACTAGGGCCACCTGCCTCTGCAGCCGCTCTCGTCAGTCCTCCTGTTA^ 

TCAAACTGGAAACCCAAGAGAAAACGGTGCCCTTGCTGTCACCCTCTGTGGACCCCTTTTCCCTAACTCACTGCTCCCCATGGGTCGGTGGCTG 

ACTGTCCCCAGGAGGACTCTGGTTCCCTCTGTCCCCTTCTTTCCATGGGGAACTCTGGCACCTTTCTTCTGACCTCAGTCAACTCTGAGCCTTATTT 

CCCCCCAGGAAGTGGCCTAGGAGAAGCTACAGGGCCTAGGGACTTACCCTGAGCTTGTAACTGGAAGACCCCGTCCCTATCCCCGCTCCCGCCCCCA 

CCCCACCCCACCCCTGCTCTGGCCCCAGCCTCTGGAGGCCAGCCTTTTGGCGGGACTGAAGCCGGGCATGGCCAACCTTGCCCACAAGTGTTTTTCT 

GGATCTTGGCTGGAAGGCAGTCTGTCCCATCCTGCAGTGTTTGGGCCTGGCTCTTTGA 

CATTCTGGAAGGGGCGGGCCTCTCACCCACCTCATTCCTTTTCCCCCTGGCCTGAC^ 

TTTAAGGAAATGTTACTGTTGAAAGGCCCTAGGCAAGCCTGCCCTGTTGGTTGTAGTCGTGAGTGGTCTTGGGGGGAGATGCTTGGCTCCTGTCCCT 
GCCTCCCCAGCGGTTCCCTCCCTCCCTCCTGCCTGACCACCCCAGCTCTGGCTCTGTGATTGGTGCTCCACGTCTCCAGAC^CCTCGGGGCTCCTGG 
GCGGAGAAAGCCGATGTGCCCCTCCCTGGGAGCCCTGAGTAAACCTCAGGGGGCCCTTTCCCAATCACCCCTCCACCGACCCCTCAACACCATGCAT 

GTAAAATCAAGTCTG 



MOUSE SEQUENCE - CODING 

ATGACCGCCAACGGCACGGCAGAGGCTGTGCAGATTCAGTTCGGGCTCATCAGCTGCGGCAACAAGTACCTGACAGCCGAGGCGTTCGGGTTCAAGG
TGAACGCATCCGCTAGTAGCTTGAAAAAGAAGCAGATCTGGACGCTGGAGCAACCTCCCGATGAGGCGGGCAGCGCGGCCGTGTGTCTGCGCACGCA 
CCTGGGTCGCTACCTGGCCGCCGACAAGGACGGCAACGTGACCTGCGAGCGCGAGGTGCCCGACGGCGACTGCCGCTTTCTCGTCGTGGCGCACGAC
GACGGCCGCTGGTCGCTGCAGTCCGAGGCTCACCGGCGCTACTTTGGCGGCACCGAGGACCGCCTGTCCTGCTTCGCGCAGAGCGTGTCGCCGGCCG
AGAAGTGGAGCGTGCACATCGCCATGCACCCGCAGGTTAACATCTACAGCGTTACCCGCAAGCGCTACGCGCATCTGAGCGCGCGGCCGGCCGACGA
GATCGCGGTAGACCGCGACGTGCCTTGGGGCGTCGACTCGCTCATCACCTTGGCCTTCCAGGACCAACGCTACAGTGTGCAGACGTCCGACCACCGC
TTCCTGCGCCACGACGGGCGCCTTGTGGCACGGCCGGAGCCCGCCACGGGCTTCACGCTGGAGTTCCGCTCCGGCAAGGTGGCCTTTCGCGACTGCG
AAGGTCGCTACCTGGCTCCGTCCGGGCCCAGCGGCACCCTCAAGGCTGGCAAGGCCACCAAGGTGGGCAAAGATGAGCTCTTCGCCCTGGAACAGAG
CTGCGCTGAGGTGGTGCTGCAGGCGGCCAACGAGGGGAACGTGTCCACGCGCCAGGGAATGGACCTGTCAGCCAATCAGGATGAAGAGACCGATCAG 
GAGACC TTC CAGCTGGAGATCGACCGCGA CA CAAGAAAGTGT GCC TTT CGC ACCCAC ACGGGCAA GTAC TGGACACTGACGGCGACCGGAGGTGTGC 
AATCCACTGCGTCCACCAAGAACGCCAGCTGCTACTTTGACATCGAGTGGTGTGACCGCCGGATCACTCTGAGAGCCTCCAACGGCAAGTTTGTGAC
CGCCAAGAAAAATGGCCACGTGGCCGCCTCGGTGGAGACAGCAGGGGACTCGGAACTCTTCCTCATGAAGCTGATTAACCGCCCCATCATTGCGTTC 
CGGGGGGAACACGGGTTCATTGCGTGCCGCAAGGTCACGGGCACTCTGGATGCCAACCGTTCCAGTTACGATGTCTTCCAGTTGGAATTCAATGACG 
GCGCCTACAACATCAAAGACTCCACGGGCAAGTACTGGACGGTGGGTAGTGATTCCTCGGT^ 

TGAGTTCTGTGACTACT^ATAAGGTGGCTCTCAAGGTGGGCGGCCGCTACCTGAAGGGGGACCACGCTGGGGTCCTGAAGGCCTGCGCGGAGACTATC 
GACCCCGCCTCACTCTGGGAGTACTAG 

HUMAN SEQUENCE - GENOMIC 

TTGCATTTGAAGCTACCATGACTGCAAGAGGGAGTTCCCGAAGGGGTCTGGGAATGGCCCAGGGAAATAGGGGTGAGGTGAGACTCCACTGCCCCGA 
CAGTGCCCATGTATGAATTAGAAAAAGTGGGCCAGGCGTGGTGGCTCACACCTGTAATCCCAGCATTTTGGGAGGCTGAGTGGGCAGATCATGAGGT 
CAGGAGTTCGAAACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCTGGGCTTGGTGGCACACATCTGTAGTCCCA 
GCTACTCAGGAGGCTGAGGCAGAAGAATCGCTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCCGAGGTCATGCCACTGCACTCCAGCCTGGGTGACA 
GAGTGAGACTTTGTCTCAAAAAAAAAAAAAAAAAAAAAAAGAAAAGAAAGAAAGAAAAAGAAAAAAAGTGGGCCAGGTG 

ATCCCAGCACTTTGGGAGGCTGAGGTGGGAGGATTGCTTACAGCCAGGAGTTTGAGACCAGGTTGGACAACATGGTGAGACCTTGTCTTTACAAAAA 
AATACAAAAACCTATCTGGGCATGGTGTTGCATTCCTTTAGTCCCAGCTACTTAGGAGGTTGAGGTGGAAGGATCACTTGAGTTAAGGGAGGAGACC 
ACCCCTCATATTGTCTTATGCCCAATTTCTGCCTCCAAAGAAAGAAAAAGTAAAAACTAAAAGGCAGAAATGAAAACCACAGGCAGACAGCCCAGCG 
CCACACCCTGGGCCTCGTAGTTAAAGATCGACCCCTGATCTAATCGGTGATGTTATCTATAGACTACAGACATTGTATAGAAATGCACTGTGAAAAT 
CCCTATCTGGTTTTGTTCTGATCTAATTACCGGTGCATGCAGCCCCCAGTCACGTACCCCCTGCTTGCTCAATCACGACCCTCTCACGTGCACCCCC 
TTAGAGTTGTGAGCCCTTAAAAGGGACAGGAATTGCTCACTCGGGGAGCTTGGCTCTTGAGACAGGAGTCTTGCTGATGCCCCTGGCCAAATAAACC 
CCTTCCTTCTTTAACTCGGTGTCTGAGTTTTGTCTGCGGCTCATCCTGCTACAGAGTCTAGGAGGCAGAGGTTGCAGTAAGCCAAATTCACGCCACT 
GCACTCCAGCCTGGGTGACAGAGCAAGACCCCACCAAAAAAAGAAAAGAGGCCAGGCGCAGTGGC^ 

AGACGGGGTTTCACCATGTTGGCCAGGCTGATCTCAAACTCCTGTCCTCAGGTGATCCGCCCACrTTGGCCTCCa^GTGCTGCGATAACAGGAGT 
GGAGGCT CAACTTTTTAAAGAAG AAAAGGACGAATCAGGACAGGAGACAATTAC AAG CT CTGTTC ATT CGGAATTCTCATTGGCTTAC AGAAATAAC 
TTTGGCTAGTGATTGGTTATATGTTGTAGACAGACCCACAGGGTGGATGGCGTCTGTGGCCACTTGGGCATTAGCn'GGTCCAGAGCCCAGAGTCCAT 
GTAGCAAGT AGCTT C AGGACGTAATTAT TTAGCTCAATAGAAAGTGAGAT GTG ACTGCTGTTAC TTTTTTTTT TTTTTGAG ACGGAGTTTCA CTCTT 
GTTGCCCAAGCTGGAGTGCAATGGCGCGATCTCGGCTCACTGCAACCTCTGTCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCGAGTAGCT 
GGGATTACAGGCGTGTGCCACCAAGCCCGGCTAATTTTTGTATTTTTAGTAGAGACAGGGTT^ 
CCTTGTCATCCACCGGCCTCGGCCTCTCAAGGTGCTAGGATTACAGGCATGA 

ATCATTTTTATTCTGCT AGTT CC AT AAAAGCAATGTAAAC ACCAGCATTCT ATACAT CTTGCTGGTGAACTCACATAATGCTTAGTTC CCTGATCCT 
TTG ACCTCCTTGTC TTC TCCAGTTATTTTCTGTTTGG ACC ACTGGCCA CAGG AATGG AGTGGGGTTGGGGCTTAGG AGAGAG CAATGTGGCTT TTTG 
GTATGACTTGTTTCATTGACTGCTGTTACATTTTAAATGCCTTTCTGGGCCTGATAATTTAAGGGGGCTGGCATTTCTCACATCAAA 
TTTTTTTGTTTGTTTCTCACTATTGCTCAAAAGCCACATGTGTCAAGGACAGCTCTTGTTAACGGGCAGAGAAGCTTATAGAACCTT^ 
GCGCGGTTGTTCATGTCTGGGGGACAAGAGCGAAACTCCATCTCAAAAAGAAAAAAAAAAAAAAGAACATTTCCAGGCCAGGCACAGTGGCTCACAC 
CTGTAAGTCCCAGCACTTTGGGAGATTGAGGTGGGAGGATCACTTGAGGCCAGGAGTTTGAGACCAGCATAGGCCACATGGCAAGAACCCAGTTTCT 



GGAGGCGGAGGTTGC^GTGAGCTGTGATCGCACCACTGCATTTC&GCCTGGGCAACAG 

AGAACATTTCCATTATTGCAGAATGTTCTATTGGACGGCGCTGGGCCAGATGCCTGCCCCAAGCCCTGGGACACCCAAACCTGGTGAAAGTGCCAAG
CTCCCATCTGGGGGTGCTCACGGAAGGGGCCGGGAGCTGGGATTACAAGCGTGGAGGCAGGCGCCACCCCAGAGGAGGTGGGTGATGTCTGAGCCAG
AGTCTTTGGGATGAGCAGGGCTTTGGCAGGCAGCTTTGTTGGGCAGGCAACAGGCACAGCAGG 

GGGGAGAAG AG AAGGGAGGGTGGCAGGAGAG ATGGGC AGAGG CCC C CC AGGAGCCCAGAG CCTTCAGGGCTTTGGGCCCTCTTGGGGCAC TGGGGAG 
CCACGG GAG AAGTG TGGGGTGGAGAGGGG CG CC CTGGATTTG ACC ACCTTCAGGAACCTACCTTGGC TGC CCGGGTGGGTGGGATGGAGGTGATGAA 
GGTGGAGGTCGTAGCATTGTCACTGAGAGCGATGCTTATTCTGATTTTTGCCCCCGTGAGCCTCAGAAATCGGTCAGAACAGTGCTCGGGCTGGGCG 
TGGTGGCTCACGCCTGGAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGCAAA 
ACC C CGTCTCTACTAAAAACACAAAAATTAG CTGGGTGTAGTGGTGCATGC CTGTAATC CAG CTACTCCAGAGGCTGAGG CCTCC CTGGGAGGCGG A 
GTGCAGTGAGCCAAGATTGTACCACCGCCCCTCCAGCCTGGGTAACAGAGTGAGACTCTGTCTCAAAACAAACAAAAAAGAACAAACCAAAACCAGT 
GCTCCAATTCACCTTGCTGAATTGGAGAAGAACAGGTGCCTGGGTCCTCCCGGCTGACG^^ 

TGTTTAC AG CAGAGGTGGTTC AGGGAGG AGTGAG TGG CAGATGGT GCAGGCTCAGGG AGGAGTGAGTGGCAGGTGGGGCCTG AGG CCAGAGGCAGGA 
GGACCGTTCAAGACTGAGGTGTCTTGGGCACTGGTGCCGGGGCCGGGGAGAGCCCTGGCATGCCGGGCCCCGGCCCTCAGTGAAAGCAGGATGAGG 
GGCGGCCAAGCAGGGAGGCCAAGGCCTGGCAGCAGCCGGCAGCCCCTCTCCTGGCAGGGACATCTTGGCAGGAGCCGGGCTCTGCCCAGGGCGCTAA 
TGTCCTCTTAATTAGCCAGGCACTTGCCAAGCACTTGCOS^ 

CGGGTGAGT AGGGGGGATGGCCAGGTGAGTAC CATGG CCAGAAGGTGCGGCCTCTGAGT CCC CAC TGAAGC CAATCC AGGCC C CG AC CTCTGGTCAT 
CATCAA CAGAAGAGTGAC AGAAACAGAAAAG ATATCAAGG AACACGTTTTC AGGGCCAGATT C CAGGT C CC CC ACCTATTTT CCTTTGCT CCTGCCT 
TGCCTGGCTCTGCCCCCTTGAACTCTTATTCACCCAGTGTAGGCATCACCTCCTCCT 
CCTGCACAGGGCAGCTCATGTATGGATCCACAGGTGAGTTCCTGTCTTTGAAGGGTT^^ 
CCACCCAC^TCCCTCTTTGCAAATCCCATGGGAACTATGAGCCTGGG^ 

TGTATCCAGGCCTCAGTTTCCCTGCTTGTAACCAGAAGTCCATGCAGCCAGAAATTGTCAGGGAGCTGGGCACGGTGGCTCATGCCTGTAATCCCAG
CACTTTCAGAGGCTGAGGC^GGAGGTTCACTTGTGATCAGGAGTTTGGGACCCCAGCC^ 

AAAAACAATTAGCCAGGCCTGGAGGCATA<^CTTGTAGTCCTAGCTATTC^GGGAGGCTGAGGTGGGAGGATCACTTGAGTCCAGGAGTTTGTGGC 
TGGGGTGAGCTCTGATTGTACCACTGCACTCCAGCCTGGGTAAGAGAATGAGACCTTGTTTTATTTTATTTTATTTTTATTTATTTTGAGATGGAGT 
CTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCAATGTCGGTTCACTGCAACCTCTGCTTCCCGGCTTCAAGAGATTCTCCCGCCTCAGCCTCCC 
AAGTAGCTGGGATTACAGGCGTGTGCCACCACGCCTGGCTAATTTTTTGT^ 

AACTCCTGACCTTAGGTGATCCATGCCCCCTTGGCCTCCTAAATTGCTGGGATTATAGGCATGAGCCACCGGGCCTGGCCGAGACCCTGTTTTAAAA 
AAATTAATCAGGTCCAGTGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGAGGGCAGATCACCTGAGATCGGGAGTTCGAGACCA 
GCCTGACCAACATGCAGAAACCCCGTCTCTACTAAAAATACAAAATTAGCAGGGCGTGGTGGCGCACACCTGTAATCCCAGCTACTTGGGAGGCTGA 
GGCAGGAGAATCACTTGGACCTGGGAAGCAGAGCTTGTGGTGAGCCGAGATTGCACCATTCCACTCCAGCCTGGGCAAAAAGAGTGAAACTCCATCT
CAAATAATAATAATAATAATAATAATAATAATAATAATAATAATTGGTAGAGATGTTGGTAGAGGCAGGACAGTGGGAGTGACAGCCTGGTCCTGTC
CTCCCGGGCTCTGGCTGACCATGCTGTAGTGTGGCTCAGGGCCCTGGAGGGCACAGCTGTCCTGTGGCCCTGTGGTCAGCCGGGAAGCCAGGCCTGG 
CAGGACTCCTGCCCTAGAATCAGCTCCTCCCACTCCCCAGGGTCCCATGCAGGGCGCTGGCCGTACCAGGCCTCCTCGCGGGTGCCACAGAGCCAGC 
GGCTGCAC^GCAACAGAGACGCCATTCACAGGCTCTCGGCCACGAGGCCGCTCATTTACAGAGGGCTGAGCCGCTGACATCTGTGCGTTTGTAACAT 
GGACAGCATTTCTGGTCCCATACAGACGCCTGGGCCAGCCACATTTATCTCCCCTTGAGAGAAGGGAAACGAGGCTCGGGAGGGAATGAGGGATTGG 
CTTGGGGTCACACAACTTTAATGATGACCCTGGGCAGGAAGCTCCCAGGATGGCAACAGAGACTCAGGGCAGGCTGGGCATGGTGGCTCACACTTGT
CATCCCAGCCCTTTGGGAGGCAGAGGTGCAAGGATCACTCGAGCCCAGGAGCTCAAGGCCAGCCTGGGGAATATAGTGAGAACCCATCTCTAAAAAA 
AAAAAAAATAGGCCGGTCGCGGTGACTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACGAGGTCAGGAGATTGAGACCATC 
CTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATAGGCCGGGCGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGA 
GGCCGGCGGATCACGAGGTCAGGAGTTCGAGATCAGCCTGACAAACATGACGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCGGGCGTGGTG
ACACGCGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCGCTTGAACCATCGAGGTGGAGGTTGCGGTGAGCCGAGATCGCACTACTG 
CACTCCAGTCTGGGAGACAGAGTGAGACTCCGTCTCAAAAAAAAAAAAAAAAATTAGCCGGGCGTGGTGGCGGGCGCCTGTAGTCCCAGCTACTCGG 
GAGGCTGAGGTAGGAGAATGGCGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCCGAGATCGCGCCACTGCACTAAAGCCTGGGCAACAGAGCGAGAC 
TCTATCTCAAAAAAAAAAAAAAAAATTTAGCCGGGCATGGTGGTGCATGTCTGTACCC^ 

CCCAGGAGTTTGAGGCGGGAGTGAGCTGTGATGGCACCACTGCACTCCAGCCTGGGTGACAGAGTGAGACCCCGTCTCTCCCTATCCCCCGCAAAAA 
AAAAAAAACAACTCAGGGCTTTCCCACCCCTGTCCTTGCACAGATGAAGAACCGAAGCT^ 

TGGTCTTTGGACTACAGTCAAGACCTAGCACAGGGCrCAGGCCTAAAAAAAAGCTGTTGAGGAGGGTGTGAGGCTGCAACGTTGCGGTGAGAAGGGG 

GTCCCCAGGGAGGGCGCAGCAGGAAGCCCCAGGGAAGTGCCTGGGAGAGGGAGGTTGTGCAGCCAGAAGGAGCAGCCCGCAGCCTTTGGTTGTTGAC 

TCCTGCTTTGTAAGTGGCAGTCTGGTTGGGTAGGACAGGGTCCGAGTCCTCACTCAGGGAACTGAGTCCAGGGTGAGCTGAGCAGCCCTTCCTGGTG 

GCCTCACTTTCCCCTGCAGAGCCTGCTGCATGGTGTCCAGTGGCCAGCCTGGGAGGAGCTCAGGACTGGCCTCACCTCGTGCCACGCCCTCGTACCA 

AGGTGGCTCAGATGGCACCTGGGTTCCCGAAGGGCCCAGGAACACAGCGTCCATGTCCCCATCCTTCCCCGGAGGGACTTGGGGTGGGGCCTGCAGG 

AAGGATCATGTGACTTGTTCAAGGGCAGCTTGTCCTGCCCCGGACACAGAGGTGCCCATCGTAAGCGAGATGCAGGCAAGGTGAGGACAGGGCATGG 

TGCCGAGCAGGAATGATTTTCGAAAATGCTACTCTGTGGTCGGGCACGGTGGCTCACTCCTGTAATCCCAGTACTTTGGGAGGCCCAGGCGGGCAGA 

TCATGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCTTCTCTACTGAAAATACAAAAAACTACCCGGGCGTGGTGGTG 

TGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCGCTTGAACCCTGGAGGTGAAGGTTGCAATGAGCCGAGATTGCACCACTGCACTCCGAC

CTGGGCGACAGAGAAAGACTCCGTCTGAAAAACAAACAAACAAAACCCCGAGATTCTCTTTTCCCCCGTCCGGAGCTCTATGGCCATCTGGAGCTCG 

TTCTGTCCACAAGGACACATTTCCTGGCAGCAGCTGTGGACCAGGGCTCACTCACTCACATCTGTACCCAATCTAGAGCAGGACAAACACCTATCAC

CTGCACTGGCAATGGACAGAGGACAGTGGCAGCCCCATCACCAGAGGCATTCAAGCCAAGGCATTTTCTATCGTCTATTTATCTATCTATCTATCTA 

TCTATCTATCTATCTATCTATCTATCTAGATGAGATCTTGCTATGTTGCCCAGGTTGAACTCCTGGCCTCAAGCGATCCTCCTGCTTCAGCCTCCCA

AAGTGTTGGGATTTCAGGTGTGAACCACTGTACCTGGCTGGAGTGCAGTGGCACTGTCATAGCTCACTGC^GCCTCCACCTCCTGGGCTCAAGGAAT 

CCTCTTTCCTCAGCCTCCTGAGTAGCTGGGACCACAGGCATGCACCAT^ 

AGCAAAGAAGCACAAAAAC^TGTTGGCTTGCGGTTGAGGGCTGATGGGAGGGGGTTCTGGCCA 

GTGGGAGGGGTTGCCAGGGGCGTGCGGGTTCTGGGGCTCAAGGCCCTGCCGGTAAACCCATTTGAAGCAGGATGGCAAGAAGGTGACACCATCTTCC 

CCCGCGCACTGAAAGCCCCTGGCTGGGATTGCCTGGGGCAGACACAGGCTCGGACCAGCCCCAGCAATCCCAGTTTATCAGCGAGCCGGCTGAGGGC 

CCGGAGTTATCTCAGTGCCCGGCCTGAGACCTTGTGGGCAGCCTGTGGTCATGCTGGCATTCCAGGGGCCTTTTGGCATGTGGG^AATGTCCAGGAA 

AAGCCrCAGCCTTCGGTGAGGCGCAGAAAAGGGAAGTGTCCCTAGAGGGGGTGGGTGAGGGCGTGGGAGGTGGTGTCTGCAGGGAATGTCCCCTTTG 

GGGGAGGAGGATGGAGGGTTGGGATTCTGAGGATGGGGGGGGGGGCTGTAGCCAGCACCATGTCCCTCCTGTGTGACCAGCTCAGAGTCCCATGAAA

TTGGGGCTTGGGAGGGGAAGGGACACTGGCCTGGGAACCAGAGACCTGGGCTGGTCTGGCTCACAGTTGCTGGACCTCTGTGATCCGTGTCA 

CGAGAACACC^ATTCCTGTCCTGCCCACTCACCACCAGGTGGGACCTGAACCCTGGCATCGCCAGCATTGGGAATGTC 

CTCCCGGAGACCTATTTGGGCC^CCCGAGGCGGGTGCCTGGGCCACACGGAGGGGTCCTGGTGGTCTTCAGGGCAGCGGCTGTGGGGCTGAAGCCTC 
AAGGAACCACATCTCTGCATAGGAGGGCCAGGCTGCAGGGCCTCGGAGACAACCTAGTTGGGCGTGTTGGGGTCTGTGGGTCCCAGGTCCTGGCCTC 
ACCGGGTCCCCACCGCGCTGTCAGCTCCCAGCCTCTTTCCCTCGTCTGCCTCTGGGCTTCTGTAAGGCTATGTGCTCCAAGGCC^CCTCCTCCAGGC 
AGCCCTCAGACCCCCACCTTCGCCCAGTACCGATCTGCACCGTGGTCn'CTGAAGTCT^
AGGCTTGGGGTCGGGGTGTCCGAGGTGGGGGACATCCGGGGGGGTTAGGTGGCTGGCGCGGGGAGCCGGGGTTGTGAGGGGTGATGTCCTCAGGCGG
CGGCGCTGCGGGGTGCGGCGAGGACACCGGTGGGGTGAGAGCACCGGCGGGGCAGCAGCGGGGGCCGCAGCGCCGGGTCCCTCGGCCCGGGGCCCCT 
CCCGCGCGGAGCCAGGGGCGGGACAGGGGGGCGTGGCCTGGTGGCGCTGACGTCACCTCGCCTATAAAATGTCCGGGGCGCCGCTAGCTGGGCTTTG 
TGGAGCGCTGCGGAGGGTGCGTGCGGGCCGCGGCAGCCGAACAAAGGAGCAGGGGCGCCGCCGCAGGGACCCGCCACCCACCTCCCGGGGCCGCGCA 
GCGGCCTCTCGTCTACTGCCACCATGACCGCCAACGGCACAGCCGAGGCGGTGCAGATCCAGTTCGGCCTCATCAACTGCGGCAACAAGTACCTGAC 
GGCCGAGGCGTTCGGGTTCAAGGTGAACGCGTCCGCCAGCAGCCTGAAGAAGAAGCAGATCTGGACGCTGGAGCAGCCCCCTGACGAGGCGGGCAGC
GCGGCCGTGTGCCTGCGCAGCCACCTGGGCCGCTACCTGGCGGCGGACAAGGACGGCAACGTGACCTGCGAGCGCGAGGTGCCCGGTCCCGACTGCC 
GTTTCCTCATCGTGGCGCACGACGACGGTCGCTGGTCGCTGCAGTCCGAGGCGCACCGGCGCTACTTCGGCGGCACCGAGGACCGCCTGTCCTGCTT

CGCGCAGACGGTGTCCCCCGCCGAGAAGTGGAGCGTGCACATCGCCATGCACCCTCAGGTCAACATCTACAGCGTCACCCGTAAGCGCTACGCGCAC
CTGAGCGCGCGGCCGGCCGACGAGATCGCCGTGGACCGCGACGTGCCCTGGGGCGTCGACTCGCTCATCACCCTCGCCTTCCAGGACCAGCGCTACA 
GCGTGCAGACCGCCGACCACCGCTTCCTGCGCCACGACGGGCGCCTGGTGGCGCGCCCCGAGCCGGCCACTGGCTACACGCTGGAGTTCCGCTCCGG
CAAGGTGGCCTTCCGCGACTGCGAGGGCCGTTACCTGGCGCCGTCGGGGCCCAGCGGCA^ 

GAGCTCTTTGCTCTGGAGCAGAGCTGCGCCCAGGTCGTGCTGCAGGCGGCCAACGAGAGGAACGTGTCCACGCGCCAGGGTGAGTGGGGACGCTGCC 
CCCGCCTCTCCTGGTCCGTGCACAAAGC^ 

GCCCATTGCGCCCCCGCTAGGCACGCGGGCTACCCCGCCTGGAGGGGGCGAGGAGTGG^^ 

CCCGGGCTGGGATCATGGGCTCCCCTAGGCCCCGCGGAGTCGCAACCGTACGTGCACCCTCCTAACCCCCCCCCCCGCCCAATCTTGGCTCTCCCCA 
CGCGCGCTGATCCCTCGGATCAGCTGTCCCAGCTCTTGCGGAGTGGGAGCCCCTCACCCATTTCCTCGrGCCCCTCCCCCCGCCGGCTGTCCTGGAG 
CTCGGGGGTCCGAGGCAAGGGTCGCCACCCGCAAGGGCGCGCCTCCACCCCCACCGGCAGCCTTTCGCGGGCGAGATGGGGGAGGTTAGCCAGGCCT 
TTGATCCCGGCGGGGCGCGCCTCCACCTCCCCGTCTGCCCGGCCTTCCTCCACCTCCTCCCGCTGCCGGGCGGGGTCGGCCTCCGCTGGTGGGGGGG 
GGCGCGGGGTGTCAGCCCTTCCCCCCAGCCCCTCCTCCCGCGTCTGCCCCGCGCTCGAGGCCGCGGCCTTTGTGAGCAGGGGGGCGGGTCGCCTCGA 
CTGGGTCCTCCCTCGCCCCCTTTGTCCTGCTC<^TCTCTGCAGATGGGAAAACCAG^ 

CCCTGCAGAGGAGCCCCCCGCGCCGCCCCCGGGCCCAGGGCTCGGGTCCCGAGGTTCCCCAGGAGGCGGTCTCCCTCCTCGCGCCGCGGCCCGGGAA 
CGGCGTGGCGCGGATGGCGGCCCTCCAGGCACCCCGCCCTTCGCCCGCCGCGCGCGTCTCCAGGCCGGTGCGCTGAGCTCCGCTCCGCGGGCGACCG 
AGGGCGGCTTCAGCGCGAGCCGGGAGACCCCAGGCCAGCTCCCTCGGGAAGCCCCTACCCTCTGGTGAACCCATCCCCTAGGGCGCTCGCCGGGAAC 
AATTGACTGCAACTCGATCCGTCCCCTCCGGGGCCTCCTGCAGGCACGGCTATTTGCGACCGGCCTGTGCGCCACTCTTTCCCTTCTTTCTCAGATT 
CTCCCCGAGGGGCTTTCTCTTCCTTTTGGCGTCCTCTTCCACTCCTGGCGGAGAAGCTGTTCCTTGCTTGCAGACAGAGGCCCCGCTGGAGGGAGGG
GCGTGGGGGGATCTTTTTCCCTGGATAGGAAGTCGGACCCCAGGCCTTGGAGATGGCTGGACGGGAGCCCAGTTTGTGTCCTCATCCCTCCTCTTGG 
AAGGAGCCCTCCTGACGGCCCCCCTCTTGGCTGTGGGCTCTGCAGGAGGGGGAAAGACCCCCCATGGCAGTGGGGGTGAGGGTGGAGGCCTGGGTGG 
GGTAATGGCCATTTTGTGCCTGAAGTTTGCTACCTTAAGT^ 

GCATGCAGAAGTGGGTGCATTTGGCTCCAGAAACCCCGTTTCCTGTTTCCCACAGCGCATGGCGGGCAGAGCTCCTCACCTGATCAGCAGGCATTGA 
GCCCTACTAAGGGGACCTGCTGAGCCATGAGTGAACCAGTGGGGTTCACGTTCTCTTGGGGACATTACGAAGTGTCAAGGAAAAGGCAGCGGGTACA 
GGTTATTAGATGGCTGAGTCCTTTCTGAGCAGGACATTTGAGCTGGCCTGAAGGATGAATAGAAGGTGGCAGGTATGGGGGGGGTGCTGTGTGGACC
AGCGTTGTAGGCTGTGGGTGTCTCCTGTGCAAAGGTCTTATGGCAGGAAGGTACAGAGAAAGCCCAGTGTGGCTGGGAGGAAGCTTGTGAGGTCTGG 
GCCACGGGACCTTGGGTGAGAGCTTAGTG<^GGGCTATGGTCCTTGCCTTGGGGTTGCAGCCACAGCCTCCCGTGGGGAAGAAGTCGCAGAAGTAGG 
TGGCCTTGGCCCTGGGGGCCAGGAAGTTGCAGAGAGCCTGAGGGTGTTTTTGGCGAGCCTGGGCAGATAACTCCCCTCTCCTGAGAAGCTTGCTGGG 
GGCGTCTGGTGTGTGATTCAGGCTGATAGGGTGGGAGGAACCAGAAATTGCAGCCAGGAGAGGTGGTGGTGATGGCTCAGCTTGGAGTTGAAAGGGA
TGTGGGACGGCGCGGGGGTGGGGGTACTTGGGGGCCGAGGTTGGGGCTCCTCTGGCCTCAGATCTTGGGGTCCTTATCTCGTTTCTATGTCAGCCAA 
CTCCCGGGGGGCTTTTGGGAACTGGAGTCTGCAGGGGTGGGCGTGGGGGGCGTGTCTGCAGAGGCCATTTTGGACGGGCCTGCGTGGCAAGGGGGAG 
GCGTCTGCTTCCGACACACATGCTGGGGAACGCCAGCCAGGAAGGGGAGGATCCGGACTTAGCTGGCAGGGGGGATGTGAATCATCTCTGCAGGCCT 
GCACGGGCGGGCCCGGTGGGGGCAGGGTGCTCCCCTTTGAAAAACGCTGCGCCCTGCTCTTCAGCCTCAGTTTCCCCATCTGTAAAGTGTGCATTCC 
AGGCCTCCTTGGGCTGGCCCAGAGCTGCCCTGGGCAGGGGCTTTCAGCCCTTCACAC^GAGGCTGGGTCAGCTGAGTGAGTGCTCCTTTGTCCTCC 

TCTCTCACATCTGGGAGTCCTCTCCCGCCAGCCTGTGCTTGCATCTGGGGATCCATGCCAGGGATAGGACCTGTCCTCCCTGCTGGCTTGGGACATG 
CCCTCTCCCAGCCACTCTGAGGAGGGCTGACTGAGGAAGGGCTCGTCAAGCTGGGCTTTGCAGGATGTGTAAGAGTTCTCCCCACGAGGCGCAGGGC 
ATTCTGAGCCCAGGGAATGGCTTGTATAAGGATGCAGAGGCATTTGAAATGGCCAAGTAGTTGCAGGAATCG^ 

AGCCATAGAACATTCCAGGCCCCCTCCCCTAAATGAGATGGAAGGAGTGCCTGTTTTGAACAAGCCAGGGGCATCTGGGGACCGTTAAGGCCTGGGG 

GTGGTGATGGGGACTGGAGGGTGTGAGGCAACCAGGGGGTGCCCCTCTGAGC 

CCTGTAATCCAAGCACTTTGGGAGGCCAAGGAGGGAAGATGGCTT 

AACATGGCAAAACCTCTGTCTACAAAAAATACAAAAAATTAGCCAGGCATGGTGATGCACGCCTGrOSTCCCAGCTACTCAGGAGGCTTAGATGGGA 
GGATCACTTGAGCCGGGGAGTTTGAGGCTGCAGTGAGCCGAGATCGTGCCACTGCACTCCAGCCTGGGCAACAGAACGAGACCCTGTCTTAAAAAA^ 
TTTTTTTTGGCCTGGTGCGGTGGCTCACGCCTGTAATGCCAGCACTTTGGGAGTCCAAGGTGGGTGGATCACCTGGAGGTCGGGAGTTTGAGACCAG 
TCTGACCAACATGGAGAAACCCCATCTCTACTAAAAACACAAAAATTAGCCGGG 

GGCAGGAGAATCCCTTGAACCCGGGAGGCGGAAGTTGCAGTGAGCCGAGATCGCACCATTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCCATCT
CAAAAAACAAAACAAAACAAAAAAAACAACCAAAACAAACAAACAAAAAACTGTAA 

GAGGATGGCTCAAAGGCTGGAAACAGGGAAGGCCACTGTGAGGGGAGGACAGGCAGGGCTAGACCTCTCTGGAAGGAGGAGGGAGAGG 

GCCCGGCACAGGAGACrCTTAATCTCCTGGCTCCCGGGGGCTTCTGTGGGCCTC 

TGAAGAAAACACTGTTCCCATAGTAACACAGCTCTAATTAGAGACTCCAAGG 

GGACGACACCCCTTTCCCCTATCCCTCCCTGCTGGGGTGTGGCCTGATGTCTACGTGGCACCAGGCCCCTAAATCCTTTCAACATCCTGGCGGAGCG 
GAGAGGCTGGGCCCTTTTGACAGGTGGGGAAACTGAGGAAGGTTGCAGGGAAGGCACTCATCCAAGGGTGCCTGGCCCCCCGCGGTGGGTGCCTCAC
TGGCTGGCCTAAACCTTTGCATCAAACCAGTTCCAGGATTGAACGCAACAGGCTGATGGGGACTGAGTCGTGGTACAGATGGCGGCAGTTGTGTGGC 
CCTGGGCCCAGGAATGGCTGGGAAGCTCCCTTTCTTCTCTGGCTTGGGGGATGAGGTTAGTGTAAATTTCATGGAGTTGCCAGGAGGATTAAGGCCG 
ACGTGCATTCACACGTGGCCCCGTGGCTGCCGCCAGTTAAGCCCTGCGCAGCCATTGGAAGTCTCATTGAGAAGGAGCCTCCCGGCTTCCTGCATTT 
GTTCCAGGCGGGCTGGTAAGCGCCCTGCTTATTTGCAAGGCTGTGTGAGGACGAGCCCTGTAAACCCT^ 

TTCCACTGTCTCCCCTTCTCCTTTTGCCCCTCTTTCTGGGGAACCTGGAGCCCCTGATGCCCTCTTGTTTTTTCCTGCTGCTGGGGT 
CTCTCCCTCCATCCTTTCTGCTGCTGGGGTGGGCCTGCAGGAGGCTGTCCT 

GCCCCAGCGCATTGGACAGGTGGTGCTCTGTGGCCGGCACAGGCCAGGGCCCAGGGTGCGGCCCCTGCCTGTAGGTAGACATGCAGGCCCTTCTGAG 
CAGAATGAGGGGGCGGTGAGGGCAGCTGTGCGGGTGTGGCTGGCCTGACGGCTCCCTGCAGCCCTGCGGCTTCTGTGCAGTGGGGGTGGGGCGGGCC 
AGACCTTTGCAGGGCCTCTTGGCTGGTGGAGGGCTGAGTGAGCAGAGGCCCCAGCCCTCGTGTTCCCTGGGGTGTGGCCTTAGGATGGTCCCGGCAC 
CCTGGGGACCCAGCCCCTGCTCACCTTATCCTCCTCCCGTGCTTGGCTGGGGTGTGGTGGCTGAGCCAGCCCACCCAGTGCGGTGGATGCTCGTCGG 
AGGGCAGAGGGTGGGCTACCGGCTTAAGGGGCCCCAGGGAACCTGGGGGGTGGAGCCCAAGGGGTTCCAAGAAGGGGCGGGGCCAGGAACCCATAGA 
CAAGAGGGTGGGGCAGGGAAAGGGCGTGGCAAGAGGGCCACCCACCTCCGGTCCAGAGAGCCAGCCAGACCCTTACTTTCTCTTGCTGTGTGACCTT 






AGGCAAGGACATGCCACCACCGGCCTTAGTCTCCCTCTTTGTGAAGTAGGATGAAGATCCCCACCTGTGGAGCAGATGTGGGGCTGCATGGTTG<^T 

CAGGATGGAACAGGATGGGGAGGCCGGGCGTGGTGGCTTACCCCTGTAATCCCAGCACTTTGGGAGGCAAAGGTGGGAGGACTGCTTGAGTCCTGGA 

GTTTGGGCAACATAGOSAGACCACCCCCATCTCTACAAAATAATGTTAAAGTTAGCCAGGTC 

GACTGAGGCAAGAGGATCCTTTGAGCCCAGGAGGTGGAGGCTGTAGAGAGC 

CTATCTCTAAAAAAAAAATGGAATTGAGGATGTGTCCTCTGGGTGTGGGCTCTAC^ 

ACCGCTGAGGTCCCTGCAGCATCAAGGTGCCCAAACGAAGACACTCTCCAGCTCTGAGCCAGGCACCCATACAGAGCCCGGCAGACGCCTCTGCTGC 

TGCAGTTCTTAGAATCCAGAGCGCGTGGGGAATGTGAATTTGTGCTGCTGGAGCCAAACATCCCACCTCCAGGTTTTAGCTGGAGGAATCCATGTGG

ATATGCAAAGCAATGTAGTTATAATGAATAGCAATTGTGGGCTGGGCGCAGTGGCTCATACCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGTG 

GATCACCTGAGGTCAGGAGTTTGAGACCGGCCTGGCCAACATAATGAAACTCCGTCTCTACTAAAAATACAAAAAAATTAGCCGGGCATAGTGGCGG 

GTGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAGGCAGAGGTTTCAGTGAGGCGAGATCGTGCCATTGCACT

CCAGCCCGGGAGACACAGCGAGACTCTCTTTCTCAAAAAAAGAATAGCAGCTTTGCTATAGCTTAGGGGAGCAAAACCCGAAAGCCACCTTAAGTTT 

CCCGAAGATAGAGATCCCTGGGGTCCTATTCTGAGACAGAGTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGGATCTCAGCTCACTGCAGCCT 

CCGCCTCCTGGGTTCAAGCAATTCTCCTGCGTCAGCCTCCCGAGCAGCTGGGATTA 

AGACAGAGTCTCGCATTGTCGCCCAGGCTGGAGTGCAGTGGCGCGATCTTGGCTCACTGCAAGCTC 



GATGATCTCGATCTCCTGACCTCGTGATCCGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCTTGCCTGGTCCTTTTTTTTG 
TATTTTTAGTAGAGATGAGGTTTCACCATGTTGGCCAGGCTGGTCTCAAACTCCTGACCTCATGATCCGCCCGCCTCGGCCTCCCAAAGTGCTGGGA 
TTACAGGTGTGGGCCACTGTGCCCGGCCAGTCCCATTCTACTTTTTAGGGACATGGAATCATGTAAAGATGCCATACAGAAAACAGTATGCAAATCC 
AGAGACAGAGGACATCTGTGCATGTTTCTTTCTTTCTTCCTTTTTTTTTTTTTTTTTTTTTTTGAGATGGATTCTTGCTCTGTCGCCCAGGCTGGAG 
TGCAGTGGTGCAATCTCAGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGACTTCCAAGTAGCTGGGATTACAGGCATGC 
CCCACCATGCCCAGCTAATTTTTGTATTTTTAGTAGAGGCGGGGTTTCACCATGTTGGCAAGGCTGGTCTTGAACTCATGACCTCAGGTGATCCACC 
CGCCTCAGCTTCCCAAAGTGCTGGGATTACMGCGTGAGCCGCCGTGGCAACCCGC^^ 

GGTTCACTGGCTGGGCACATGGTGGCTTGTGTCTGTAACCTCAGCACTTTGGGAGGCTGAGTTAGGAGGATCATGTGAGGCCGGGAGTTCAAGACCA
TCCTAGGCAACATAGCCAGACCCTGTCTGTACCAAATAGAAAAAAAAATTAACTTGATGTGGTGGTGTGTGCCTGTAGTCTCAGCTATTTGGGAGGC
TGAGGTGGGAGGATCACTTGAGCCCAGGAAGTCGAGGCTACAGTGAGCTATGATTGTGCCACTGCACTCCAACCTGGGCAACAGAGCAAGACCCTGA 

AGAGACAGGGTTTAAAAAAAAATTTTTTTTTTTTTTTTGAGrGCGGA 

GCCTCCCAGGTTGAAGTGATTCTCCTGCCTCAGCCTCAGGCAATTTTGTATTTTTAGTAGAGACGGGGTGACTCCATGTTGGTCAGGCTGGTCTCAA

TAGGGTCTTGCTAAGTTGTCCAGGCTGGTCTTGAACTCCTGGTCTCAAATGATCCTCCTGACTTGGCCTCCCAAAAGGCTGGGATTGCAGGAGATAT 
GAGCCACCGCACCCGGCCCTTTTATAAGTTTTCTAAAGGAAAGGAACATGTACCATCGTGTTTTATTTTAATTTTTACATAGAAGCAGGAACAATAG 
CATGAACCCCCAGACACGGCCCCCCAAGATGCGCTGTTCCGCTGTGCTGCAGCCCCATTTGTCCTTTTCTATACCATTTTTGGTTGTATTTGAAGCA 
AATCCCTGACATCAGGGCATTTCATCCCAAAATACTTCCATCTGTGTTTCTAAAAAAACGAGGCCATTTTCTTATGTAACCACAACGTGATCAACAG 
TAATCAATGCTTAACATCTAATTCTGGGCCACGTTAAGACCCTCCTGAT 
TTTTTTGAGACAGGGTCTCACTGTGTCACCCAGGCTGGAGTGCAGTGGTGCAATCACGGCTCACTGCAGCCTCAGCCTCCTGCCTCAGCGATCCTCC
TGCCTCAGCCTCCCAAGTAGCTGGGACCACAGGTTTGCACCACCACTCCCAGCTAATTAAAAAAAGTAATTTGTAGAGACGGGGGTGGGGTGGGGGT
CTTGCTATGTTGCCCAGGCTGGTCTCAAACTCCTAGACTCAAACAATCCTCCTGCCTTGGCCTCCCAAAGTGTTGCGATCACAGGCATGAGCCCCTG
TCCCTGGCCCCTATTATTTCTCTAACTGGGGAAAGTCATGTGGGAACAGATGTAGCTTGCCTTGGCCTCTGACCGGCCCTGCCTGCGTTCCTGGGTG
CTCTCTGCTGCTTCTCATGTGTGCCACTGTGGGGACTCGGCCGCCCACCCCACCCCGTGGTGTTACCTTGCGTGTGTAGTTCTGTGAGCTCAGGGCT
ATGGTCTGCCAGAACTAGGGGGCGTGGGGCCCCAGTACCAGCCCAAGGCCTCCTCTCTGCAGGTATGGACCTGTCTGCCAATCAGGACGAGGAGACC
GACCAGGAGACCTTCCAGCTGGAGATCGACCGCGACACCAAAAAGTGTGCCTTCCGTACCCACACGGGCAAGTACTGGACGCTGACGGCCACCGGGG

GCGTGCAGTCCACCGCCTCCAGCAAGTGAGTGCCTCGCTCCCACCTGTCACCGCCCCCACCACCTTGCCTGGGCTACCCCGCCTGACCCTGTCCCGC
CATCCCCCAGGAATGCCAGCTGCTACTTTGACATCGAGTGGCGTGACCGGCGCATCACACTGAGGGCGTCCAATGGCAAGTTTGTGACCTCCAAGAA 
GAATGGGCAGCTGGCCGCCTCGGTGGAGACAGCAGGTAACACTAAAGCCCCAGTTCCCTGGAGCCGTCCTGGAGTCCTGGAGGGTCTGGCCATGCCG 
TGGTCACTTGGTAGCCCCAGCCAAGGCCTGCTCTGTGCTGGGCATCCCCCCGGACTGGCCCCGCACTGTCCTACCCTGGGGCTGACTGCTGTGTGAC 
CCCAGCTCCTGGCCCTCCCTCTCTGGTCACCCCAGCCTCCACCCCACTCCCTGCCAGGAGGCTCACTGACTCCCCTCTTTCTGGGACAGGGGACTCA 
GAGCTCTTCCTCATGAAGCTCATCAACCGCCCCATCATCGTGTTCCGCGGGGAGCATGGCTTCATCGGCTGCCGCAAGGTCACGGGCACCCTGGACG 
CCAACCGCTCCAGCTATGACGTCTTCCAGCTGGAGTTCAACGATGGCGCCTACAACATCAAAGGCAGGTTCTCCTGTGGGCAGCT'GCTGGGCAGGGA 
ACCCCTCGGTCGGGGCTGGGGTCAGTGCTGCGGGGAGCGCCCTCTGCATCCACACTGGACCCTGGCTTGGCTCAGGGCCATTCCAGGCCCTAAAGGG 
ACAGGTGTCTGATGGCCACCAGGGGCTCTGGGATGCAAGCAGCCCCTTTCCCTCTTGTCTGTGTGGTTGGGGGGACTTACCTTGCCCACCTGACAGA 
GAGGTGTGTGGAGGGGAGAGCAGGGAGGGGAAGGAGAGCAGGGAAGGGGAGGGAGGAGAGCAGGGGAGGGGAGAGCCGGGAAGGAGAGGAGAGCAGG 
GGTGGGGAGGTTCTGGAAAGGGTGTGCAGGGGAGGACGCGCCTCGGTTATGGGACTGGAGCCCCTTCCCAGGAGGACCCCCAACAATCCAGAGGTGC 
CTGTTAGGATTCAGAACATGGTTTTTTTGTTTGTTTTTTTGAGACTCACTCCCTCACCCTGGCTGGACTGCAGTGGCGTTATCTCGGCTCACTGCAA 
CCTCTGCCTCCTGGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGTGTGCACCACCACGCCTGGCTAATTTTTATATTT 
TTAAAATTTATTATTTATTTATTTTGAGACCCAGTCTGGAGTGCAGTGGCGTTATCTCGGCTCTCTGCAACCTCTGCCTCCTGGGTTCAAGCGATTC 
TCCTGCCTCAGCCTCCCGAGTAGCTGGGACTATGTGTGGGAGCCACCATGCCTGG^ 

TGTCCAGGCTGGTCTTGAATTCGTGGCCTCAAGTGATCCGCCCACCTCAGCCTCCCACAGTGCTGGGTTTATAGGTGTGAGCCACCACACCCGGCTA
ATTGTTTTGTATTTTTAGTAGAGACGGAGCTTCACTATGTTGGCAAGGCTGGCTCGAACTCCTGACCTCAAGTGATCCGCCCACCTCAGCCTCCCAA
AGTGCTGGGATTACAGGCGTGAGCCACTGCGGCCGAGCAGAACACGTTCTAGGACCCTTGTTCATGTGTCCATCATGGACAGGAGGACGTGCGGGCC
ATAGGGACCCTGGCTCATTCCGGAGCCGGGACTGGAGGGTGGGGCGTCACCCTTGGGAACACCCGTGCCCACCCTCCGCTGCCCAGGGTAGGGGTGG
GGAGCCAGGCTTTGGGCCCCACTTGATAAAGTCCCCTCCCCAGACTCCACAGGCAAATACTGGACGGTGGGCAGTGACTCCGCGGTCACCAGCAGCG
GCGACACTCCTGTGGACTTCTTCTTCGAGTTCTGCGACTATAACAAGGTGGCCATCAAGGTGGGCGGGCGCTACCTGAAGGGCGACCACGCAGGCGT
CCTGAAGGCCTCGGCGGAAACCGTGGACCCCGCCTCGCTCTGGGAGTACTAGGGCCGGCCCGTCCTTCCCCGCCCCTGCCCACATGGCGGCTCCTGC 
CAACCCTCCCTGCTAACCCCTTCTCCGCCAGGTGGGCTCCAGGGCGGGAGGG^ 

CCCCACCTGTCGCCCCTATGGACTCCCCACTCTCCCCTCCGCCCGGGTTCCCTACTCCCCTCGGGTCAGCGGCTGCGGCCTGGCCCTGGGAGGGATT 
TCAGATGCCCCTGCCCTCTTGTCTGCCACGGGGCGAGTCTGGCACCTCTTTCTT 

AAGGGACGGTTGGGGGCTGGGAGCCCTGGGCGTGTAGTGTAACTGGAATCTTTTGCCTCTCCCAGCCACCTCCTCCCAGCCCCCCAGGAGAGCTGGG 
CACATGTCCCAAGCCTGTCAGTGGCCCTCCCTGGTGCACTGTCCCCGAAACCCCTGCTTGGGAAGGGAAGCTGTCGGGTGGGCTAGGACTGACCCTT 
GTGGTGTTTTTTTGGGTGGTGGCTGGAAACAGCCCCTCTCCCACGTGGCAGAGGCTCAGCCTGGCTCCCTTCCCTGGAGCGGCAGGGCGTGACGGCC 
ACAGGGTCTGCCCGCTGCACGTTCTGCCAAGGTGGTGGTGGCGGGCGGGTAGGGGTGTGGGGGCCGTCTTCCTCCTGTCTCTTTCCTTTCACCCTAG 
CCTGACTGGAAGCAGAAAATGACCAAATCAGTATTTTTTTTAATGAAATATTATTGCTGGAGGCGTCCCAGGCAAGCCTGGCTGTAGTAGCGAGTGA
TCTGGCGGGGGGCGTCTCAGCACCCTCCCCAGGGGGTGCATCTCAGCCCCCTCTTTCCGTCCTTCC<^TCCAGCCCCAGCCCTGGGCCTGGGCTGCC

GACACCTGGGCCAGAGCCCCTGCTGTGATTGGTGCTCCCTGGGCCTCCCGGGTGGATGAAGCCAGGCGTCGCCCCCTCCGGGAGCCCTGGGGTGAGC 

CGCCGGGGCCCCCCTGCTGCCAGCCTCCCCCGTCCCCAACATGCATCTCACTCTGGGTGTCTTGGTCTTTTATTTTTTGTAAGTGTCATTTGTATAA 

CTCTAAACGCCC^TGATAGTAGCTTCAAACTGGAAATAGCGAAATAAAATAACTCAGTCTGCAGCCCCAGGCCGGCCTGTGTGTGTCTTGGGGCTGA 

GGTGGGTGGGGGGGCTGAGGTGGGTGGGAGGGCTGGCGGGACAGGTAGGCGCCCTGGCTCCCCAGCTCAGTGCTGGGAGTGTGCAGTGGGAGGGAGG 

CCGTGGCTCCAGTGGGTGCTCCGGAGCTCGTGGGCCCAGCACACCTCCTTAAGCGGGGGATGGAGCGCTGGGAGGGGGTGGACTGTGGCCCATGCGA 

CCCCCAGAGCCATTAGGAGGAGTTCTGTGGTGAGAAGTGGCTGTGGCTCCTCGTAGGCTACGTCCACCATGCGGGGGACCTCGGGGGTGTCTGGCGG

TGGCACGCTGGATGTTGAGAAGGCGCAGCCCAGGGAACACTCAAACCAGGAGACCCCACATTATCCTCTAGTTGACATGTGCCCTTCGACTAGGGGA

CTTGGTGGTAGGGGCAGCTCCGCCCCCATACTGCGAGGATCCGGGCCTTCCACTTCCCCAGACACAGCAGGTTGGGGTGCCCCTGGGGCTGGGAGAT

GCTGCCCTGGGCCCCTCAGGGGGCGGCCGTGGTAGGATGGACCGGGCCGAGCGTGCAGACCCTGGGCAGGTCCCTACATCGGAACCAGCAGCCCTGG 

GGATATCTGGCATATCAGAGGCTGAGGACTGTTTCCAGACTTTACCAAGGCCACAGTCAGGGCGACCTGGGTTCTGAGCCTCCCTTTGCATCC 

CTGCAGGGACAGGGGCTGGGCAGAGGCTCGCGGACCTCGTGCAGCTTCATTCACTCCCAG 

GAGGAGCAGTAAGGCGCAGCCTGGAGCGGAGGGACCTGGCGTCCAGTCATTGTGCGCTGGGAAGGCGGTGATGGCGACGGAATTTGGGACACGCGGG

GCGTCCCGGGGGCAGCCAGGGCACTGCAGGCCGGAGCCCCCTGTTCCCCCGCATCCTC^ 

ATGACGCAACTGGGAGGGCTGCGCTGTGATAGGTGGTCCCTCGGGGGCGTG^^ 

GAAGTCGGGGTCACCTCAGCGAGGTTCCGGCGGCCACCCCTCTCCCCTCCCCCATGAAAGGCAGGCTGGAAGCCAGAAAAATCCCAGTGACTGGAAG 
GGACAGATTAATCCTCCGGAACCAAACTCTTTGAGACCTGGGGAGGAGGGTGGGAGGCACTGGGAAGGGGGATTGGGGGGAGCCTGGCTCATTTCCA 
CCCATGGAGCTGACCTAGAATGACCCTAAGGTCCCCGGAGCCACCAGCTGATTCCGAATTCCATTCATTCTGCAAAATTTGGCGCCTACCACGTGGC 
CGAGATTCCCGAGGCTGCAGCCAATGAGCAGCATAGTCCGATGGGGGAGGCTGGAAGGGCCGTTCACGAGATCTCTCGGGAGGAATCGGTAGGAGCT 
GCCCAACTGAAAGGGACGGGGGAATAACCCGGGGCGAGGGACCTGCCCAGGCCCTGCCCTCTGGGGGGAAGCCAGGCCAGGGGCCGGGGGCATCTCT 
GGGGTCCCAGGTGAGATGCGGGTCGAGGGAGCCAGGCCACGGTTCTCCAGACTCGCGTGGAGGCCACGAGCGTCTTCTGGAGGAGCGGGCACTGCGC 
GGACCGGCAGACTCTGGGGCGCTCGTGCTCCCCATCTCCTGACCTTTTCCTACCTTCAGTTCCCTCCTGTCAGGACAGGTTCTGGGGCTCGCCGGGG
CAGGGAGCCAGGGCAGAGGGTGCAGTTTCCGCTCCTGCCTGAGTCACCTTCGCTTTCTCAGCGArrCTrCATTCACAAGGGCGTGGCCCATCACA
CACTCACACACTTGTGAACTTGCACGCACACACACATATGGACCTGTGCAAAGAGCCCCATGCACAGGCAGGTACACGTGCATGTAGGCGTGCACAC
AGACACGCACAGATGCAGGCAGACCAACTCCCAGGAATCTGTCTCCAGCCTACAGGGATTCAGCACTCAGGGTACCAGCCTCTGTCCCGTGCACCCC
CATTCCCGGCAGTTCTCCCAGGTGGGTCGGGGGCCACTAGGGACCCCATACCCCGAGGGTGGCTCCTCCATTAAGCCTGCCATGGGGTTGGGAGTGA
GAGTCCCACCTCCCACTTGAGAGTCCCCAGTGAGTCAAGGCAAGTGCAGCCAGGGCTCCAAGATCCTrGTTGGGATTCACTGTGATCCCCAAAGATC 
ACGACTTAAGGGAAAACATTCAATTAAACTTGTTAGAGCAAAGAAAGGAGAGACTGGGCTGGGAGCAGTGGCTAACGCCTATATAGTTCAGCTACTC
AGGAGGCCCAGGTGTGAGGATTGCTTGAGTCCAGGAGGCTGAGGCTGCAGTGAGCTATGATGGCACCACCCACTGCACTCCAGCCTGGGTGACAAAG

CAAGACCCTGTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAGGAAAAAGAGGCCGGGCACAGTGGCTCACACCTG
TTAGCCAGACATGGTGGTGGGTGCCTGTAATCTCAGCTACTTGGGAGGCTGGGGCAGGAGAATCGCTTGAACCCAGGAGGCGGAGGTTGTGTGAGCC
GAGATTACACCACTGCACTCCAGCCTGGGCGACAGAGTGAGACTCTGTCTCCAAATAATAATAATAATAATAATAAAATAAAAAGAAAAAAGAAAGA
AAGAAAGGAGAGACTGAGCTATATTGCCCAGTCCAGGCATGTGGGCAGTCTGGGGGACAGTGGGGCATACTCGGTGTCTCAGGGTCCCAGCCCTGGG 
TGCCCCCACTGCTGTCTGCTGTGACCTCCTCAGGAGGCCTCATTGGGAGGGACCCTGAGTCCTGCGGGGGGCTGTGGAGCTGTCACCAGCCCCAGGA
GGACTCAGGCAAGTCCCTCTGGTGGTCGAGGCCTCTGAGCTTCCCAGTGTGTCTCCACGGGACACCCCCCCTCCAGACCTTCTGCCCCAGCTCAGCC
TCTTCCAATTCATTATCAGGGGGTGGGGGAGGGGACAGGAAGCCCCAGACCCAGCTGGGCCCCATTTCCTCCAGGGCCACAGTGGCCTCTCCCAGAG

ACTCTGTCGCCCAGGCTGGAGCGCAGTGGCGTGATCTCGGCTCACTGCAATCTCCACCTCCCACGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAG
TAGCTGGGACTACAGGTACGTGCCACCACACCTAACTAATTTTTGTATTTTTTAGTAGAGATGGGGCTTCACCATATTAGCCTCCCAAAGTGCTGGG
ATTACAGGCATGAGCCACCATGCCCGGCCTTGACTTCACTCTTTTAAAAAAGTCAAATACTTGGTGCTTGAAATCCCAGCACTTTGGGAGGCTGAGG
CAGGAGAACTGCTTGAGCTCAGGAGTTTGAGACCAGCCTGGGCAAC^TGGTGAGACTCCTGTCTCTGTAAAAACTACGAAAATTAGTCAGGCATGGT
GGTGCGTGTCTGTCGTCCCAGCTGCTCAGGAGGCTGAGGTGGGAGGATGGTTTGATCCTGGAAGGTTGAGACTGCAGTGAGCTGTGATTCCACCACT
GCAGTCCAGCCTGGGCCGCAGAGCAAGACTCTCTCTCAAAAATAAATAAAATAAAATAAAAAGTCTAAAACTTTTCTTTCAAACGACGGCTTTTGGG 
GCTCATAAGGTGCGTGAAGATGGAAAGGGGGCCCGGCAGTGCCCATTTCGGCTCCCTTGGGACCCCGGCTTCCCTGAAGACCCCTGAGCAATGGTTG 
GCTGGGTGCCTGGTCTCCCTGAACGGGCTGTGGGCAACAAGGTTGTCTGGATGGGGGAGGGGCTCTGAGACCCTTGGGGGCAGC^GAGGGAGAAGGG 
AACAGATGGGGTCTCCCAGGCAGAGAGGCCTGGAGGGGTGGGGCGGGGCAGGGCATTTGGCGTGA^ 

ACCGGGAGGAGGGGCCTCTGTGATGTGCAAATAATTAACACAGAAGCCTTCACTGTGCTCCCCTCCTCATCCCAGGACCCAGGTCTGACCCTCAGGA 
GGGGAGGAGGTGAGCAAGGGGGGAACCCACCTTCCAGGGAACCCCCAGATAGTGCCAGGCACCAGGCAAGTGCTCTAAAAGGCTTTACTAATATTAT
TTGAAGAAACTAAGCTAATAAATAAATACGGTATAAAATCAAAGGGTACCCACAGATATAAACAAAAATCCACAGCTAACCCGGCCCCAGCCCTCTT 



TGTCGCCCATGCAGCGCAGTAGCGTGATCTCGGCTCACTGCAACTTCCACATCCCGGGTTAAAACGATTCTCCTGCCTCAACCGCCTGAGTAGCTGG 

GACTACAGGGGCGCACCACCATGCCCGGCTAATTTTTTTTTTTTTTTTTTTTTTTTTTGAGCTGGAGTCTTGCTCTGTCGCCCAGGCGGGAGTGC 

TGGCGCAATCTCGGCTCACTTGCAAGCTCTGCCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCCTGAGTGGGTGGGACTACAGGCTCCAGCCA 

TTTAGACGGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCATGATCGTGACTCA 

CCTCAGCCTCCCGAGTAGCTGGGACTACAGGCGCCCACCACCAC^CCCGGCTATTTTTTTGTATTTTTAGTAGAGACAGGGGTTTCACCATGTTAGC 

CAGGATGGTCTCGATCTCCTGACCTTGTGATCCTCCTGCCTCGGCCTCCCGAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCGGCCTCTTGTT 

TTTTTTTTTTTTTTTTTTTATAGACTTCTAAGGAAAGCAACACTAAGAAAATATATTTTTAAGAAATTATTCAGCCAGGCGTGGTGGCTCACGCCTG

TAATCCCAACACTTTGGGAGGCTGAGGCGGGTGGATCACTTGAGGCCAGGAGTTTGAGACCAGCCTGGCTCACATGGTGAAACCCTGACTCTACTAA 

AAATACAAAAAATTAGCCGGGCACAGTGGCATGCTCCTGCAATGCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCCCTTGAACTGGGGAGGCAGAG 

GTTTCAGTGAACCAAGATCGAGCCACTGCACTCCAGCGTGGGTGACAGACTGAAACTGTGTCTCAAAAAAAAATTTTTGTTTTTGGAGATGGGGTCT

TGCTCTGTCACCCAGGCTGGAGTACAATGGGGTGATCTAGGCTCACTGCAACCT 

GTAACTGGCACTACAGGTGCATGTCAGAACACCTGGCTAGTTTTTAACTTTTT^^ 

TCACCTTCTGGGCTCAGTGATCCTCCTGCTTCAGCCTCCCAAAATGCTGGGATC^ 

GTTTATTTCTCACATTTCTTCTAGAACAAACATTGCCAGCGCATCCCAGACTCTGAACTGTGGCTGGGGACTGCAGCATGGAGGTCTTTGCAGGCAC 
TCTGGGAAACATGCCACAAGCAGGGTATTAACCCAAGACAGTCAGTCCCTCCAATGAGAAAAAGGAGCCGAGGGACTGTGCATGCCTCAGAGCCAGA
TGCAGCAGAACTGGTGGGTTGAGGGGGCACTGAGCGCCTGGTGCCTCCTGCTGCCCCCCACCAGCTCCGACAGTCCTCGTCCTTGTCCTCCCCATCA 
GAAGGGAAGAGTGAATTGTCTGTGGGATGTGCCAGTGTCCCCGGTTCTGAGGTTGGGCAGGCTGGGGAGCCCCAGACATGGCTTGCTGCCCACTGTG 
GAGCCTCCTGGACTTAGTTCCAGGCTCrGCCCGATCCTCTGAGGGTGTGGCTTCAAAGCCCTGCTTGGGCCTAAGGCCATCCCAGGGCAGTGACTGA 
GGGTAGACTGAGATGTCCCTTGGGCACCAGTCCTAGGGTACCTGGGCTGGGGGTCTTCAGCTCAGGCCCCCTAGTGCCCGTCCTTTGGGCTGTGGGC 
AGGTTTCCACAGCACTGGCCCTTTCTGACCTTACACCCCTGTGCATCCTCACTGGGGC^ 
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TTTTTTTTCCTTTTTTTTTTTTTTTTTTT 
TTCCGAGACAGAGTCTTGOTCTGTCACCTAGGCTGGAGTGCAGTGGCATGATCTC<3GCTCACTG<^ACCTCCTCCTCCTCCTGGGTTTAAGCAATTC 
TCTTGCCTCAGCCTCCTGAGTAGCTGGCATTACAGGTGCCCGCCACCACGCCCGGCTATTTTTGTAGTTTTAGTAGAGACGGGGTTTCACCATGTTG 
GCCAGGCTGGTCTCAAACTCCTGACCTTGTGCTCTGCCCGCCTTGGCCTC^ 

CAT TTT CTT'ACCAC AGCTTT 1 CC AGAAAT AAAGTTAGG ATGGAAAGAGAGAGAG AGAGATTGAGAG AGAGAGA 

GAGAGTTCTCAGCAAGAAGAGAACTCAGCTCAGAGTTGAGAATGTCTCAACTCAGCTGTCTCCCTGCTGAGACGTCCTGGGCTGAAGTGTCCCCTCA 
CCCCTGCGGGGCTGTGACTGCACACTGCAC^CATGGGTCACAGCAGGGCTTGGCGTTTGGAGCAGGCCGGGTTGGTGGCTTGTCCTCATCTCTTGCC
TGGAGCTGTGCTGGCCTGTGGTTTGCTTTGGCTGCTACCATGTGGAGGGAGAGGTGCTTGTGTCCATGCCTGGCCTCT^ 

TGGAGAGAGGAAGGCCACCACTGTGATGGAGAGAAGCCGGGGCCAGACCGATGAGGGCTGAGAGCCCATGGGCAGGGGTTGTGGTCTCGGCCAAACC
CCAGCTGGCAGCTGATGGCAGCCACCCCGGGGTCCTCAGATATGGCACAGCT^ 

CTCAGGGATGTTGTGAGTAGTGGGCTGAGCTGTGGTTTATTTTTTAGAGACAGGGTCTTGCTCTGTCACCCAGGCTAGAGTCCAGTGGTGTAATCAT 
AGCTCACTGCAGCCTTGACCCCCCAGGCTCAAGTGATCCTTGTGTCTTGGCCTCCTGAGTAGCTAGGACTATAGGTGCATACCACCATGCCTCGCTT 
GTTTTTTGTTTTGATAGAGATGGGGGTCTTGCTAGGTTGCCCAGGCTGGTCTGTAACTCCTGGGCTCAAGTGATCCTCCTGCTTTGGCCTCCCAAAG 
TGCTGGGATTACAGGCGTGAGCCACTGTGCTGGCC1TGGGGTGTGGTTTTATGTGGCCATAGATGACTGAGATGGGTGAGAAGCGGGACCTAGGGGA 
ATATACAGGGCAGGGGGATCCGACAGGGGGACAGGAGGGCAGACAGCTCCTGGAAGGCCTGGGAAAGAGATATCAGGGCTGGACCCTGGAGAAGTGA 
GGCTTTGAGGGAGGGGAGGAGATGGGGCCTGGCGGGGTACGGGCGCTGCTCCTAGGTCTGAGGGCATCGGCGGTCTGAGAGCTTGCGGGAGGGAGGG 
CTGCCTCCCGGTGCTCGGTGCCCTGGCTCTCCTCCTTGCTGCTGGCTGGCCTCCTCCTCCTCATGCATCTGTGTGTCCTGGAACACAGGGCACTGTC 
CCTCTGCCTGTCCAGAAAAGCACGTGCTCCTTGGGGCCTGGCCTGCTCAGCCCTATGTCCCACCCGCTCCCTGCTGCTATTCCCAGGCCTTCGGGCC 
TCACTGCCTGGGGTGGCCGTGCTCAGCCCCACCCTGTGGTGCCATGGCCTCCCCACCCCTGCCCCGAACTGCCCCTGGGACCAGTTCCTTCAGCTCC 
CAGGACAGGGCCTGGCCATCCCTGTGGATTCTCCCTGTTCGTGGACCCCCATTGGTGCCCAGTCTCCTTGGACTGTCTTCTGAGCCTCTCTCAGATG 
TACCCCCAACTGTACCCTTTCACCCAGAGGCCATTTGGCAGCGTCCTCCAGGACTCCTGCCGTGGGGGAAGTCTCCACCAAGGCCAACGCGTGACTG 
CATCCCGAGGAGCTGATTTAAGATCTGGTTCTCAGGAACACCTGGAACTGGCCAGTCAGCATCCCCGGCCCTGCGGTCTGCACTTGGAC^AGGGTCA
GGCAATGAGGATGAGC^GOVGGCGGGGCTTCTCCTGGAAGCCTTCCCAGCCTCCTGGGCCACTCCAGTGGGTCCCCTCCACAGACTCAGCCTGTTCC
CTGCCACCTGGTGCCGGTTTCTCTGTGGCACCTACTAGGGGCCGAGTGCCCCGGGACTGCCCAGCTGAGCAGCGCCGTCCAGGTCTGCACCCAGCTA
CACTCAGGGCACGTGCACAGGTGAGCAGCCGGGCACGCAGGCACCTGCCAGGTGTGGGTGTGGACTAGGTGCTCCCGG

HUMAN SEQUENCE - mRNA

GCGGAGGGTGCGTGCGGGCCGCGGCAGCCGAACAAAGGAGCAGGGGCGCCGCCGCAGGGACCCGCCACCCACCTCCCGGGGCCGCGCAGCGGCCTCT
CGTCTACTGCCACCATGACCGCCAACGGCACAGCCGAGGCGGTGCAGATCCAGTTCGGCCTCATCAACTGCGGCAACAAGTACCTGACGGCCGAGGC
GTTCGGGTTCAAGGTGAACGCGTCCGCCAGCAGCCTGAAGAAGAAGCAGATCTGGACGCTGGAGCAGCCCCCTGACGAGGCGGGCAGCGCGGCCGTG
TGCCTGCGCAGCCACCTGGGCCGCTACCTGGCGGCGGACAAGGACGGCAACGTGACCTGCGAGCGCGAGGTGCCCGGTCCCGACTGCCGTTTCCTCA
TCGTGGCGCACGACGACGGTCGCTGGTCGCTGCAGTCCGAGGCGCACCGGCGCTACTTCGGCGGCACCGAGGACCGCCTGTCCTGCTTCGCGCAGAC
GGTGTCCCCCGCCGAGAAGTGGAGCGTGCACATCGCCATGCACCCTCAGGTCAACATCTACAGTGTCACCCGTAAGCGCTACGCGCACCTGAGCGCG 
CGGCCGGCCGACGAGATCGCCGTGGACCGCGACGTGCCCTGGGGCGTCGACTCGCTCATCACCCTCGCCTTCCAGGACCAGCGCTACAGCGTGCAGA 
CCGCCGACCACCGCTTCCTGCGCCACGACGGGCGCCTGGTGGCGCGCCCCGAGCCGGCCACTGGCTACACGCTGGAGTTCCGCTCCGGCAAGGTGGC
CTTCCGCGACTGCGAGGGCCGTTACCTGGCGCCGTCGGGGCCCAGCGGCACGCTCAAGGCGGGCAAGGCCACC71AGGTGGGCAAGGACGAGCTCTTT
GCTCTGGAGCAGAGCTGCGCCCAGGTCGTGCTGCAGGCGGCCAACGAGAGGAACGTGTCCACGCGCCAGGGTATGGACCTGTCTGCCAATCAGGACG 
AGGAGACCGACCAGGAGACCTTCCAGCTGGAGATCGACCGCGACACCAAAAAGTGTGCCTTCCGTACCCACACGGGCAAGTACTGGACGCTGACGGC
CACCGGGGGCGTGCAGTCCACCGCCTCCAGCAAGAATGCCAGCTGCTACTTTGACATCGAGTGGCGTGACCGGCGCATCACACTGAGGGCGTCCAAT
GGCAAGTTTGTGACCTCCAAGAAGAATGGGCAGCTGGCCGCCTCGGTGGAGACAGCAGGGGACTCAGAGCTCTTCCTCATGAAGCTCATCAACCGCC
CCATCATCGTGTTCCGCGGGGAGCATGGCTTCATCGGCTGCCGCAAGGTCACGGGCACCCTGGACGCCAACCGCTCCAGCTATGACGTCTTCCAGCT
GGAGTTCAACGATGGCGCCTACAACATCAAAGACTCCACAGGCAAATACTGGACGGTGGGCAGTGACTCCGCGGTCACCAGCAGCGGCGACACTCCT
GTGGACTTCTTCTTCGAGTTCTGCGACTATAACAAGGTGGCCATCAAGGTGGGCGGGCGCTACCTGAAGGGCGACCACGCAGGCGTCCTGAAGGCCT

CGGCGGAAACCGTGGACCCCGCCTCGCTCTGGGAGTACTAGGGCCGGCCCGTCCTTCCCCGCCCCTGCCCACATGGCGGCTCCTGCCAACCCTCCCT 
GCTAACCCCTTCTCCGCCAGGTGGGCTCCAGGGCGGGAGGCAAGCCCCCTTGCCTTTCAAACTGGAAACCCCAGAGAAAACGGTGCCCCCACCTGTC 
GCCCCTATGGACTCCCCACTCTCCCCTCCGCCCGGGTTCCCTACTCCCCTCGGGTCAGCGGCTGCGGCCTGGCCCTGGGAGGGATTTCAGATGCCCC 
TGCCCTCTTGTCTGCCACGGGGCGAGTCTGGCACCTCTTTCTTCTGACCTCAGACGGCTCTGAGCCTTATTTCTCTGGAAGCGGCTAAGGGACGGTT 
GGGGGCTGGGAGCCCTGGGCGTGTAGTGTAACTGGAATCTTTTGCCTCTCCCAGCCACCTCCTCCCAGCCCCCCAGGAGAGCTGGGCACATGTCCCA 

TTGGGTGGTGGCTGGAAACAGCCCCTCTCCCACGTGGGAGAGGCTCAGCCTGGCTCCCTTCCCTGGAGCGGCAGGGCGTGACGGCCACAGGGTCTGC 

CCGCTGCACGTTCTGCCAAGGTGGTGGTGGCGGGCGGGTAGGGGTGTGGGGGCCGTCTTCCTCCTGTCTCTTTCCTTTCACCCTAGCCTGACTGGAA 

GCAGAAAATGACCAAATCAGTATTTTTTTTAATGAAATATTATTGCTGGAGGCGTCCCAGGCAAGCCTGGCTGTAGTAGCGAGTGATCTGGCGGGGG

GCGTCTCAGCACCCTCCCCAGGGGGTGCATCTCAGCCCCCTCTTTCCGTCCTTCCCX3TCCAGCCCCAGCCCTGGGCCTGGGCTGCCGACACCTGGGC 

CAGAGCCCCTGCTGTGATTGGTGCTCCCTGGGCCTCCCGGGTGGATGAAGCCAGGCGTCGCCCCCTCCGGGAGCCCTGGGGTGAGCCGCCGGGGCCC 

CCCTGCTGCCAGCCTCCCCCGTCCCCAACATGCATCTCACTCTGGGTGTCT^ 

CATGATAGTAGCTTCAAACTGGAAATAGCGAAATAAAATAACTCAGTCTGC

HUMAN SEQUENCE - CODING 

ATGACCGCCAACGGCACAGCCGAGGCGGTGCAGATCCAGTTCGGCCTCATCAACTGCGGCAACAAGTACCTGACGGCCGAGGCGTTCGGGTTCAAGG 

TGAACGCGTCCGCCAGCAGCCTGAAGAAGAAGCAGATCTGGACGCTGGAGCAGCCCCCTGACGAGGCGGGCAGCGCGGCCGTGTGCCTGCGCAGCCA

CCTGGGCCGCTACCTGGCGGCGGACAAGGACGGCAACGTGACCTGCGAGCGCGAGGTGCCCGGTCCCGACTGCCGTTTCCTCATCGTGGCGCACGAC 

GACGGTCGCTGGTCGCTGCAGTCCGAGGCGCACCGGCGCTACTTCGGCGGCACCGAGGACCGCCTGTCCTGCTTCGCGCAGACGGTGTCCCCCGCCG 

AGAAGTGGAGCGTGCACATCGCCATGCACCCTCAGGTCAACATCTACAGTGTCACCCGTAAGCGCTACGCGCACCTGAGCGCGCGGCCGGCCGACGA

GATCGCCGTGGACCGCGACGTGCCCTGGGGCGTCGACTCGCTCATCACCCTCGCCTTCCAGGACCAGCGCTACAGCGTGCAGACCGCCGACCACCGC 

TTCCTGCGCCACGACGGGCGCCTGGTGGCGCGCCCCGAGCCGGCCACT

AGGGCCGTTACCTGGCGCCGTCGGGGCCCAGCGGCACGCTCAAGGCGGGCAAGGCCAC^ 

CTGCGCCCAGGTCGTGCTGCAGGCGGCCAACGAGAGGAACGTGTCCACGCGCCAGGGTATGGACCTGTCTGCCAATCAGGACGAGGAGACCGACCAG
GAGACCTTCCAGCTGGAGATCGACCGCGACACCAAAAAGTGTGCCTTCCGTACCCACACGGGCAAGTACTGGACGCTGACGGCCACCGGGGGCGTGC 
AGTCCACCGCCTCCAGCAAGAATGCCAGCTGCTACTTTGACATCGAGTGGCGTGACCGGCGCATCACACTGAGGGCGTCCAATGGCAAGTTTGTGAC 
CTCCAAGAAGAATGGGCAGCTGGCCGCCTCGGTGGAGACAGCAGGGGACTCAGAGCTCTTCCTCATGAAGCTCATCAACCGCCCCATCATCGTGTTC 
CGCGGGGAGCATGGCTTCATCGGCTGCCGCAAGGTCACGGGCACCCTGGACGCCAACCGCTCCAGCTATGACGTCTTCCAGCTGGAGTTCAACGATG
GCGCCTACAACATCAAAGACTCCACAGGCAAATACTGGACGGTGGGCAGTGACTCCGCGGTCACCAGCAGCGGCGACACTCCTGTGGACTTCTTCTT
CGAGTTCTGCGACTATAACAAGGTGGCCATCAAGGTGGGCGGGCGCTACCTGAAGGGCGACCACGCAGGCGTCCTGAAGGCCTCGGCGGAAACCGTG 
GACCCCGCCTCGCTCTGGGAGTACTAG 




TABLE II 



MOUSE NOMENCLATURE 
ICSGNM Map3k6 
Celera mCG20807 

HUMAN NOMENCLATURE 
HGNC MAP3K6 
Celera hCG19300 

MOUSE SEQUENCE - GENOMIC 

GGGCTGGACCTTCACAGAGAGGCAGAGCTACACAAGGCAATACC^TTCTCTGGGGCCTGAGGGAGGAAATGCCCCAAACCCTGCTGGCTCCAGTTG 

AGGTGGTGTGCCTCTGGGAGA(^TGGGGAGAAGTCTTGCTTCAAGTTATTTCCCCCACAGCCTGATCCATCTATGCTGGCACCCCCCTCTAGTAGA 

CCCTACAAGATGGGTCTACTATGAGCCCTACTGTTCTAGCTGCTCCTATTTAAAGGGCTGCCTGCCTGAGGGTACAGGGTCACCCCAGCTGCCCATG 

TAGGGTCCTACAAATGCTCACCACATCAACCACTCATTCTCTCCTCATCTCAAGAAGCAGGGTGAGAGAGTTTGGTAATCTGGCCTTTTCATGTAAA 

TTTATGTAAATAGGATATAAATATCACAACATATGTAAATAGGATATAAATACCACAGAACCTCAGCATCTCAGGAGGTCCTGATATCTAGAATCTA

GGAGGTAGAAGGAATGGCAGAAAAGTCTGGAGGAGATATGTTCGCCCTGTGCCCGTTGGCCCTGTGTTTCCATGTAGAACCCAGGGAGCTGGAGCTG 

GAGCTGGAAGCCAGCAAAGTGGTCCCCCTGCCCAGTTATTCAGGTGTCCCAAGAAGTCCTAGAGAGAAGAGATCTGTAGGTTCTCAAGGTAAGGGAT 

GTCAGGGACCTGGCAGAGAGGGCTCTCGAGGTAGCCTGTAGAGGCTGGATGTGGCTGCTTGGGTAAGAATGGCTCTCATAGGCTCATATATTTAAAT

GCTTAGTTATTAAGGAGTGGTACTATTTAAGAAGGATTAAGAGGTGTGGCCTTGTTAGAGGAGGTGTGTCACTGGGGGTGGGCTCTGAAGTTTCAAA 

AACCC^ACTCTGTCTTCCCTCTGCATGCCTTTGTGTGTATCTTTCTGTCTTAGGATCAGGATGTAGCCGGAAACTACTTCTCCAGCAGCATCCCTGC 

CTCTGTGACCTCGTGCTCCCCACTATGGTGATTGTGGACTAAACCTCTGAAACTGTAAACAATCCCCCAGTTAAATGATTCTCTACAGGAGTTGCTG 

CCTTCGTGCCCAGCACTCAGGAGGCAGAGGCAGGTGGATCTCTGTGGGGTCAAGGCCAGCCTGGTCTACAGAGTGAGTTCTGGGACAGCCAGGGATA 

TACAGAAAAGCCTGTCTCAACAAACAAACAAACAAATAGGAAAGGTAGTC^TGGTCATTGTGrCCCCTCCCAGCAATAGAACACTGACTAAGACACT 

GGGGTGCAGGTGCTGAGTGTGGACAATGACGACGTGGGAACCACCTCTGCACCAGGCTCC^GTGGGAGAGTGCCAGGATCCTGGGGTGTACCAGGT 

AGATTGCATTGTTCATGTTTTACTGCAGAGGACTGTGAGGCTCAGAAAGGAACAATTGCTTGTCCCCAATACAGAGCCAGAAAAGGATGCTAATAGG 

TGAGAACACCAAATTCTGTTTCCTCCTCTTCCTCTTCTTCCTTCTCCTCCTCCTCTTCCTCTTCTTCCTTCTTCTCCTCCTCTTCCTTTTGTCTTGA 

GCACTGGGCAGGACACAGGACATCTGCCTCTGGAATTCCAGGGCTACCTTGGCTCCCTAGGCAAGATGGATCCCAAGGGAGAGTCTACCCTCTGACT 

TTACTTTTTGCACAGACTAGGGCTGGGCCCTCTCAGCTACTTAGGGCTGTAAGTGCCCTCGGAGGAGCCAGCATTGGAAGGAGTAGTAAGAATTTGG 

AGGAGCGGGGGCTGGAGAGGTGGCCCAGTGGTTAAGAGCACTGGCTGCTTTTCCAGAGGTCCTGAGTTCAATTCCCAGCAACCACATGGTGGCTCAC 

AGCCATCTGTATTGAGATCTGACGCCCTCTTCTGGTGTGTCTGAAGACAGCTAGAGTGTGCTCATATACATAAAATAAATAAATCTTAAAAATAATA 

ATTTGGAGAAGGGCTGGCTCTGCCCCATGCTGGCTCATGGGGCCTCCCTTAGGGTACTCAGACCTCAGAGTCAGTTGACACCTGGTCCTTTATCTCT 

GGGCAGGACATCAGCCACCTGCTCTGCCACTAAGAGGCTAAGGAGGCGTCCACGGGAGTGAGGAACTTGGAATCAGGCCAGACGGGGGTGGTGGAAG 

TTTCCAGGCATGTAAGCCTCTCCTCGTAGCTCTATAGTGCGCACCTCTTCCCCAGGTCCCAGGATCTGAGAGGACTGCTCGGTCCCTTACTTGTTCC 

AGAATCTGGGTGGAGGCTTTTGGGAAACAACTTGGGGGCCTTTCTCAATCCACCACCCCCCCCCCTTTCCCAGGTCTCAAGAACTTTTCCAGTCTTT 

CCCATTTTTTGGACACTCCAGGGGGTAAGAAAAAAAAGAGGCAACAGGCAAAGCCCGGTGGTGACTTCTCTCTAACTCCCTAGGGGGAGCCAGAGTC 

CACTGAAGCCAGGCATACCTGACAGGACAGCTCCTACAGGGCCAGAAGTCTGAGAAGGCTTCATGGAGGAGGTGTTATTTGGAGGAAAGTAAAGGCT 

CTGTGAACATTCTCTAGGTCTGTGAGTGCAGAGTACTTTGCGGGTAGAAATAAAGGCGAGCAGGAGGAGGTGGGAGAGAGGAAAGGCTAAGGAAAGT 

AAGCTGGGGGTCGTGGGAGAGCCAAACATGAACCCAGAATCTTTGCAAAGGCCACTTCATAGAGGACATTGCATTACTGACATGGAGGGCAAAGGGG 

AGCC ATGGAAG AAT TTT AGGC AG AAAAGGGTCTGGGTCAG ACTTAG TTTTAGAAAGA TAAGT CAAGAGGCT GG AGAGATGGCTCAGCGGTTAAGAG C 

ACTGGCTACTCTTCCAGAGGTCCTGAGTTCAATTCCCAGCAACCCCACATGGGGCGACTCACAACCATCTGTAATGTGATCCAGTGCCCTCTTCTGG 

TGTGTCTGAAGACAGCTACAGTGTACTCATATACATAAAAAAATAAGAGAAAGATAAGTGAGGCGTGGAAGTGCTTGCCTGTAACCCTAACACTTGG 

GAGGCCGGGGGCAGAGGTATTGAAATTAGGGGCCTGGAGTGGGCCCAGGGAGGCCCTGTCTCAATAACAAGCGACATGGGGTTGGAGAGTGACTTTA 

TGGTTAAAAGCAAGCATTGACTGCGGAGCGCCCCCACTGGATAGCTCATAACTGCCCCTACCTCCAGCTGCAGCAGATCTGACGTGCTTGTCTAGAG 

TCCTCGAGCATCTGCGCACTGATGCACACACATTCTCTCTGTTAGACACATGACACCTATACTCACACGTGCATACACACACACACACACTCTACTC 

TCTCCCTCTCTCACCTATTTTTAATCTAAAAACATAACAAAACAATAACAGGCTGCACCCTAGAAGAGACACTGGATGGGGCAAGGCAGAAGGTGGA 

GACAGAGGTTAGGAATGGACAAGGCAGAAGACCAGAAGACCGTGGGAAGTGACATCCACAGTATTGGACATATGTCATCTGAAAGATACTGGAAGAG 

CAGCAAGCACCACCCCCACCCCACCCCACCCCCCTAAGATTTCCCTGCATAGCGATGGAACTTGCTCTGTAGACCAGGCTGGCCTCCAACTCCCAGA 

GATCCCATCTCCCGAGTGCTGGGATTGGGGAGCTTGCCACACCACTGGGCTGTCCTAAGAGCAGAGCTCCAGAGCTGGAGGACGACTTGGCTTGGGA 

GAGGGAGAAAAGGCCACGGGAAAAGGCTAGGAATCCAGCTGAGGCCTCTGGGGGCAACATCTATGCTCTAAATATCTTGGTGGTTCTTCTTCTTCTT 

CTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTTTCCTTCTTTTCCTTCTTTTCCTTCTTT 

TCCTTCTTTTCCTTCTTCTCCTTCTTTTCCTTCTTCTCCTTCTTCTCCTTCTTCTCCTTCTTCTCCTTCTCCTCCTTCTCCTCCTTCTCCTCCTTCT 

CCTTCTTCTCCTTCTTCTTCTCCTTCTTCTCCTTCTTCTCCCTCTTCTCCTTCTTCTCCTTCTCCTTCTTCTCTCCTCCTCCTCCTCCTCCTTCTGT 

AAGTATATATATATATATATATATATATATATACCATAGCTGTACAGATGTTTGTGAGCCTTCGTGTGGTTGATGGGAGTTGGACCTCTGCTTGCTC 

CAGTCAACCCTACTCACTCAGGTCATCAGCTCATCTCTGCTCACCCAGTCCCTGCTTGCTCCAGTCCAAAGATTTATTTATTATAATACATAAGTAC 

ACTGTAGCTGTCTTCAGACGCCAGAAGAGGGCATCAGATCTCACTACGGGTGGTTGTAAGCCACCATGTGGTTGCTGGGATTTGAACTTGAACTCAG 

GACCTTCAGAAGAGCAGTCAGTGCTCTTACCCACTGAGCCATTTCGCCAGTCCCTAATACCTGGGTTCTTAACCTGCAAATTAAATGGGTAACTGGC 

GACAGTATATATATATATATATTCATTCCAATCTTCCCTGGAGCTTACGAGATTTTTGTCATGAGTTGAGTGTCAACAAAAGTTAATGAAGAAACTG 

GATTTAT CATT C ATT ATT CATAGGCT AAACTC ACTCAGAG GAGTC ACATTTGAG COG TAAGCCCCCTC CCCACAATGCAAAGAGAAGGGAAGGGTCC 

AGCCAGCCCTCACACCCATCCAGCCCCTCGCTGCACTTCCCTGGCTTCCCCTGCCCCAGGCCAGTTTGGATTGGCAGCAGGTGCAGGGGTGGATTTC 

TTCCACTCCTGACTGCTACAGAGC^GTGGGGAGACAGGAGTCTGCGTTCTGGCATGGGAATGAAAATTTACAGGAGCTTCCTCGGTAGGGTGAATTC 

CAAGTGAGAGGGCTGGGCCAGCCCAAACCCCTACCATCTCAGTCTCCTCTTCCTCCCAGGTATCTGGGAAGGTCCTGTAGAGTCTGAAGACTTCAGC 

AGT AGCTGC AT C TTTGTTTATTATG CTACCTT CTGCCTC CTC AGGG AGGAGGAGCGCTGG AAGAGTAC ATGCTGGG TAAATTCTCAG AGGGCGCTG C 

AGGTGAGTGGCCTACAGGGGGTCCTGGTGTAACrTTAATTTTTTATTTTTTTAAATCTTATTTTTAATGATTTCTTTTAATGTTT 

TTGTAACCCACCAGAGGTAGTGGAAAAGAAAGAAAACGGGAGAAATGGACCTATTTAGAAAGGTTTTTTGGAACAACAGATGTCTGT 

AATCAGCAGTTCAGTTCACAGGTTAGCAGGCATCGATGGCTTGATCCACTAGCAAACACCTCACAGACACACCAGCAGGCCAGTTCAGTACAGTCGG 

GTTAGCAACGGTGGTGACACGACCTAGTAGAGACAGCCAGGCCTCAGCCTCAGCTTCAGTCAGCAAGAACGACACTGGGAGAAGTTCTCAACTGTGC 

CTCTCTCAAGGAAAGCCACGATCAGCAAAGACTCAAGACCCACAAGCGGTGTGTAGCTAGCTCTATAC^ 

TGGCGTCCTATTTATACCCTCCAAACATCACGTGTCCTCCATATGCCTTGTCTTGCCTCAGCACAGGCATCCAATCAGCCCCAGTCCTTAGAAGCAA 
CAAACA AACTG C AGC ACACC ACA AG AGTT TTGGTGTGTTT CT C TTTATGGAGTT CCG ACAAAGGC AGTT CAAC TAC ACAGTAT AAGGC AG ACC AAT A 
CATGCACGTTGTTAGCAAAAAAATCCTTCATCACGTGACTTTTCACATTCTTGCTTTAGCAGAACATCCTCTCTCCTGTGTCTGCTTCAGCGAAACG 
TTCCTTCCTGAGTCTGCCTTAGTTAGCCTTTCACACCTGTGTCCACTTTAACAAACTGTTCCTTCACGTGTTTGCCGAGCAAAACACCATCCGACCG 
ACCTTTCAAAGAACCCTTAAGTTTCCACTTCACCCGGGGATTGGGGGGGGGGCAGTGGCAGAGGAACAGGAAGGCTGGCTATTCTCTATATTGGCCC 
AACATTCATGAATGAGCTCAGCT AACTT CTT CTCTTCTGTGG GCCTCAGTTT CC T CATCTGAGAAATGGCAGT CTTGAG ACAGGGTGGGGCCCGAGG
AAAAGGACTCGGTTCTGGGGAGGAACAGCTTCAGAGTGCAGAGTTACTGTGTATTTACTGTGGGT^ 

AAATTGATGGGGGTGGTGCACGCAGTGTAGGAGTGGGGGGCACACAGTGCAGGGGCTGGGGGCACACAGTGCAGGGGTGGGGGAATGCAGTGCAGGG 
GTGGGGGAATGCAGTGCAGTGCTGGGGGGCATGCAGTGCAGTGCTGGGGGGCATGCAGTGCAGGGGNNN^ 



NIWNNNNNNNNNNNNNNNNNNNNNNNNNtt^ 

GCTGGGGGC ATG CAGTGCAGGGG TGGGGCTGTTATCTGCTGCTTTGGTGTGTGTGCACCAGTGTGGTC TGC TC CAGGAGAGGAAC CGG CAGACTTCT 
TTCAGAGAAGCAGTGTTAGG AGATAACTGAAAGGGGAAGTGGTTATGGCGGGGT GCC TCT AAGGAG CCCCCTTTCCTGCC C AGGC CCAAGAGGTTCC 
TCTCAGC TCTCCCTGGCAATGTTGC T CAGGCTGCTTG CTGGG ACTGAGAGT AGAC AAAGGGACTGGAGAGAAG CTGGGAACT CTGGAGGGGATAAG A 
CTTGGGT CAAGGGGGGAGGGT AT AGGGGTCT TTC GGG ATAGC ATT TGAAATGTAAATGAAGAAAAT ATCTAAT AAAAAAAAAAGACrTGCTGGGCGT 
GGTGGCGCACACCTTTAATCCCAGCACTCGGGAGGCAGAGGCAGGCGGATTTCTGAGTTCGAGGCC^ 

CCAGGGATATAC AGAGAAACCCCTG TCTTGAAAAAACAAAACAAAACAAAACAAAAC AAAAG ACTTGGGTC AAACT GAT GGCTTGGAA CAAAAAAGT 
CCCACTTCC ACTGTGACTAG C TGAATGACCAT AG ACAAAC AAACC AAC AAAAATTGG CTCCTTGGT GCTGTTGG CT CTC ATGGGGCCTGATAACATT 
CACCTCATTACTGATTGGAGGCTGCCTTGGGGTAATATCTGTAAGGGCTTCAGGCATCCTGTAGGGTGCCAAGGTGTCTTCCACTCCTTGCTGCAGG 



GGGTGTCAG ATCTCATTATGGGTGG TTG TGAG TCACCATGTGGTTGCTGGG ATT TGAACT CAGGAC CTTC AGGAG AGCAGTCAGTGCCC TT AACTGC 
TGATCCACCTCTCCAGCCCAGTGGAAAGACTTCTACCACTGATGGTGCAGAGCCTTATTCAAGATGGGAC^ACTGTGCAGTAACTGTCCCTGGTGCT 
GGGTTGTACAGATCCTGCAACTGAACCAATCTCAAGGGGCATTATGCTGCATCTGAGGCCAC^ 

AAGGCGTGGGTCACCCCTACTGTGGGATTACACATAATGCTTTGCTAGAGCACTCAATTGTCAGTATCGTTGTCACTTCTTACAGCTTCCATCTCTT 
GGCCACCTCACTGCCAACTGCCTCTGCTTCCAGGGCCACTTTGTAAC^ 
N GGGACACAAGCTTCAGTTCCATTCAAGGAAGAACTTCTAATCAGGAAGATGAAGGGTTGAACTTC^ 

ATGCTAGGGGGCAGTGAGGTGAGAGATTCAACCACCCACTCGCAATCCTCCTGGGCCCCCGAGCCCAAAGTGGATGCATCTGAGGACAGCTTTCTCT
CACCCTTTGTTTGCTGTTTGGTTGGTTGGTTGATTTTGTGAGACAGGGTTTCACTCAGCAGGCAGCCCTGGCTGTCCTAGAAACTCTGTAGACCAGG 
TTGGCCTTGAACTCAGAGATTCACATGCCTCTGCCTCCTGAGTGCTGGGCTTAAAGACATGTGCCATCACCACTTGGTTCCCATTTGTTTCTTGTAA 
AGTG TG TTT TCTTTTTTT CTT TTTT TTT C AT TTTTTATT AGG T ATTTAGCTC ATTTACATTT C CAATG C TATACCAAAAGTC CCT GAT AGTGATGAT 
CAGAACATTGTCTCCCTGATTGAAGAGGACACTGGGATTGGGCTCTGCTATAAACCAGACCCCCCCATCCCGAGACCCCCAAATCTCCTGATATTAC 
ATCCTGAGCCTCTATGAGTGTAAAGGAGCTCGTGTCTGATCTTCCACAGACGTGACTTTTTCGCCTTTAAGGGACCTCTTTAATAGGTCCCCCACCC 
TCTGCCAAAGTGTGCTCTCTCCCACCATTANNNNNNNNNN^ 

TTGCGTGTCCAATGGGCCTCTCTTTCCAGTGATGGCCGACTAGGCCATCTTTTGATACATATGCAGCTAGAGTCAAGAGCTCCGGGGTACTGGTTAG 
TTCATAAGGTTGTTCCACCTATAGGGTTGCAGATCCCTTTAGCTCCTTGGATTCTTTCTCTACCTCCTCCATTGGGAGCTAAAGTGTGTTTTCTACC 
CAAACCTCTGTAATGGGCCACATTCTCCCTTTCTGTTGCTCGTCTATCCCATTGGTTCAAACCTAGGAAGATATATTTTTTTCCTTCCATGAGACAA 
GAAGGCTGTGAAACAGAGGCTTCCTACCGTTTGTTGAAAGGTCAATTTGTCCAAAGCCCCAACATATTATTTTATAAAGTTTTAGTTTTATTTCATG 
TGCATCTTTTGCCTGTAATTATATATGTGCACCAAGTATGTGCTGGGCGCCTATGCAGGTCAGGAGAGTGCCAGGAATCCCCTGGGCCTACAGTTAC 
AGATGATGGTGAGCCACCATGTGGGTGCTGGGAACCAAACCAGGGCCCTCTGAAAGAGCAGCAAGTGGTCTTTAC^GCTGTGCTGTCTCTCATGCCC 
AAGCCCT AC CAT AAT AAACAATT ATG AAA C ATGGCC CT AAC CTCAGAGTCTC AAAGT C TCATT TCT AAGTTCAGGGC CCT CTAAGGTTT ATAGCAAC 
TTAGATTCATACTAGGATGTTAAAGCTTCTTGGTGAATTTGAAGCCGGGATAACCTTGTAACCCAAAAAGCGGAGACTTCTGGAGTTCCTTCAAAGG 
GTAGATAGGAAGTTTACCGTAGAAGTGCTTTTCAGCTCCATGGTCAAGTTTCGGGTGGACCAATAAGGTCCACAAGGTTTCACTACAAGAGAGGCTC 
TGAGGAAAGCGCTTGTCAACATTCTGACCCCTACCAGCAGGAACATTACATGTGCCGCCTACACAATGTTTTGCTCCCCCGTCCTTTCCCCGCCCAG 
TGTCCATTCATTTACCAGGTCCCAGCAGGGCGGGGCTGTCCTGTTTTCTATCCCTGGGCTGGGCACACGTCTTACGAGCGCCTGCAGTAAGCAGTCA 
ACAGTGTTTGGCCCGAATAGGGGCACAGGCTGTTCCGGAAGTGGACAGTGTCAGAGGTTCTAGCCGGAGAGGAGGGAGGGAGAGCCGCTGGGGAGCC 
TCATTGCCCCAGGGCCGAGGGTGGAGCCTTTCCCAGTCGGGAAAGGGCTGGGCCCAGCCTACATGTGAGGGCAGGACAGAGTGACTGCCAGCGCTCC 
CTTGAGAAGTCTGGGTGTAGCACGCTGGCGGTTGACTTCCCAGGACTCTTCTGTGACCTGTGAGCCGAGGATTACTGGGTACCCCAGATTCCCAGAA 
CCCAGTCCTGGATTGCAGATCTGTAAGGACCGCGAAAGATTGAGCAAAGAAGTCTTTCTGGACTTAAGAGTTCCGGAACAGGACGTCCAGCCCATCT 
CCGGATCCCGACTGGATCCACACCTTCAGCCTCGTGCCCTGTCCCGGGTTCCCCGCGCCCGGGGCCCCGCGCTTGAGCGCCCCCGCATGGCGGGGCC 
ATGTCCTAGGGCCGGGGTTCTAGAACGGGCGGGCAGCTGCTGGCAGGACCCGCTGGCGGAGGCGCTGAGCCGGGGCCGCTCGTCTCCTGCGGTCACG 
GGTCGGGGCTGCGCGCGGAGCCGGCCTCTCAGTGTGGTCTACGTGCTGACCCGGGAGCCGGGGCCCGGGGTGGAGCCCGGGTCGGGAACCGAGGCAG 
AGCCGCTGCCGTTGCGATGCCTGCGCGAAGCCTGCGCTCAGCTCCAGGGCACACGGCCACCCCCGCAGCTGCGCAGCCTACCCTTTGCGACGCTGGC 
GCTAGGCGACACCGCTGCGCTGGACTCCTTCTACAACGCGGGTGAGCGACTTGGGGCGGGGCTTGAGTCTAGGCGGAGCAAGGGGTGGAGCTTTAGA 
ACCCTGACGGGGCGGGTGTGGGTACCAGGTCTAGGGAATTTACGGTAAACTAAAGCCGAACTTGAGACAAGGCAGTAACGACAGGGCATTTAGTGGA 
TTATGGATGTGATGGGCTTGAGGCTGGATCTGGGGATTCAGGCCTGGCTCCGGAGTTTAAGTATCTTTCGGTGAGCGTGGTGCCAAGCAGGTGGGGC 
GGAGCAGGCGAACTCTAAGGGCGGGGCTGGGGAGGGACGGGGCTGAGGGAGGGGCAGAGCTGCGC^ 

GATGATCTCTCGTTGGACTTTTGAGGGTGAATC^GGCCTGAGACGGAAGTTCGGTCTATGGCAAGGGGCCATCTCTGCAGGATTGGGGTTGATAAGT 
CCTAGCATACTACATCCCACATGGGACTAAAGGGGCGGGAAAGGAAGTACTGAGGCAACTCTTCGGGACTGGATTATAGGAGAAAGGGATTGTATTT 
GGAGACTAGGAGTCGAAAGAAAATGGGTGGGGCAGGGGTGGGTCTAGAGGAGACACAGGGAGTCCAAGGGTGAGGTTGGAAGATAGCTATTCCTCTC 
TTTCCCACCTACCGGATGCTCCACCTCATCCCTATTTGTTTATCTCCGAGTCAGAGAGCCTTGTATCCTGGACATCTGCTAAATTGTAGCTTAAAAA 
TGAAGGCTGACCCGGTCAGACCACAGATAAAGTGGAATCATCTGCGTTGACTCATACAGTCCCTCCCTTGCGGACATTGTCACTTAAAGATGGCT 
ATACAGTAAGCTACACAAACACACATAAAAGCACGACGTTGTCTTCACACACGATGATGCAGATCACCCTCAGGACCCTGGCTTCAAGATTGGAGAG 
ATGGTTCTGAGGTTAAGAGCACTTGCTGCTCGCCCAGAGGACCCGAGGTTCCCAGCATGCACACGGGGTGTGCATGCTCACAACCACCTGTAACTTT 
ATCT CC AGAGGATC CTG ATG CCGCTTTCT AT CAT ACACTCACACC AACACACACATA CATATAAATAAAACAAGAGAGAGAGG AGAGAGAGAG ACAG 
AGAAAGAGACAGAGAATGGGACTTCTTTTTGCAAAGTTCCTGTCTCCTACATAACCTGGAA 

TAACACCACCAATAAGAGTACTGTGAAAACCCCGGGAAGCTCTCGAGAGCAGCACCGACTTTGGCATGAGGTGCAGGCAAGGCACTTGACGTGA 
GCATCTCCAGGCTCCTTCGCCACTGAGCTGGGCTTCCACACTCAGGCCTACACTTCTGTGTCCACAGATGTAGTGGTGCTGGAGGTGAGCAGCTCCC 
TGGCACAGCCTTCCTTGTTCTACCACCTCGGTGTGCGTGAGAGCTTC^GCMG^ 

GCAGGCCCTTCGTGTAAGTGGCCAGGGGGCTGGGGGCTGGGAAGACAGGTGCAGGGTTGGGGTGAGGCCACCC 

GCCTCCTGCAGGAGGATGTTTTCCAGAAGAACTCGGTGAGCAGAATCTACCCCTCATGCCAACGTTATAACCTCTGACCCCTACTAACCTTCCTAGC 
TTCCTCCTTTGTCTCTGACMCAGTGTGACCCTCAGCACCATGTAGCATCC^ 

CTAACCGCCAGTCCGTCTCTAACCCTGTGCTGACCTATGTGTGACCTCCACTAATTGCTGCCTTCTTCCCACCAGGATTGTGTTGGCAGCTACACAC 
TGATTCCTTATGTGGTGACAGCCACTGGCCGGGTCTTATGTGGCGATGCAGGCCTCCTGAGGGGCATAGCCGATGGGCTAGTACAGGCTGGGGCGGG 
CACTGAAGCCCTCCTCACTCCCCTGGTGGGCCGGCTTGTCCGTCTGTTGGAGGCTACACCCACAGACTCTTGGTAATGGGGCTCAGGGGGAGGCGGT 
ACTAGGTAGAGACTGATGAAGGTAAAGTGTGCTGGGGCTGAGGGGCAGGCTCATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC 
TCTCTCTCTCTCTGGTCTCTGGGCATCAGCGGCTATTTCCGGGAGACCATTCGTCAGGATATCCGGCAGGCTCGGGAGCGATTCAGTGGGCAGCAGC 
TGAGGCAGGAGCTGGCTCGCCTGCAGCGGAGGCTGGACAGCGTGGAGCTGCTGAGCCCTGACATTGTCATGAATCTGCTGCTGTCGTACCGCGATGT
CCAGGTGTGTGGCGTTCGGAGAGGCCATGACATAGGATATAACTCCAGCACTCAAGTGCTTGTGTCAGGAGGGGCAGGAGTCTGAGGTCATCCTGTT 
TACAAGCAAGTCTAAGGCCAACCTGGGCTTCAAGGAATCTTGTGTGTAAACCTAAGAAAGAGACCATTGTATGACAGACAGATACAAGTGCAGCTCC 
TGGCTGAGCCTCACTAGTGGCTTTAAGTAAATTGTCTGAGCCTCAATTTTCTTATTTGTTTCATCATCATTATTATTTTTTAAACATGGTCTCACTG 
TGCATTCCTGGCTGGCCTGGAACTCAATATGTAGACCAGGCTGGCCTACATAT^ 

TTATTGTTTTTTTTTTTTAAAGATGCATTTTATTTTTTTAATTGTGTGTATATATGTGTGTCTGAGTTAGGGTTTGTACCTTGTGTGTGCAGTTACC 
TGTGGAAGACAGGAGAGAGTGTTGGATCCTTTGGGACTGGAGTTAAAAGAGGTTGTGA^ 

CTGGAAGAGCAGCCTGCACTCTTAACTGCTGCTAGTTATCTCTTCAGCCCATATTTCCTTCTTTTTTGAGGGATGATGTGGTCATTGCCTCACAGGG 
CATTGGTGAGGTTAGAAACACCCAATGTGGGGTGCATAGTGGGTTCTCCAGGAAGGTGTGGTCTGTAAATCTGCTTGGCCCGAGGGAGGTGTGCAAA 
GTC TTCAGGTTGTGGGT AGT AGAAATTT CCT T CAAACTGACAGTAGTTACCCCTGTCTTAAATCCCAGCACTCAGT AAGCAG AGGCAGGCAGATAT C 
TGAGAGTTCAAGGGTAGCCTGGT CT ACAGGGTGAATT CCAGG ACACCC AGGGCTATGTAGAAAAACC TCATCT CAAAAAAAC AAGCAGAAAAAATGA 
TTTCTGCTGGCAGTGATGGTGCGGGTCTTTAGTCCCAGCACTCGGGAGGCAGTAGCA^ 

GAGTTCCAGGGCAGCCAGGGCTACCGATCTTTCCTCGCTGTCAAGGGCTCATCGAGATGCTGCACGCTGTCTCCCACAGGACTACTCGGCCATCATT 
GAG CTGGTGGAG ACGCTGCAGGC TTTGC C C AC CTGTGATGTGG CTGAAC AGCACAATG TCTGCTTTCACTA CAC ATT TGC CCT C AACAGGTGGGT AA 
TGGGGCACGGCTTGAGTCAGAGTCTTGGAAGTCTCTGTGACCTGCCAAGCAAGAGCACTG^^ 

CTGGGTCCTAATGTCACAAGGTTCCTGTTGGTCTGGGCTAAGGATGCATGGTGGGGACAGAGGACGTGGGATCCCCTTGATAGGGTCATAGCTGTCC 
AAGGTCGCTACTCCTTAGAGTTGTTTGCAACTAGAAATCCCTTCCATCTGGAGTCACTAACTCTAGAGATCAATGCTTCCCGGGGTTAGTGTCATTT 
GAGGCTCATTAATAGTAAGCCCCCAGAAGGGTCCCTTCCGTTTATGGTCTATTGGGTCCCAGGACTGGGTGGGTCAGAGCCATCAGACAGGTCTTCA 
GTGTATGTTCCTCCGCCTTAGGAGGAACAGGCCTGGGGACCGGGAGAAAGCCCTGGCTGTCCTGCTGCCATTGGTGAAGTATGAAGGTCCCGTGGCT 
CCGGATCTGTACTGCATGTGTGGTCGTATCTACAAGGACATGTTCTTC^ 

AATT CACAT AAGAGT AAAGAGAGGG CTT AGCGATGGG ACT GAAGGGTGGGGTTGCGGGCATC CAT TCAGCCTGG CAT CC C CTG CCCGACCCGACTC C 
AGGTACCGCAAGGCGTTTGATGTGGAGCCCAGCCTGCACTCAGGCATCAATGCAGCTGTTCTCCTCATTGCAGCCGGGCAGCATTTCGAAG 

AGGAGCTCCGGCTGATTGGTGAGGCCTGCTGTTAGCCCACAGCACCCTGCACAGGCTCCAAAGGGCAAGGAAAGGGCCCCAGGCGGGAAAAAGCGCT

GACTC&AAGTGGGCTTCGAGACTGATCGGTGCCTGAGGTCTGAGGTCTCAGGAGCCTTTGAGTGCC^^

CACTTTATCCCACATCTATTTGTGCCCTAGCTCTCCCCTGTGCTAGGCTCTTAGCGCTGAGATCTTGCCACACCTACCAGATCTACATTCATATCCA
CACTCCTGTGCGTGAGGGCCACAGTGGGACCCACACTTGAGCCTTACTCACTCCGGGACCTACACTGGATTGAGCCCACGCCACCCTCCCCCAGGCA
TGAAGCTGGCCTGCCTGCTAGCCCGCAAAGGCTGTGTGGAGAAGATGCAGTACTACTGGGATGTGGGATTCTACCTGGGAGCCCAGATTCTTGCCAA 
CGACCCCATCCAGGTGGTGCTGGCTGCAGAGCAACTGTACAAGCTTAATGCCCCCATTTGGTAGGTGGCCCCTTCTTCAGCCCTGCACCCATGCCCG 
TTGGCACCCCTCCCCAAGCCATGACCCCAATACTTTGATGTCACTACGATAGATAGACAGTGCCCGTGTTCCCCTCTGTTCGTGGTATGTGATGCCC
GTATCCTGACTACATGCAGCTGAGTGGCACCAGCCGGCTCCTAGGCTGAGCTGAGGCCAAGGCTGTCTGTGCATTTAGGTACTTGGTCTCCGTGATG
GAGACCTTCCTACTGTACC^GCATTTCAGACCCACACCAGAGCCCTCAGGAGGACCCCCGCTTCGAGCTCACTTCTGGCTCCACTTTCTGCTACAAT 
CCTGCCAGCCTTTCAAGATGGCTCCTCCTCAGGAGGACCAGTGCCTGGTAAGGGCAGTGGGGCCTAGGGACCCAGGGGCCCAGGGACAAGCACATTA
TGACCGATCCCTTTCCCCGCTACTGCTTACAGGTACTGGTGCTGGAGATAAACAAGGTCCTGCTGCCTGCCAGGCTTGAGATTCAGGGAACAGACCC 
CATGAGCGCAGTGACCCTAAGCTTGCTGGAGCCAGAGACTCAGGTGAGCTCTGAGACCCATTACTCACCGATCCACCTTTCTTTCTCCTATCCTCTC 
TAGTCCCAGCGCCTCCACAACCTCCACCCCTTTCTTCTCACAGGAGGATCCTTCCAGCTGGACCTTCCCAGTCACCTCCATCTGTGGGATCAGGTGA
GAGCTGAACCTTCCGGCAAGCTCCTGTGCCACCAAGCCTGGCATTCAGTAGCCTCTCTTCTTCCCTAGCACCTCCAAGCTGGACCAGCGCTGCTGCT
TCCCTGTACGTACTTCCTCCGGCCCAGGACGTCCAGCTGTGCTTTCCCAGTGTGGAGCGCTGCCAGCGGTGCGCTCAACCTTTCTGAGTTGGGGATG
GGACGCCCAGGGGACGGAACGGACCAGGTGCTCCTGGGGGATGGTTCCTGGCTGTTCCCCTTCCAGCTGTTTCCCCCTCTT'CAGGTTCTGTGGCCTA
ATTCAGGTCTTGGTGATGAATCCGGATTCCTCCGCGCCCACTGAGGAGGCAGAAGGCGCAAGGGAAGTGCTGGAGGTGAGGCTGCAGAAATGGGGGG
GTTGGAGGAGAGCGAGGGGTGAAGGGTCCCTGCTAAGCGGGTGCTCTGGGTTCCACAGTTTGATTATGAATACTCGGAGACCGGTGAGCGGCTGGTG 
CTGGGCAGAGGCACATATGGGGTGGTGTATGCTGGCCGAGACAGGCACACGAGGGTACGAATCGCCATCAAGGAGATCCCAGAGAGAGATAGCAGGT
GGGGTGCATTGGTCGATGAAGGTCTGGGGTCTCGGGAGGGTATATGCAAACAGGGTGTGTGTGTGTGTGTGTGTGTGTGTCTGTCTGTCTGTTGAGG 
AAACGTGAGAACTGACTGATTGTGGGCATGGAAAGAGCTCTTTAGTGGCTGGACTAGGGGTGGTGGCCCTTACAGTAGATAAGAGAGGTCTGTGGGC 
GGGGTGTTGGATCTGGGGGCATGGTTTCATGGTACAGGACCATGGGAGGTTATCTCTGCTCACATCTGCCCATTGCTCTCTCCTCGACACTCCCAGA 
TTCTCTCAGCCTCTGCATGAAGAGATCGCTCTTCACAAACGACTTCGTCATAAGAATATTGTGCGTTATCTGGGCTCAGCCAGCCAGGGCGGCTACC 
TCAAGATCTTCATGGAGGAAGTGCCTGGAGGTACCAGCCCTGCGTGGGAGGGGAGTGAAACCTGAGGTGTGGGGCGTGGGAATGGAGCCAGACCAGG 
GTGGCAACCCCTGAGGGGAAATGAACTCTAGCGTGCGTGGGTATCACCTAGAATTAGCTGACAGGGCTCACTGGGTGATGGAAGTCTGGGTCATGGG 
AATGGGTGTGTTTGGGCTTTGATCAGAGGGATGTAGGTAGAGTAAGCGGACATCAATTATAAACTTGGTTTGTCTCTGCCCAGGCAGCCTGTCCTCC 
TTGCTTAGGTCAGTGTGGGGACCCCTAAAGGACAATGAGAGCACTATTAGCTTCTATACACGTCAGATCCTACAAGGACTCAGCTACCTCCATGAGA 
ACCGCATCGTTCACAGAGACATCAAGGTCAGCCCTGTGGCTGCTCCGGGTGTGTGGAGACACAGTACCAAACTGAAGCTGACAGTAATGGTGGGAGA 
GAGCACACTTTGGCAGAGGGTGGGGGACCTATTAAAGAGGTCCCTTAAAGGCGAAAAAGTCACGTCTGTGGAAGATCAGACACGAGCTCCTTTACAC 
TCAT AG AGGCTCAGGAT GTAATATC AGG AGATTTGGG GGT CT CGGGATGGGGGGGTCTGGTT T ATAGCAGAGC C CAATC CCAGTGTCCTCTTC AATC 
AGGGAG ACAATGTTT TGATC AAC AC CTT C AGTGGGCTGCT CAAGATTTCTG ACTTTGGC ACCTCTAAGCGG CTGGCAGG CATCAC ACCGTGC ACGGA 
GACTTTCACAGGTAATGGGGGGTGGGGGTGGGGTGAGGGTGTCAAGCTCTTGGAGACTTCTGGGGAATGGAGTTTTTTTTCTTGGCACAGAGTTACA 
ATTTTAAAGATTTATTTATTTATTTTATGTATATGAGTACACTGTAGCTGTATAGATGGTTGTGAGCCTTCATGTGGTTGTTGGGAATTGAATTTAG 
GACCTCTGCTCTCTCTAGTCGGCCCAGCTTGCTCAGTCCCTGACTGCTCTGGCCCAAAGATTTATTTATTATTATAAATAAGTACATTGTAGCTGTC 
TTCAGACACACCAGAAGAGGGTGTCAGATCTCATTAAGGGTGGTTGTGAGCCACCATGTGGTTGCTGGGAATTGGACTCGGGACCTTCGGAAGAGCA 
GTCAGTGCTCTTACCCGATGAGCCATCTCACCAGCCCTTGAATTACAGTTTTAATGGTGAATGGAAGGAATCTGGCTGGAGACAGGTGACTGGATCC 
AGG AAC CGCTGGTAAAAT CTAT ACT CC AAGGTTGGGT AACT AATGAG AAAGACCTCAGCATCGAAAACGGATG CAAGG CCGCGTGTGTTGGGGGTGT 
GAGACGGGTTGGTAGAATTTGAAGGGAGGACGATGTAGCGTCACGTACAAGCCAGAACCCTCCTGATGGTTGAACGCTATGCAGCCTAGAATGAAGG 
ACGGAAGCTTGGCAGGTGACGGGGCCTAGGTGTGCACATGGCCAGTGGATGATGGGGACTTGATGCCTACCTGGCATTGTTGTTCCTATTTGTGGTC 
TAGGGACTCTGCAGTATATGGCCCCGGAAATCATCGACCAGGGCCCGCGAGGATATGGGAAGGCAGCTGACATCTGGTCTCTGGGCTGCACTGTAAT 
CGAGATGGCAACAGGTCGACCGCCCTTCCATGAACTAGGGAGCCCTCAGGCTGCGATGTTTCAGGCGAGTCTCTCTGGGGTGTGGCCTACGGAGGGT 
GGGCGGAGCCAGATGGGGCCAGTGGGGTTATCTATCCCATCTGCCCCACCTCCGCAGGTGGGCATGTACAAGGTACATCCACCCGTGCCCGGTTCCC 
TGTCAGCTGAGGCCCAAGCCTTCCTCCTCCGAACTTTTGAGCCAGATCCCCGCCTCCGAGCCAGTGCCCAAGAGCTGCTGGGAGACCCCT 
GCCAGGGAAGAGGAGCCGC^GCCCGGGCTCTCCTCGGCATACTCCCCGGCCCTCAGGTGCTATGGGGGTGGAGGGGGACAACCTAGTGAAGGGGAGA 
GGGGCTGATGAGGCGAAAAGACTAACAGCTTGTTCCTCCCTCAGGCGCCCCTTCCGGTCCTTCCAGTCCTTCAGCTGACTCAGCCACACAGTCTCAG 
ACATTCCCAAGGCCCCAGGCACCCTCTCAGCACCCACCCAGCCCCCCAAAGCGCTGCCTTAGTTATGGGGACACCAGTCAGCTCCGGTGAGCGCCTA 
GGGTCCTGGAGTGGGTTCTCTTCATCTTTTTTTGCCGTTGTATTAATTTCCTCCA 

CCAAGGACCGTATCTACTACCTCCTGCCCTCTCCCAGATGCCGGTGGCATTCACTCTGCCAACCCCCAGATGGGCTTAATTTCTATTATCCCCGTGA 
AGTAGGCATAAAAGAGCCAAGATTCAAACCCTGCAACATAGGTGGTTGGTTGCATGGTGTTGCCCACCCTCCATCTAATGCCCCTCATTGTGCCCAG 
TGTGCCCGAGGAGCCCGCTGCCGAGGAACCCGCGTCCCCAGAAGAGAGTTCAGGCCTGAGTCTGCTGCACCAGGAGAGC^GCGCCGGGCCATGCrG
G CTG CGGTGCTGGAACAGGAGGTGC CCAC ACTAGC AG AGAAT C TCCTGGAAC AGGAA CAGGT GGGCGGCCCGC CCC AGCGCCCCCTGGTGGTCG AGC 
TGGGATAAGAATTTATAGTGCACCGGCGAGCATCTCTGTTGACTGCGGATGCGAACATCCACGGACGTCCCCTCTTTTTAGGACTCTCGACTCAGCA 
AGATTCATGTGGAACTTCTGCTTCGGTGCCTTGGGGCACAAATCCAC^ 

GCTGCGGGCCCAGAGCCTGGGCCCTGCGCTTTTGAAGGGGCCGCTCTTCGCCTTTCCAGACGCGGTGAGGAAGCTATCTCGAAGTTAGGGCTATGC^ 
ATAACAGGTCGCGAGCAAGATCTTGATGTTCCAGGATTCTATGATTTCCAGGAAGACGCATGTCCAGTCCTGGCTTCCCTGTCCCACCCTTGACCCC 
AAGT CCG CC CACCT CTTTTAG ACTG CAG CCG ACTT TCT AGGGCTTTGC AACTCAAGCCC CAC CTCTGTTTG AG ATTCAACTGTCTCAACCGCGCCT T 
CCTTCCCTCCCCAGGTGAAGCAGATCCTCCGCAGAOSCC^GATCCGCCCACACTGGATGTTCGTGTTGGACTCGCTGCTCAGCCGTGCAGTCCGGGC 
GGCGCTGGCGGTGCTGGACGCGGGrAGGTGGGGTTGGATCCCGCCGGCCGGCCGGGGCGGGGTGCGTGGCCCTGGGGGCGTGGCGAACATAAAATTG 
GTACTTCGCGGCCCTCCCTTCTCCCTGACCGTCTGGGGTCCTAAACGGGCGCGGCGCCCACCTGGAGCTCAGTCTACTCAAATTGGTAGAGTCGGAG 
AAGAAAG CGGT CTTACCGAGGTC AG AAG AGTCGAGTAAAG AGG AGT CC CAG CAG AAACCTCAGGAGAGCCAGG CGC TGCAGAGCCAGCTCCCACCAG 
AGC AGGGAC CC CCATCGTTGATGGTGGAGTTGGGCCTTTTGCG AGCCGAGA CTGA CAGGTGAAGC CTTTTGGAATC ACTCGG ATAACC TGCAG AGGC 
TTCAATCTGCAGGCCCTTCCATTCTACCACCCCTACCTTCCCACAGGCTTCGGGACCTTCTGGCTGAGAAGGAACGTGAGTGCCAGGCGCTGGTGCA 
ACAGGCCCTGCATCGGGTGCATGCAGAGACCAGGAAGTATGCCCCAGCTTCCGAGACCCCAGGTGAGCCGACTTGACTGGGAAATGGTGAAGGTAAG 
GCCACAGAGGACACCTTTGCCCTCCCTCATCACCTGCCTTTTTGAC^^ 

TGTAGATCCAGCCACTATCCAAACGGTGAGTAAGTGTGAGCTGGCTCCCACCACCCACCCACCAAAGGCATCTCCTCTGTCTTCCAGCTATTCAAAC 
TTTGTCATTGC TGC AAAGATGCT CAATA CT GAAAGAGCCCTT AGT ATGCAC CT CTC CTC TCATT ATAGGG ATGGGG AAACT CAGGCCACAAGGGGGC 
CGCCTGCCTGAGAGTTCTGTTAGAAGCAAGAACCCGAGTTTCTAGTTGAATTTCTGTGCAACTTGATCTTTCTCTGTTTCTGTGGTTCTCGGTTCAT 
AACCAGCTCCTGAGTC^TAGCTTCACCCTTCAAACCCTGCTCACCTGTGCCACTCAAGATGATCTAGTCTACACCAGAATCAGGTATGTCTTGGGCC 
TTCCAGGGGTGAGTCATGTAGCACTTGCTAGCAATTTCAGGACATCCCCTTCCCTATGGGCCTGGCCTGCCTTTGATTCCTCCCTGAAAACCATAGT 
GGTTCCTAAAG AGAGAGCATGGGCTGCCTTCTGGCTGAATTACAT TGAT CC CTG ATGTC CCC AGGGGAGGGATGGT ATGCCGGAT TTG GAG AGC CAT 

CTTGGCACAGCGAGCAGGAGCCACATCGGTCACCCCAGTACCCCGGGACGCTGAGTGATGACATCAAAGGACACAGATACGGATGGATGAACAGAGA
AGACAC AAACAGCTTTGGAC ACACC TGT CCTAGAACCC AGGAGCAACT ACAGGG ACCAGGCCCAACCCCAA CAC C 

CAAAGGACACTACATACAATCATAGACTATTAAACAGAACACTTGCTACCATGCTTGTCTTGGCAACTTCAGAAGGGTGTTGGGGGGAGTGAGGAGT

GTCCAAGGAGAAGCCCCTTGAGGTCTCCATGTCTGAGTGACCCTTTTTCATTTCCTGGGCTTTCAAGCTAGTGCCTTACTACCCAGACTTCATACCC
AGTTAAGCAGTCAGTAGCTCTGTAAACAGACTTTATTGATTCCGGCCATGGGTGTGCTTTAGGTGGGGGGTCCACAGAGGGGTTTGACAAGGCTATG 

CCCTGGGGACCAGGTTGGTTCTGAGGGGCAGAAGGCCATCCACCCACTCACAGGGCCGCTCCAGGAGTGTCTGCCACAGCTGCTTCTCCTCCGGTGT

GGAATCCATCCAGGGC^CCTGAAGCCCATAACTGCTGCCTGGATGTGGGTGAGCC^^

AGCAACTTTCTGGTCCCAGATTATGCATTGTGCCCAAGCATCCCGAACCTCCCAGCTTGGCTCCCCTCAAGGGCCTGCTCTGCTCACCTGTACCAAG
GCTGAGGCGTGTGCCCCCCAGCTGGCGACTGGCCAGGGCCCCATGGTCCCACAGGGAGAGCTCGGCACAGGCTTGGCGTAGGTCAGCAGGCCCGAAG
CCGTCATAAAC CATGGTGTGGTTGAACACAGGGCTGAGACTTCTT CGAACCACTCTTGTGCGTTGGCGGCTGGCCCGACTGTCATCAGGCAGCACTG
AGCTGCGAC AGGGTAATAAAGGAGGGCTAGTCTTAGGCAATCACCCTCTGACACTCCCT CCCTGAGCCAGCCAGGCACCATGTCCTGTCCTTTCAT A 
TGGCCGTGTTCCCTGACCGTTCCCTGGCCCTGGCCTGAGATCAGGTCTCTAGCCCTCTTACACTAATCTTTTCTTAAGGCTTCGTACGTGAAGCTAA
CTGTTCACAGGATTCCAGAGAGAAGTCTCTGAAATGCAATTCTGCTATTTTCTGGCCCAGTAACCTTCAGTATTCCCTGCCCTCCCCTTTAGTTTCC
AAAGCCTCAACCACTCTGTAGAGTAGAATAATATTACTCTGATATAGAGTAATAGAAAGGAAGCCAGGACGGGAGCCCCAACTCAGAGCAGTCTACT 
CTGTTTCACTCCTGGAGGGTGCGGTAACAACACCTGAAGCACCCAGAAACACTTTTAGAGGTTCTGTAAAAACTTACTTTGTAAACCACAGAGCTGG
GGTGTAGCTTAGTGGTAGAGCCATATATGTGTGTATGTATATATATGCATGCATGCTTAGGGTGACTGAAGCATTACGGTCAGTTCGCAGCATGTTA
AAAGACAACTACAATCACCA ACC ATTGACT CAC AAG TTCAG AGCC AAGCCCCATAGCT TGT C CCTCCAAGAGTTC CAAGGAGGACCCTT GTGGCCTT

TGC ATC ATAGTG CGTGC ATAGTAGGTGC ACAGCCT CTG CAGAC TG C AGCCT ATG CTGGCTAATGTCAACTTGAC ACAAACTTGAGCAGTGGTTCTCA
ACCTCCCTAAGGCTGTGGTCCTTTAAGACCCCCAACCATGAAATTATTTCATTGCTTCTTTGTAACTGTAATTTTGCTACTGTTATGACTTGTACTG
TAAATATCTGATGTTTGCCACCCACAGGTTGAGAACCACTGAGCTAGAGTCCTCTGAGAGGAGGGAATAAGCTAAGCAAATGCCTCCATAAGATGGG
T CATGC CGTGG CCAAGC CTG TAG AG C AT TTCCTT AAGT ACTG ATT GAG C AAGGCT AAGC AAG CCTTGG AGAGC AACC CATGAGGAGCCAGGCAGTCA
GCIAGCCCTCCTCTGTGGCCTCTGCATCAGCTCCTGCCTTCAGGTCCCTGCCCTGTTTGAGTTGCTGCCCTGGCTTCCTTCAGTGATGGACTACAGCG
TGGAAGTGTAGAACCCTTTCCTCCCCAACCTGCTTTCTGGTCGTGGTGTTTCATCAAAATAATAGAAACCCTAACCACATAGAGCCCCTTAGTT 
CAACTCAGATTTTGTTGTTACACAAGGTCTCACTATGTAGCTTGGGCTGACCTGGAACTAGCAATGTAGACCAGGCTGTCCTGGAACTCAGAGACAC 
ACC TGC CTTTG C CT C CC CAGTGCTGGGATTAAAAATGCAC ACC AC CAC CAGGC AGTGGTGACGTATG CCTTTAATC CTAGC ACTCGGGAGGCAGAGG 
CAGGTGGATTTCTGAGTTCCAGGCCAGCCTGGTCCACAGAGTGAGTTCCAGGACAGCCAAGGCTACACAGAGAAACCCTGTCTCGAAAACAAACAAA 
AAAATAAAACCCAAAAACATGCCACCGGACCTAGCCTAAGCTTTTATTTTGAGACAGGGATCTTGTGTATCCCATGTAGCCTTAATGACCTTGAACT 
AATCCTCCTGCCTCCAATCCTAGTTTATGTAGGTTGTTAGAGATCAACCCAGAGCCT CT^ 

CCCCAGCCCCCAACTCCAACCCCCTGCCTGGCTTATCACCTCTGCCCTGCTGATCTTGGCTGGGGGGACAGGTGGATCAGAGGTCAAGAGCAGAAGG 
GACTAACACCTTAGGCGGCCTCTCACCATTGAATGTAAGTGTCCAGGGATCCTGGTCGCAGTGGTACCAGGCTCTGAGCTTCCTTTACCCAGAAGTG 
TAACTCCCCACTCTGAGGCTGTCCTCCTCCTGTGGGGACACTCCAGTTCAGACTACCACCTCACAGAGTGGGTGGGTCTCCCACCTCC 
TCCAGCTTAAGCTCTCTCTTGGCAGTCACTTACCCTCTGAGCCGGCAGGGACATACTTAAGGGACAGAGACAGCAGTCCACGGCTGGGAAGCTCATC 
TGG AGATGG AGG AAC CTGAAAAGGCGGGGAGTGG AGAGAAAC AGG ATT ATGTGTGAGAAGGG TGTTGGGGACAGGG CCTCTAC TG AGGAGCCCAATG 
G CAACGTTC CC CAGAAG AGG CGCCCTGG CTGGCAAGC CTGGCCTAATGGTATCAGTATG AGGCTAAGTGAGGCTCTGCAAACCGGTCCGTTGGGTGT 
TTGGTC CAGTG GGC CTGTGGG AATATCAGTG ACTGCACACCAGCCCTTACC CGAGGC TGCAGGGGGAGC CAGGTAGCCTCAGAGTCCC AGTTC CACG 
TGTCTAGGG GCA CT T CT ACT TCT CC CAG AAAGATGTTGCGACCCAG ACTTT CCCGGTGC C ACACGG ACAGGCT C AGCACG CGGCC TGG GAGATC AGC 
CTGCTG G AC AG AGTG CTGGGGAC AGGAGGTGAGTTTC AGAGGCATGCC CTGGCATCGCT GTGGCTCACCTGTT AGCTGGAGATGGTCT CCCAGGTTC 
CTTTCTAGGATGGGCTCCTGAGGCAGGGTTAACTGGGGTCCTAQCACAA 

TTCTGGGGCCGTC^GGAAAATTAAGGAACCCGGCAGCCCCTGCCACTCACCCCACTGAAAATGGTACCCTGTrTATTTGCAGGGATGACAGAA 
TGATGCCTCACCCGCAGAGTCTCGTTGAAGATCGGGTTCAGATTCCGTTTCTTCACTGACGTCTTACGCTTGCTCTGCTTATCCGGGAGGAGGTAGC 
TTT T CACGT AGCTGGGC AAGGGATGTG AAGGCTGGAG GTAAGGAAG AG CCCTTC C AGAGCTTTGTCCAGCTCTGGGGTCTTCT CT AGTCAC CATCCA 
TAACACAGCGTGCCCATGTGGATGTTAATTAACAGGCATCCAATTTAAGACGCTTCCCAAT 

AGGCCGAGGCAGTTAAGTGGTAGGCTGGGTGATGGAGCCCCGCCCCTCAGGCCCCCTGCAGCCCACOTGTGACGTAGGCACTCACGGATCGGAGCGG 
CGTCGCCGGGC AGCGGC CAG GCCCTGGC ACTGAATCACCTGT ACT CGC AGCTCAG AGGTGCCAGG CTCGTAGTGCAGCGAGAAGAG CACGG AG C CT C 
GCACCTGCACCGTGCCTGCCTCCTCTCCGGATAGGCTCATCAGGCTGCCGCTCAGCTGTGGGCGGGGCGGGTGGATGATCAGGGTCCCCATAGGGAA 
GCTTCCCACTGTGGGGTTTGCCTGGGCTTTCCCAGCTCCCAGCCCCTCCCTAGCTCTTCACCGTAGAGGAATTGAGGCTGGACACAGAGGAGCTGCT 
GCTGAGCATGCGGTCAAGAGAGGGGCCGAGCCCCGGGGCCTCTTCTCCATTCTCCAGGATCTTGGAGGTCGCCTTGGACTGGAGAGGGGAGAAGCGA 
CAGGGGCGCCCTCTTACTTCCCTGGACCTTGCAGGGGAGTTGGGGGTAAGGGGTCATTCACCGACGGTTCCCACCCCCAGGGCAGGTACACTCTCAG 
TCTGGAATCCGCTCCCGGTCTTGAGAACAGCCAGTGGCTACTCCCGCCTACCTGGGCTGGAGTTGGCTGGGACTCCTGCTCCGCCTTCTGCTCAGGA 
TCCAGCTCTGGGTCCGCCTCTGCCACCTGCACACCCCCTTCGCCAGGGGCTGGAATGGGGTACAAGTTGCTGAAAGAGCCCCTCTCCTATGTGTCCC 
TAAGO^CCTGGAGGGTTCACCTGTCAGCTTCAGGCCCCAGGGGTGACCTAGAGTTGTCCCCGAAGGCCAAGTTGGGGTCCCAGGAAGAAGAGGAA
GGTGGACATGATGGTTGTGGGGGCAGTCATTAAAAATGGCCTCTGATCCCCTC^CCTCTTAGGTTCCCCAGCTTACCCTCCAGCTCACATTCCTGG 
T CCTGGGGCTC CTC CTCCTG ACCTTCGG AAGCCTTTG AGGGTACATATGGT GAGGACAAATC AGG CCC CTG AAGAAAGC AGAGGGTAG AG GTC ACGG 
TGGGGGTTAATGAGAAAGGGATCAAGGGTTATGCTGGGGTCAGAGGTCAAAGACTCCAGCCTGCTCACCTTGCTTGGTCTGAGTTAGAGTAGGGCT 
CCCTCAGGGCAGGGCTACCTACCAGCTGCAGGACCAGTCAAGCCTGTACTTC^ 

CTCCTAGGCTCAGCTGGTGGGGGCCCCAACATCTGAGTCCCTGGCCTCACCCGCCTCAGTCCAACCCCACCTTTGCTGGCGACACCTCGTGTGTGGT 
TGAC AATGACCTTGGACTCCTGTTT CTACCTC CTG AGTGG CAGGAT CTGAGGTG TTCACCACAATGCCGGG TTTATGTGGTGGCTGGGGACCAAACC 
C AGAGCGTCATGCGTGTT AAG CC AG C ACTCTATCAACTGAGCTACATCCTTTGTGTT TGTTTTTCAAGGCGGCGTAGGG CGG GGTTCCCTGTGTAG C 
CCTGGCCTTCTGAAACTCTAGAGTTGTTAAAAATTGGAGGCTGGCCTCAAACTCACAGAGATCCACCTGTCTCTGCCTCC 

GGGGCAACCTGCCCCACAACCACAGCCCTGCGTGTATCCTTATTCTTTGGTTTTGTTGTTTATTTAGACTTATTTGTTTTATGTAAGTACAGACACA 

CC^AAAGAGAGCATCTGACCCCACTACAGATGGTTGTGAGCCACCATGTGGTTGCTGGGAATTGN 

NMINl^NNNNNNNNNNNNWNNNNNNNNNNNNNNN^ 

NIWNI^NNNNNNNNNNNNWNNNNNNN^ 

NNNNNNNNNNNNNNNIWNNNNNNNNNNN^ 

NNNNmtfNNNNNNNNimNNNNIWNNNNNNNNN^ 

NMJNI^NNNNNNNNNNNNNNNNNNNNNNN 

Nl^Nl^N^NNNNNNNNNNNNNNNNNNNbT^ 

NNNNHNNITONNNNNNNNNNNNNNNNNJ^ 
NNNNNNNNNNNNNIOTNNNimNNNNNN^ 
NIWNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNl^Nl^NNNNNNNNNNNNl^ 

CTCCAGGTTGAGCAGCATAGTAATCACTGTTCATTAAGTGCTTATTGTGGGCCATGCCTGATAGTTTATTTTGCACATTACATGTCATGTGTGCGTG 

GTATGCATGCCTGTATGCTCCCATGTATGTGGTCATACGCATGTGGAGGTCAGAGTATGAACTTGCTCATCTTCAGCTCTCTCCATCCTATATACTG 

AGGCAAGGTCTCTCACATAAGCTCTAAGCTCAGTGCTTTGGTGAGTCTAGCTAGCCAGCTTACTCCAGCAATGCCCTAGCACCATTGCCGGAGTGCT 

GGGACAGTGAGTGGGCCACCTTACCCCTGGCATTTATGTGGGTGCTGGGGATATGAACTCCATTTCTCAAGCCTATGCAGCAAGTGTTTTACCCACG 

GAACCATCTCCCTGACCCCTTAGCTGATGTTTTAGTGGAGAAAGGGGAGGTTCAAGGTCAAAGTAACTGCCCAAGGTTACACAGCTGTTCCCAGACA 

CTGCTTGTCACTCTTGAGGTCCTCTGCCCTTCTGCTCCTCTGGGGAGACGAAAT^ 

TGCTCATGGCTATATCAGGTTTATATATATCTGGGAGCTGCCAGGAACCGATCA^ 

AAGGGCTGAGGGAGGGTCCAGAAGGGTAGTACTAAGGACAGGCAGTATCTGTTCACCAAAGTATCCTGTCCCAAGGTGGCGGCCAGAGTAGAAAAAC 
CCAAAGACATGGAGGCTAGGGAGGGACCTGTCACAGGTTTGAGAGAAGTGAAGAGCTCTGAGCTAGCCAAGGTCAATGGAGACCAAAAGGTAGCACC 
AAGAACCACCCAGACCCAACCACCAGCATCCAAGGACCCCGAATCCCTTCCTTCTATCTGATGGCTGTCCTGTCTCACCTGACCCGCCCTTCCTCCA 
GCTGGCGCAGGTGGGCATCTCGTTTGAGGACATCAGAGATGGCCTCCTGTTCCTCCTCTGTCAGGAAGCTGAGGTCCAGCAACCCCTCAACTTCAGA 
TCCTGGTCCATGTGCCATGGGGAGGGAGGGCAGGGCCCAAAGCCTCTCCTGAGATGGGTGGCCTCTCTGGGGCATCAGCTGGGGTTGTGCCCCAGCT 
GGGCATATGTAGCTTCCTGTGGATAAACCAAGGGCTGATCAGTATCTCTO^AATGCACAGCACACATCCTGAAGCCAGCTGCCGGCTGTCTCCCAAG 
CATATACAGGAGTAATAAGGACCAATCAAAGGAAGGCCAGGTCAAGGTGAGGTGGGGCGTAAGTGATGCACTGCAAGAGATGCACATATGCGCATGC 
TCCTAAGGATGAATGACAGAGTATTGCAAGACAGTACACGCTGATACACATACCCACTGTTCTACACACATTCCAGATACCTAGACAAAGACAGTTA 
CAT G CAC AC AC ATC CCC ATG AAGGAACTGTG CGCACACATAC ATCATG AAGGGA CTGTGCA C ACAT ACC C AG AGTC CT ATGCC CAAG AG CTGTGC AG 
CACAGTAATTCTTAAACCTGCACACAGTATGCCCACTCGAGGACAAGTGTGCACTCACAAGAATAAGTCCCTGCATCTCCACAGATCTCTCTGGATT 
TCCTCACACACTGTCTAACCTGCTGCATACTACTAGAAACTAAGGATAGCTCACTCCCCTCAAAGCTAGTATTTTGCACACAACATAACCTGAGTCT 
TAATGAAATGCAAAAATATACCTGCATAAACATGTGGATGCATGCACAGCCCTCCCCACCGCTCATGTTTATCCCCTACCTAGCCTGCCTCTGCCCT 
TGCCTCCCTCCACCCTGCCCTGGCCCATTTGGCCTTCCTGTCTGAGGCAACCTCCTCCCCAACCCTCACCCCAACTTCCTCCCCCCACTCCCATAGC 
ACCCAGTTCCACTGAGGCCACCTGCCTGAGATTCCTCCCCACCAGCCCTTTCTCTAGGCGGAGCTTCTGAGCCCCTTGGGTCTGACCCCCGGGCTCC 
CCAACCTCCAGACTTGGCTGGAAGCCTCAGCTGGCAAAACAACTCATGCCCAGGTACCACAGCCACTCCGGAAGGAAAACATTGCCAGCCGAGAGGG 
GCATGTGGACATGGCAGGGGACCTGCACTCAGTAGTGGCCCTGGTCTGGGCCCGCTGGGCAGTCTCTCTCTTTCCCAGCCCCTCCTATGCCAGGCCC 
CCAGCTGGGGTTGAAAAGGGAGGAGGGCAGCTCAGGGGCAGGTCCTGGCTTGTGCCTGTCTCCTGCCTTGGTCTCCTCCCAGCTGAGTCAGTATCTA 
AGCACAGCAAGCAAGCGAGCCTCCTCTCCTACCCCACTTTCCCTCCGGGGGTTTCTGCATCrCCATCGCCATAGCCCTGGGCCCGATGCCTGCTCAC 
TGACAGTGACTCTTTCTCTGTACTTGAGGTGCTGGTATCAAGCTAGCTTGGGACTTTTCTATGGCGGGGTGTATGAGCATCGCCGTGTCCTGTTGTC 
CTGCTGCATTAGTAGACATGTCTGCAAGCCAGGGTAAAACATAGCATCTGTCCCTGAGTGCTGTGTTGCCTTATGATACTACAGAGTCTTACAGGAG 
TGTCGCTGCTGTGGGCAGTGACTTGCTGGGAGCCAGTGCCGTGTGGGTATGTGCATTTCTTCCTGTGTGGTCAGCCAGCCCAGGCTGGCTGGCTTGT 
GGTGTGTATGCTAGTACCCCGGGGGGCCACCAGATATGGGAGTGAAGGTGGAAGACATTTGGCAAAATTGCCTGGCTTTGTCTGACAGGTTGGCTTG 
GTCTGGGCTGTGGGCCTATACAAGGCCAGGATCTCTCTACCTCAAGTTGTATCTCT^ 

TAACACTGTGGGACAGGGACCCTTTAGTGATAGAAAGTCTGTATGTGTATCTGGGCAGGCCAGAGTCATAATCTTGCTTACACCTCACCCCCACTCC 
GCCCAAACTCCTGAGGTACCAGAATGTGGTCAGGAC1ACCAGGTGGCAGTGCCTGTCCCTTAGACCACTGGGTGTGTTACCTGTAAGGGTCCAGTAGG 
GACGACAACTTCAAAGGCGACTCCAGCTCTCCAGGAAGACAGTTCAGAGACTAGAAAGGCTGTGGGGACTAAAGCTGACTCGCCCAGCAGCTGCTGC 
C CCAGTC CCGAGGAGGTGGAGGAGG AGG AGAC ACCTCAGC AG GGC CTGGTGC AAAAC CG 

MOUSE SEQUENCE - mRNA 

TCTGGGTGTAGCACGCTGG03GTTGACTCCCAGGACTCTTCTGTGACCTGTGAGCCGAGGATTACTGGGTACCCCAGATTCCCAGAACCCAGTC<n , G 

GATTGCAGATCTGTAAGGACCGCGAAAGATTGAGCAAAGAAGTCTTTCTGGACTTAAGAGTTCCGGAACAGGACGTCCAGCCCATCTCCGGATCCCG 

ACTGGATCCACACCTTCAGCCTCGTGCCCTGTCCCGGGTTCCCCGCGCCCGGGGCCCCGCGCTTGAGCGCCCCGCATGGCGGGGCCATGTCCTAGGG 

CCGGGGTTCTAGAACGGGCGGGCAGCTGCTGGC!AGGACCCGCTGGCGGAGGCGCTGAGCCGGGGCCGCTCGTCTCCTGCGGTCACGGGTCGGGGCTG 

CGCGCGGAGCCGGCCTCTCAGTGTGGTCTACGTGCTGACCGGGAGCCGGGGCCCGGGGTGGAGCCCGGGTCGGGAACCGAGGCAGAGCCGCTGCCGT 

TGCGATGCCTGCGCGAAGCCTGCGCTCAGCTCCAGGCACACGGCCACCCCCGCACGTGCGCAGCCTACCCTTTGCGACGTGGCGCTAGGCGACACCG 

CGCTGGACTCCTTCTACAACGCGGATGTAGTGGTGCTGGAGGTGAGCAGCrCCCTGGCACAGCCTTCCTTGTTCTACCACCTCGGCGTG 

CTTCAGCATGACCAACAACGTGCTCCTCTGTTCCCAGGCAGAGCTCCCTGATCTGCAGGCCCTTCGTGAGGATGTTTTCCAGAAGAACTCGGATTGT 

GTTG GC AGCT ACAC ACTGATT CCT T ATGTGGTG AC AGCC ACTGGCCGGGTCTTATG TGGCGATG CAGGCCTC CTGAGGGGC ATAGC CGATGGG CT AG 

TACAGGCTGGGGCGGGCACTGAAGCCCTCCTCACTCCCCTGGTGGGCCGGCTTGTCCGTCTGTTGGAGGCTACACCCACAGACTCTTGCGGCTATTT 

CCGGGAGACCATTCGTCAGGATATCCGGCAGGCTCGGGAGCGATTCAGTGGGCAGCAGCTGAGGCAGGAGCTGGCTCGCCTGCAGCGGAGGCTGGAC 

AGCGTGGAGCTGCTGAGCCCTGACATTGTCATGAATCTGCTGCTGTCGTACCGCGATGTCCAGGACTACTCGGCCATCATTGAGCTGGTGGAGACGC 

TGCAGCTTTTGCCCACCTGTGATGTGGCTGAACAGCACAATGTCTGCTTTCACTACACATTTGCCCTCAACAGGAGGAACAGGCCTGGGGACCGGGA 
GAAAGCCCTGGCTGTCCTGCTGCCATTGGTGAAGTATGJAGGTCCCGTGGCTCCGGATCTGTACTGCATGTGTGGTCGTATCTACAAGGACATGTTC
TTCACTTCTGGCTTCCAAAATGCCGGGCACCTGGAGCAGGCCTATCACTGGTACCGCAAGGCGTTTGATGTGGAGCCCAGCCTGCACTCAGGCATCA
ATGCAGCTGTTCTCCTCATTGCAGCCGGGCAGCATTTCGAAGACTCTGAGGAGCTCCGGCTGATTGGCATGAAGCTGGCCTGCCTGCTAGCCCGCAA

AGGCTGTGTAGAGAAGATGCAGTACTACTGGGATGTGGGATTCTACCTGGGAGCCCAGATTCTTGCCAACGACCCCATCCAGGTGGTGCTGGCTGCA 

GAGCAACTGTACAAGCTTAATGCCCCCATTTGGTACTTGGTCTCCGTGATGGAGACCTTCCTACTGTACCAGCATTTCAGACCCACACCAGAGCCCT

CAGGAGGACCCCCGCTTCGAGCTCACTTCTGGCTCCACTTTCTGCTACAATCCTGCCAGCCTTTCAAGATGGCTCCTCCTCAGGAGGACCAGTGCCT

GGTACTGGTGCTGGAGATAAACAAGGTCCTGCTGCCTGCCAGGCTTGAGATTCAGGGAACAGACCCCATGAGCGCAGTGACCCTAAGCTTGCTGGAG 

CCAGAGACTCAGGAGGATCCTTCCAGCTGGACCTTCCCAGTCACCTCCATCTGTGGGATCAGCACCTCCAAGCTGGACCAGCGCTGCTGCTTCCTGT

ACGCACTTCCTCCGGCCCAGGACGTCCAGCTGTGCTTTCCCAGTGTGGAGCGCTGCCAGCGGTTCTGTGGCCTAATTCAGGTCTTGGTGATGAATCC

AGATTCCTCCGCGCCCACTGAGGAGGCAGAAGGCGCAAGGGAGGTGCTGGAGTTTGATTATGAATACTCGGAGACCGGTGAGCGGCTGGTGCTGGGC

AGAGGCACATATGGGGTGGTGTATGCCGGCCGAGACAGGCACACGAGGGTACGAATCGCCATCAAGGAGATCCCAGAGAGAGATAGCAGATTCTCTC 

AGCCTCTGCATGAAGAGATCGCTCTTCACAAACGACTTCGTCATAAGAATATTGTGCGTTATCTGGGCTCAGCCAGCCAGGGCGGCTACCTCAAGAT 

CTTCATGGAGGAAGTGCCTGGAGGCAGCCTGTCCTCCTTGCTTAGGTCAGTGTGGGGACCCCTAAAGGACAATGAGAGCACTATTAGCTTCTATACA 

CGTCAGATCCTACAAGGACTCAGCTACCTCCATGAGAACCGCATCGTTCACAGAGACATCAAGGGAGACAATGTTTTGATCAACACCTTCAGTGGGC

TGCTCAAGATTTCTGACTTTGGCACCTCTAAGCGGCTGGCAGGCATCACACCGTGCACGGAGACTTTCACAGGGACTCTGCAGTATATGGCCCCGGA

AATCATCGACCAGGGCCCGCGAGGATATGGGAAGGCAGCTGACATCTGGTCTCTGGGCTGCACTGTAATCGAGATGGCAACAGGTCGACCGCCCTTC

CATGAACTAGGGAGCCCTCAGGCTGCGATGTTTCAGGTGGGCATGTACAAGGTACATCCACCCGTGCCCGGTTCCCTGTCAGCTGAGGCCCAAGCCT

TCCTCCTCCGAACTTTTGAGCCAGATCCCCGCCTCCGAGCCAGTGCCCAAGAGCTGCTGGGAGACCCCTTCCTTCAGCCAGGGAAGAGGAGCCGCAG 

CCCGGGCTCTCCTCGGCATACTCCCCGGCCCTCAGGCGCCCCTTCCGGTCCTTCCAGTCCTTCAGCTGACTCAGCCACACAGTCTCAGACATTCCCA 

AGGCCCCAGGCACCCTCTCAGCACCCACCCAGCCCCCCAAAGCGCTGCCTTAGTTATGGGGACACCAGTCAGCTCCGTGTGCCCGAGGAGCCCGCTG 

CCGAGGAACCCGCGTCCCCAGAAGAGAGTTCAGGCCTGAGTCTGCTGCACCAGGAGAGCAAGCGCCGGCGCATGCTGGCTGCGGTGCTGGAACAGGA 

GGTGCCCACACTAGCAGAGAATCTCCTGGAACAGGAACAGGACTCTCGACTCAGCAAGATTCATGTGGAACTTCTGCTTCGTGGCCTTGGGGCACAA 

ATCCACACTCCTAACCGCCGGCAGCTGGCCCAGGAGCTGCGGACCCTGCAAGCTCAGCTGCGGGCCCAGAGCCTGGGCCCTGCGCTTTTGAAGGGGC 

CGCTCTTCGCCTTTCCAGACGCGGTGAAGCAGATCCTCCGCAGACGCCAGATCCGCCCACACTGGATGTTCGTGTTGGACTCGCTGCTCAGCCGTGC 

AGTCCGGGCGGCGCTGGCGGTGCTGGACGCGGAGTCGGAGAAGAAAGCGGTCTTACCGAGGTCAGAAGAGTCGAGTAAAGAGGAGTCCCAGCAGAAA 

CCTCAGGAGAGCCAGGCGCTGCAGAGCCAGCTCCCACCAGAGCAGGGACCCCCATCGTTGATGGTGGAGTTGGGCCTTTTGCGAGCCGAGACTGACA 

GGCTTCGGGACCTTCTGGCTGAGAAGGAACGTGAGTGCCAGGCGCTGGTGCAACAGGCCCTGCATCGGGTGCATGCAGAGACCAGGAAGTATGCCCC 

AGCTTCCGAGACCCCAGCCACTCTCCCAAAGGACCAGAACCTGGTGCGGTGGCTACAGGAACTGAGTGTAGATCCAGCCACTATCCAAACGCTCCTG

AGTCATAGCTTCACCCTTCAAACCCTGCTCACCTGTGCCACTCAAGATGATCTAGTCTACACCAGAATCAGGGGAGGGATGGTATGCCGGATTTGGA 

GAGCCATCTTGGCACAGCGAGCAGGAGCCACATCGGTCACCCCAGTACCCCGGGACGCTGAGTGATGACATCAAAGGACACAGATACGGATGGATGA 

A(^GAGAAGACACAAACAGCTTTGGACACACCTGTCCTAGAACCCAGGAGCAACTACAGGGACCAGGCCCAACCCCAATGGGTGGAACTAACTAGGC 

AGGCACCCAAAGGACACTACATACAATCATAGACTATTAAACAGAACACTTGCTACC 

MOUSE SEQUENCE - CODING 

GCGGGGCCATGTCCTAGGGCCGGGGTTCTAGAACGGGCGGGCAGCTGCTGGCAGGACCCGCTGGCGGAGGCGCTGAGCCGGGGCCGCTCGTCTCCTG 
CGGTCACGGGTCGGGGCTGCGCGCGGAGCCGGCCTCTCAGTGTGGTCTACGTGCTGACCGGGAGCCGGGGCCCGGGGTGGAGCCCGGGTCGGGAACC 
GAGGCAGAGCCGCTGCCGTTGCGATGCCTGCGCGAAGCCTGCGCTCAGCTCCAGGCACACGGCCACCCCCGCACGTGCGCAGCCTACCCTTTGCGAC 
GTGGCGCTAGGCGACACCGCGCTGGACTCCTTCTACAACGCGGATGTAGTGGTGCTGGAGGTGAGCAGCTCCCTGGCACAGCCTTCCTTGTTCTACC 
ACCTCGGCGTGCGTGAGAGCTTCAGCATGACCAACAACGTGCTCCTCTGTTCCCAGGCAGAGCTCCCTGATCTGCAGGCCCTTCGTGAGGATGTTTT

CCAGAAGAACTCGGATTGTGTTGGCAGCTACACACTGATTCCTTATGTGGTGACAGCCACTGGCCGGGTCTTATGTGGCGATGCAGGCCTCCTGAGG

GGCATAGCCGATGGGCTAGTACAGGCTGGGGCGGGCACTGAAGCCCTCCTCACTCCCCTGGTGGGCCGGCTTGTCCGTCTGTTGGAGGCTACACCCA

CAGACTCTTGCGGCTATTTCCGGGAGACCATTCGTCAGGATATCCGGCAGGCTCGGGAGCGATTCAGTGGGCAGCAGCTGAGGCAGGAGCTGGCTCG 

CCTGCAGCGGAGGCTGGACAGCGTGGAGCTGCTGAGCCCTGACATTGTCATGAATCTGCTGCTGTCGTACCGCGATGTCCAGGACTACTCGGCCATC

ATTGAGCTGGTGGAGACGCTGCAGCTTTTGCCCACCTGTGATGTGGCTGAACAGCACAATGTCTGCTTTCACTACACATTTGCCCTCAACAGGAGGA 

ACAGGCCTGGGGACCGGGAGAAAGCCCTGGCTGTCCTGCTGCCATTGGTGAAGTATGAAGGTCCCGTGGCTCCGGATCTGTACTGCATGTGTGGTCG 

TATCTACAAGGACATGTTCTTCACTTCTGGCTTCCAAAATGCCGGGCACCTGGAGCAGGCCTATCACTGGTACCGCAAGGCGTTTGATGTGGAGCCC

AGCCTGCACTCAGGCATCAATGCAGCTGTTCTCCTCATTGCAGCCGGGCAGCATTTCGAAGACTCTGAGGAGCTCCGGCTGATTGGCATGAAGCTGG 

CCTGCCTGCTAGCCCGCAAAGGCTGTGTAGAGAAGATGCAGTACTACTGGGATGTGGGATTCTACCTGGGAGCCCAGATTCTTGCCAACGACCCCAT 

CCAGGTGGTGCTGGCTGCAGAGCAACTGTACAAGCTTAATGCCCCCATTTGGTACTTGGTCTCCGTGATGGAGACCTTCCTACTGTACCAGCATTTC 

AGACCCACACCAGAGCCCTCAGGAGGACCCCCGCTTCGAGCTCACTTCTGGCTCCACTTTCTGCTACAATCCTGCCAGCCTTTCAAGATGGCTCCTC 

CTCAGGAGGACCAGTGCCTGGTACTGGTGCTGGAGATAAACAAGGTCCTGCTGCCTGCCAGGCTTGAGATTCAGGGAACAGACCCCATGAGCGCAGT

GACCCTAAGCTTGCTGGAGCCAGAGACTCAGGAGGATCCTTCCAGCTGGACCTTCCCAGTCACCTCCATCTGTGGGATCAGCACCTCCAAGCTGGAC

CAGCGCTGCTGCTTCCTGTACGCACTTCCTCCGGCCCAGGACGTCCAGCTGTGCTTTCCCAGTGTGGAGCGCTGCCAGCGGTTCTGTGGCCTAATTC 

AGGTCTTGGTGATGAATCCAGATTCCTCCGCGCCCACTGAGGAGGCAGAAGGCGCAAGGGAGGTGCTGGAGTTTGATTATGAATACTCGGAGACCGG 

TGAGCGGCTGGTGCTGGGCAGAGGCACATATGGGGTGGTGTATGCCGGCCGAGACAGGCACACGAGGGTACGAATCGCCATCAAGGAGATCCCAGAG

AGAGATAGCAGATTCTCTCAGCCTCTGCATGAAGAGATCGCTCTTCACAAACGACTTCGTCATAAGAATATTGTGCGTTATCTGGGCTCAGCCAGCC 

AGGGCGGCTACCTCAAGATCTTCATGGAGGAAGTGCCTGGAGGCAGCCTGTCCTCCTTGCTTAGGTCAGTGTGGGGACCCCTAAAGGACAATGAGAG

CACTATTAGCTTCTATACACGTCAGATCCTACAAGGACTCAGCTACCTCCATGAGAACCGCATCGTTCACAGAGACATCAAGGGAGACAATGTTTTG 

ATCAACACCTTCAGTGGGCTGCTCAAGATTTCTGACTTTGGCACCTCTAAGCGGCTGGCAGGCATCACACCGTGCACGGAGACTTTCACAGGGACTC

TGCAGTATATGGCCCCGGAAATCATCGACCAGGGCCCGCGAGGATATGGGAAGGCAGCTGACATCTGGTCTCTGGGCTGCACTGTAATCGAGATGGC 

AACAGGTCGACCGCCCTTCCATGAACTAGGGAGCCCTCAGGCTGCGATGTTTCAGGTGGGCATGTACAAGGTACATCCACCCGTGCCCGGTTCCCTG

TCAGCTGAGGCCCAAGCCTTCCTCCTCCGAACTTTTGAGCCAGATCCCCGCCTCCGAGCCAGTGCCCAAGAGCTGCTGGGAGACCCCTTCCTTCAGC

CAGGGAAGAGGAGCCGCAGCCCGGGCTCTCCTCGGCATACTCCCCGGCCCTCAGGCGCCCCTTCCGGTCCTTCCAGTCCTTCAGCTGACTCAGCCAC

ACAGTCTCAGACATTCCCAAGGCCCCAGGCACCCTCTCAGCACCCACCCAGCCCCCCAAAGCGCTGCCTTAGTTATGGGGACACCAGTCAGCTCCGT
GTGCCCGAGGAGCCCGCTGCCGAGGAACCCGCGTCCCCAGAAGAGAGTTCAGGCCTGAGTCTGCTGCACCAGGAGAGCAAGCGCCGGCGCATGCTGG
CTGCGGTGCTGGAACAGGAGGTGCCCACACTAGCAGAGAATCTCCTGGAACAGGAACAGGACTCTCGACTCAGCAAGATTCATGTGGAACTTCTGCT 
TCGTGGCCTTGGGGCACAAATCCACACTCCTAACCGCCGGCAGCTGGCCCAGGAGCTGCGGACCCTGCAAGCTCAGCTGCGGGCCCAGAGCCTGGGC

CCTGCGCTTTTGAAGGGGCCGCTCTTCGCCTTTCCAGACGCGGTGAAGCAGATCCTCCGCAGACGCCAGATCCGCCCACACTGGATGTTCGTGTTGG 
ACTCGCTGCTCAGCCGTGCAGTCCGGGCGGCGCTGGCGGTGCTGGACGCGGAGTCGGAGAAGAAAGCGGTCTTACCGAGGTCAGAAGAGTCGAGTAA 
AGAGGAGTCCCAGCAGAAACCTCAGGAGAGCCAGGCGCTGCAGAGCCAGCTCCCACCAGAGCAGGGACCCCCATCGTTGATGGTGGAGTTGGGCCTT 
TTGCGAGCCGAGACTGACAGGCTTCGGGACCTTCTGGCTGAGAAGGAACGTGAGTGCCAGGCGCTGGTGCAACAGGCCCTGCATCGGGTGCATGCAG 
AGACCAGGAAGTATGCCCCAGCTTCCGAGACCCCAGCCACTCTCCCAAAGGACCAGAACCTGGTGCGGTGGCTACAGGAACTGAGTGTAGATCCAGC
CACTATCCAAACGCTCCTGAGTCATAGCTTCACCCTTCAAACCCTGCTCACCTGTGCCACTCAAGATGATCTAGTCTACACCAGAATCAGGGGAGGG 
ATGGTATGCCGGATTTGGAGAGCCATCTTGGCACAGCGAGCAGGAGCCACATCGGTCACCCCAGTACCCCGGGACGCTGAGTGA 

HUMAN SEQUENCE - GENOMIC 

ACCACCAGGCAAGATGGGCCCCAAGGGTGAGCCAGGTGAGTAAGTGGGGCTGGAGGCCTCCCCGCTTTCTTTGCCTCTGTCCCTGGACTCTAGGGTG 
GGATCTCTGCTTGGCTGCTGTGCCTATGATGGCTCTCTCACAGGGGAATGGATGGCTGGGAGGAGTGGGAAGGATTAGGAGAAGG 
TATGATGGCCCTCCTATTGGGGATGGAGGGCTGGGAGGAGTGGGGAGGATTAGGAGAAGGGCCTGGGTACCCTATGAAGGCCCTCCTACAGGGTTTG 
GAAGGCTGGGAGGAGTGGAGAGGATTAGGAGAAGGGCCTGGGTACCCTATGATGGCCCTCCTACAGGGTATGGAAGGCTGGGAGGAGTGGGAAGGAT 

tcggaaaagggcctgggtaccggctccactggtggct^ 
cagtgaacotgctccggtgccaggaaggtgaggcagcccttgcaggagccaggaact 

TGGCCAAGATCCTCCCCACAGCCTCCTCAGAACCCTTCTTCAGGGACCTGAAGGAGCC^ 
GGGGTGGGGGTCAGGGAATAACCTGGGCACCACCCCAGTCCATCTCCCCTTTCCCAGGCCCC^^ 

CTTGAGCGGCTGGTACCATCTGTGCCTACCTGAGGGCAGGGCCCTCCCAGTCTTTTGTGACATGGACACCGAGGGGGGCGGCTGGCTGGTGAGTGTC 
TGAGAAATGGGCTAACAGGCCAGGCCCACTGGTCACTTCTCCCCCTTGGCCCCTAGGGGAGCCAGAACCCACCAGAGCAAGGCATTTCCTAGAGGAA
TTCACTGACAAATGTTTAGTGAGCCCCCAATGGGACCTGAAGAAGTATGAGGGCTTCACAGAGGAGGTGCCATTTGAAAAGACCTTAAGTTGAGTGC 
AGTAGCTCACGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAAGAGAATTGCCTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGTTCTTGCC
ATTGCACTC CA G CCTGGGTGAAAAG AGTG AAACT C TGTCTCAAAAAAAAAAAAAAAAAAAAA AAAAATAGG GT AGGGAGGGGCATTCCAGCTAGAAG 
TAAAGATAAACTCAGGGAGGCAGCAGAGTAAAAGGTTTGCAGGGAACGCGTGAGTGAGCTGAGAGCAGGCGTGGGTGTCGGCTAAAGTCTGCCGGCA 
CTGACGGAACATGGATTGAGGACCTCTGTAAAGGCCAGTTCATGGAAGGCACTAAATGATCMCTTAAGAAGTGTGCAGCGCGCCCAGACGCGGTGG 
TTGGCTGGACGTGGTGGCTCACGCCTATGATCCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCACCTGAGGTCAAGAGTTCGAGATCAGCCTGGCC 
AACATGGTGAAACCCCATCTTTACTAAAAATATAAAAATTAGCCAGGCG 

GAATTGCTTGAACCCCGGAGGTGGTGGTTGCAGTGAGCCAAGATTGAGTCACTGCACTCCAGCCTGGGCAACGGAGTGAGACTCTGTCTTAAAAAAA 



CGGGCTAC^CCATGAAAGATGCACTGCATTGGGGGACCAGTGGAGGCAGGGACATGGTTTAGGAAGACAGCAAGGAGCCAGGCAGTAAAG 
AGCAGGCCTGACACAGTAGCTCACACCTGTAATCCCAGCACCTTTTGGGAGGCCGAGGTGGGTGGATTGCCTGAGCTCAGGAGTTCAAGACCAGCCT 
GGGCAACATGGCAAAACCCCATCTCTACTAAAAATACAAAAAATTAGCTGGGCGTGGTGGCAGGTGCCTGTAATCCCAGCTATGCAGAAAGCTAAGG 
4S CACGAGAATTGCTTGNNNNNNNNNNNNNNITONNNNAGCCGAGATCAC^ 

ATAAATAAATAAAATAAGACAGTGGAGCAATGGAAATAGAAGTACTTAGGTTGACAACTCTATAGGACCTCGGGCCTACTTGGCTTGGAGGGGGAGG
TGGAAGACATCAGGGTAAGGCCAGGGGTCCAGCTTGGGCTGTGGGAGTGGCCTCTCTAATGTAAAATATCTTGGTTTTTTTGTTTGTTTGTTTGTTT 
TGAGACGGAGTCTTGCTCTGCTGCCCAGGCTGGAGTGCAGTGGTGTGATCTCGGCTCACTGCAACCTCCACCTCCCAGGTTCAAGCAGTTCTTGTGC
CTCAGCCTCCTGAGTAGCTGGAACTACAGGTGTGAGCCACCACACCCGGCTAATATTTGTATTTTTAGTAGAGACGGGATTTCACCATGTTGGCCAG
GCTGGTTTCAAACTCCTGACCTCAAGTGATCCACCCGCCTCGGCCTCCCAAAGTGCTGGAATTACAGGCATGAGCCACCAAGCGTGGCCAAATATCT
TGGTTCTTATAGTAAATGAAATTGGGTTATTTTGACTGACACCCAGCAGCGCTGGCCATCTCCTGTGGGTGGGGACAGTGTTTTGTCAATTCAGTCT 
TCCCAGGGCTTGCAAGGCTGTGGCATGAAGTTGAGTGTCAGTAAAAGTTAACAAGG 

ATCCCCATTTTATCAACAGGCCCCAATCCAAAGAGAGAGTGGGTATCCACCCACCCCTCACAGTTCCTGCACCCTCAGCTGCCAGTGCCCAGCCTGG 
AGTAATAGCTGCTGTGCCCCCCCTCCCCCAGGTGTTTCAGAGGCGCCAGGATGGTTCTGTGGATTTCTTCCGCTCTTGGTCCTCCTACAGAGCAGGT
TTTGGGAACCAAGAGTCTGAATTCTGGCTGGGAAATGAGAATTTGCACCAGCTTACTCTCCAGGGTGAGTTCCCAGCAGGGTCCTGGAGTCTGGGGG

TTACTGTCTGCCATCCTGAATCCCTGCCCCACCCCCAAGAACTGAGGATTCCTCCATCCCTGCCTCCCCTTCTTTCCAGGTAACTGGGAGCTGCGGG
TAGAGCTGGAAGACTTTAATGGTAACCGTACTTTCGCCCACTATGCGACCTTCCGCCTCCTCGGTGAGGTAGACCACTACCAGCTGGCACTGGGCAA
GTTCTCAGAGGGCACTGCAGGTGAGTGAGCCTAAGGGGAGCAGAGGAAGGAGGCTGGATCCTTTCCCAGTGCTGCCACTGCCCCCATGAATGACCCA
GGGCAATCCTGTCTCCTCTGTGGGCCTTAGTTTCCGCATCTGAGAAATGGCAATCATGAAACAGGATGGTGCAACAGTGAAGAATTTGGGATCTGGA
GGCAAACAGACCTGAGTTTGAATCCATCTCTGCCATTTAGTACTTATGTGACCTTGGGCAAGTCCTACCCCTCCACTGAGCCTCAGTTTCCTCCTCT 
ATAAAATGGGCAGGGAGGCCGGGCACGGTGGCTCATGCCTGTAACCCTAGCACTCTGGGAGGCCGAGGCGGGCAGATAACTTGAGGTCAGGAGTTCA 
AACCCAACCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAAAATTAGCCGGGCGTGGTGGAGGGTGCCTGTAATCCCAGCTACTCG 
GGAGGCTGAGGCAGGAGAATCACTTAAATCTGGGAGGCAGAGGTTGCAGCGAGCCAAGAGAGTGTCACTGCACTCTAGCCTGGGCGACAGAGCGAGA 
CTCCATCTCAGAAAAAAAAAAAAGGCGGGGGGAGCGCAGGGAGGCAAGGAGGAAGTTTTGGGTGCCGCCTGAGGCTCTGTTGAGTGTTAGTGTTCTC
TGCTTTGGAAGAGGAACCACGGAAGTCTTCTTTCAGAGAAACAGTGACTATTGGGGAGAGATTTAAAAGAGGGAAGTGGTTATGAGGGAGTGCCTCA 
GGGAACCACCTTTCCTGGCCAGGCACCTAAGAGGTTCCTCCCAGCTTTTCCTAGCAGAGTTGCTCAGACTGGATGGGAGCTGAGAGAAGACAAAGGA 
GCTAGAGGGGGGCCGGGGATTCTGGAGGGGGTAAGACAGGGGGAAAAGCTGGGAGCTTTGGAGCCAGACAGTCCCGAGTTCAAGTCCTGCCGCTGCT 
TCAACCAGCCGAGTAACCCTGCGCAAACAACCTGGCCCCTTGGTGTTGCTCGTGGGGCCTGATAAAATTCAACTCTTAGTTGCTCTGAGACTCCCAT 
GGCATTATATCTCCAAAGGTGCCAGGCATGGTGCTGGGCACCAAGGTGCTTTCCATCCCTTCCCTGCTAGGGGATTCCCTGAGCCTCCACAGTGGGA 
GGCCCTTTACCACCTATGACGCTGACCACGATTCAAGCAACAGCAACTGTGCAGTGAT^ 

AAATCTCAATGGTCGCTATGCAGTGTCTGAGGCTGCCGCCCACAAATATGGCATTGACTGGGCCTCAGGCCGTGGTGTGGGCCACCCCTACCGCAGG 
GTTCGGATGATGCTTCGATAGGGCACTCTGGCAGCCAGTGCCCTTATCTCTCCTGTACAGCTTCCGGATCGTCAGCCACCTTGCXTTTGCCAACCAC 
CTCTGCTTGCCTGTCCACATTTAAAAATAAAATCATTTTAGCCCTTTCAAGACTCTTCCAGTTACTGGGAAAAAACCCTAGAGGGCAGAGCAGGGAC 
ACAAGTTTCAGCTCCATTCAGGGAAGAATTTCCTTCCAGTCAGGAAGATGGAAAGGACAGCeTCTAAAG 

CAAGACAAGGCCAGAGGGGCATATAGTGGAAGGACTGGAGGAGATTCAAGCATCCATTGACAATTTCCTGAGGCTCTCTGGGTCCTGCAAGTGGATG
GCTTTGTGGCAGGAATTGCACAATTGCACAGCTTTTTTGTTTGTTTGTTTC 

GTGCAGTGGCACCATCCTGGCTCACTGCAAGCTCTGCCTCCCAGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGAACTACAGGCACC 
CGTTACCACGCCCGGCGAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCATC^TGTTAGCCAGGATGGTCTCAATCTCCTGACCTCATGATCTGCC 

CGCCCTGCCCAGGCTGGAGTGCAATGNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNtnWNNNNNNNNNN^ 
NWNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNl^NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNimNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNmNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNI^NNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNmNNNNNNNNlTONimNNNNNNNN^ 

NNNN]OTNNNNNNIWN1JNNNNNNNNNNNNN^ 

NNNNNNNNNNNNl^NNNNlTONNNNNNNNNm 

NNNNNNNNNNNNNNNNNNNNNIWI^N^ 

NNNNNNNNNNNNNNNNNNNNNIWN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNN1TONNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNN1WNNNNNNNNNNNNNN 

NITONNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNmJNNNNNNNNNNNNNNNNNNNNJWNNNNN^ 

NlTONNNNlTOmNNNNNNNl^NNNNNNNNNN^ 

NNNNl^NNNNNNNNNNNNNNNNN^ 

NNN NNNNNNNN NNN NNN NNNNNNNNNIJNNNNNNNNNNNNNNNNNNNNN^ NNNNNNNNNNNNNNNN 
NNNNNCCCCGCAGCTGCGCAGCCTGCCCTTCGGGACGCTGGAGCTAGGCGACACCGCGGCTCTGGATGCCTTCTACAACGCGGGTGAGAGCGCTGGG 
GGCGGGGCCGGATCCGGGCGGGACTCAGGGTCCTGGGAGGGGACCAAGCCGGGCTCGAAGGCTCAAGGGAGGGGCCGAGAGGGGTAOCGGGTCTAGG 
GGACTTAAAGTGAGCTAAAGCTGAAGGGCTTGGGACAGGGCCCGTGAAAAAAGTGGGATTTAGGGGATCTGGGGCGCGGCCGGGCTCGGGACGAGAC 
AGGGATCCGGGCGCGGCTCTAGGGCTCGAGGTTGGGGCTTAGGCACCCTTCAGTCGGAGCGGTGCCCAGCCAGCGGGGCAAGGATGGGGAGGGGCTT 
TGGAACTCTGGGGCGGGGCGGGGAGCGGCGCGGGGCTGTGGTGTCTC 

ATCTCTTGAGGACAATTACTGGGAGAGGAGCTTGGACTTAGGGAAAGGGGCCGGGAGGCTGGAGTTCAGGAGTCCTAGCATACTATTTCCAAGGATC
GGGCTAGGAGAGGCAAAAAGAGAAGCGAAGGGGCCCGACGGGACTGGATACTGGGTGAGGGAGCTTAGGTTAAGTGGCCTCAGCGGCCCAGAAAGAA 
TGGGCGGGGAAGGGGCGGGGAAGTGGGCGGGAAAGGATGGCTCAGGGGCGGGGCCAGAAGGAGGCTATTCCCCCGCGAGTCACTCCTCCCTCCAGCC 
AGCCCGGATGCTCGGCCCCCATACTTACTTGCTTATGCTAGAGTCCAGTAGCTGTGCTTCCTGGATGTCCATGAAATCGTAGCCTAAAATGAAGGCT 
GACCCGACCACTCGGACACAAAGATACAATTATCGACAGACACAAATCGCATACATGTTGACTCATGCAGGCCCTAGCAGACGCTGTCACACACACA 
GATACAGTCAGACATAGTAAGCTATACATGTCACAGATATACACATACATACATTTAGGCACAGCGTTGTCTATGCACACTGATGGTAGCCCCTTTC 
AAGAGCCTGTGCTGAAGACCAAAGGGAGGGGACTTCTATGTTTGCGCAACTCCCACCTCCTAAATGGCCCGTAACAGACTCAACTGGTCCACAGCCT 
CCCTTCCCCCATGCCAGGTGCTCACAGGCTCTGAACCGTGGCCTGAGGGTCCCA^ 

GTTCTGTGAAAGTGCCAGGGGAACCCTAAAAAGGGCCGACTGAAGCCCAAAGCAGAACCAGACCCTACTGGAGTCCCACAGCAAGGCAGGTCAAGCC 
AGGATGGACCCCGGCTCCTTCCCCATCAAGCTGGGCTTCAGGGATCAGCGCCACCCCCTCTGTGCCTGCAGATGTGGTGGTGCTGGAGGTGAGCAGC 
TCGCTGGTACAGCCCTCCCTGTTCTACCACCTTGGTGTGCGTGAGAGCTTCAGCATGACCAACAATGTGCTCCTCTGCTCCCAGGCCGACCTCCCTG 
ACCTGCAGGCCCTGCGGGTAGGTGGCCTGGGGCTGGTGGAGGCAGGCACAGGGCTGGGGTAAGGCCTACACAGAACACCAGTATCTGACTGTCTGCC 
TCCTGCAGGAGGATGTTTTCCAGAAGAACTCGGTGAGCAGAATCCACCTCTCACTCAAAAGTGCCCTTTGACTTCCGCTATGACCTCTGACCTCCAC 
TAATCCCCCTGGCCTCCTCAGTCACCTAACCCCCCACTGACATCCCAAGTATTCCTCTGTTTCCTGTTGCCTCCTCTCATGACGGGTGCCCCTTTCC 
TGCGATTCCAATTCCCACTGATGCTGACCTTTGATGCTCTCACTCCCCTTGTCTAGCCCTGTGCTGACCTCTGACCTCACTGATTCCTGCCTCCCTC 
CCACCAGGATTGCGTTGGCAGCTACACACTGATCCCCTATGTGGTGACGGCCACTGGTCGGGTGCTGTGTGGTGATGCAGGCCTTCTGCGGGGCCTG 
GCTGATGGGCTGGTACAGGCTGGAGTGGGGACCGAGGCCCTGCTCACTCCCCTGGTGGGCCGGCTTGCCCGCCTGCTGGAGGCCACACCCACAGACT 
CTTGGTAATGAAGGCCCCCAGGGAGGGTGCTCCAGGAGGGGGCTGGTGAGGGTACGGTGGGCTGGGGCTGAGGTACAGGCCCCACCATTCTCrCTCC 
CTGCCTCCCCTACCCTGGGCATCAGTGGCTATTTCCGGGAGACCATTCGGCGGGACATCCGGCAGGCGCGGGAGCGGTTCAGTGGGCCACAGCTGCG 
GCAGGAGCTGGCTCGCCTGCAGCGGAGACTGGACAGCGTGGAGCTGCTGAGCCCCGACATCATCATGAACTTGCTGCTCTCCTACCGCGATGTGCAG
GTGCGTGGTGTTCAGAGAGGCAGTAGGGCAGCTGGCATTTGAGGTCAGGCAAACCCAAGTTCAGTTCCTGGCTGGGCCACTTGCTCACTGGTGGCTT 
TGGGCAAATTATCAAATGTCTCTGAGTGTTGGCCTCCTTATTTGTTAGAGAGGATGAGCATGTCGCTGCCTCCTGGGGCAGTTGTGGGGTTGGGAAC 
GCCCAGCATGGGGCATACAGTAGGTTCTCCCTGTGAGGTGTGTCCGCTGGGCCTGAGGGAGGCGTGCAGAAGATGGTGGCTGGGTGGGATGTCAGGA 
GTGGGACAGGACCCTGGCTCAAGGTTCCTTGGGGCCTGGAGGCCCATGATGCCAGTGCTTGCCGTTCCCTGATGCAGGACTACTCGGCCATCATTGA 
GCTGGTGGAGACGCTGCAGGCCTTGCCCACCTGTGATGTGGCCGAGCAGCATAATGTCTGCTTCCACTACACTTTTGCCCTCAACCGGTGAGTGGCA 
GAGCAAGGGTTAGGTCATAGCCACATAAAGGTTTCCGAAACCTAGGTCTCTGCGTGCAAAGCCATTGC^

TGCTGACCGGGGTCACTGATGTCAATAGGTCCCTGTTGTTCTGAGTTACTGCTGCCTGGGTCGGGGTGGGGGAGGTGGTGCTGCCTGTGGTATGGTC 
ACCGCTGTCCAAGATC^CTGCTCCTAAGAGTCATCGCATCTAGGAATCACT^ 

CATTTTGAGGTCATTATTGTCAGGCCCCTGGAAGGGTCCCTGCTGGTTCAGTGTCTAGAGGTCCCTGGACAGAGTAGGCCAGGGCCATCAGGGAGGG
CCTGAGCCCATCTTTTCCCTAGGAGGAACAGGCCTGGGGACCGGGCGAAGGCCCTGTCTGTGCTGCTGCCGCTGGTACAGCTTGAGGGCTCTGTGGC 
GCCCGATCTGTACTGCATGTGTGGCCGTATCTACAAGGACATGTTCTTCAGCTCGGGTTTCCAGGATGCTGGGCACCGGGAGCAGGCCTATCACTGG 
TAAAGAGC^CAGTGAGTGGCCGGTGGGCTAAGCTGTGAGATGAGGGGTGGTGCTGTGGGCATCAGTCCAGCCCTGTGCCCCCTACCCCCACCCCAGG 
TATCGCAAGGCTTTTGACGTAGAGCCCAGCCTTCACTCAGGCATCAATGCAGCTGTGCTCCTCATTGCTGCCGGGCAGCACTTTGAGGATTCCAAAG 
AGCTCCGGCTAATAGGTGAGCACAGATCCTGGCTCTTGTCTCGTCACCCACAGGGGTTCCGAGGGGCCTGGGAAGAGGGCAGGGAGAGAGCACACTG 
GGCAAGAGCTAGGATGAGCTCATCCATTTTGGACTGTCCTGCCCATACCCACACCTGGGTCCTCATACCTGTTCTGGGCCCTCAGGAGCCCTCAAAC 
CCACAAATGTTCCCAAACTCACATCTGCGTGTCCATTACATCCTGAATTCCACATATACCTGTGCCTGAGCTCCTGTTCCCACCCGAGATCTTGCCA 
TACTCAATGCCATACTCATATCTATGTTCATGTCCACACT 

ATGCCTCTCTCCCCCAGGCATGAAGCTGGGCTGCCTGCTGGCCCGCAAAGGCTGCGTGGAGAAGATGCAGTATTACTGGGATGTGGGTTTCTACCTG
GGAGCCCAGATCCTCGCCAATGACCCCACCCAGGTGGTGCTGGCTGCAGAGCAGCTGTATAAGCTCAATGCCCCCATATGGTAGGTGGCCCCCTCTG
CATTCTTGGCCTGACCTGGGCTGTCAGCTCCTGCACCTATGTCCATTGGCATGCTAGACTGAACCCTGAGCCAATGACCTGGCTGCTTTTAGACCAG 
TGCCCGTGGCACCCCATGCCAACGTTCTAGCTATATCCAGGCCAGTTTCCTCTCCCTGGCAGTGCTGGTTGGTGCCTGGGCTGAACTGAGGATGTGG 
CTTTGCATGCAGGTACCTGGTGTCCGTGATGGAGACCTTCCTGCTCTACCAGCACTTCAGGCCCACGCCAGAGCCCCCTGGAGGGCCACCACGCCGT
GCCCACTTCTGGCTCCACTTCTTGCTACAGTCCTGCCAACCATTCAAGACAGCCTGTGCCCAGGGCGACCAGTGCTTGG 

TGGACCCGGGAAAGGGGTAAGCCCCTTCTGACTGACCCCACTCCCCAACTGCCTGCAGGTGCTGGTCCTGGAGATGAACAAGGTGCTGCTGCCTGCA 
AAGCTCGAGGTTCGGGGTACTGACCCAGTAAGCACAGTGACCCTGAGCCTGCTGGAGCCTGAGACCCAGGTGAAGTCGCTGCTGAGGCCCCTCACTC 
ACCTTTGCTAGGACCTTTTCTGTGGAGCCTTTCTCCCATCCTCAGACCCCTCTCCTGTCACCAGCACTCCCCATCACACCCCATTTCTTCTTGCAGG 
ACATTCCCTCCAGCTGGACCTTCCCAGTCGCCTCCATATGCGGAGTCAGGTGATAGGCGGGACCTGGGGTGACTGGAAAGACGGGAGGTGAGGGTCG
CTGCCACCCAGCCTGACGCTTACTGGGTTCTTCATCCCTTCCCAGCGCCTCAAAGCGCGACGAGCGCTGCTGCTTCCTCTATGCACTCCCCCCGGCT 
CAGGACGTCCAGCTGTGCTTCCCCAGCGTAGGGCACTGCCAGTGGTGCGCAGCCTTTCATGGTCGGGGTACTCTGAGGGCCAGGATGCCCGGGGGGG 
AGTGGGGGAAAGAAAGGGACGACGTCCGTTGGATGCTCCX3GCTTAGGTGGGGACCAAGATGCCCCTCCCGCGGGTCTGACCTCCCCCACCTTCAGGT 
TCTGCGGCCTGATCCAGGCCTGGGTGACGAACCCGGATTCCACGGCGCCCGCGGAGGAGGCGGAGGGCGCGGGGGAGATGTTGGAGGTGAGGACGCG
GGACTGGGAGCTCGCGGAGGGCGAGGGGCCGCGGGCCCTGCTGAACTCCCATCTCACCCCGCGCCCGGCAGTTTGATTATGAGTACACGGAGACGGG 
CGAGCGGCTGGTGCTGGGCAAGGGCACGTATGGGGTGGTGTACGCGGGCCGCGATCGCCACACGAGGGTGCGCATCGCCATCAAGGAGATCCCGGAG
CGGGACAGCAGGTGCGGTGCGTGGCAGCGCGGGTGGGGCTGAGGGCTCGCGAGGGCCGAGTGAGCCGGGCCACGAGCGGAGCGGCTCCTGTGGGCGG 
GGCCTAGTGGGCGGGCTCGTAGGACCTGAGAGTTCGAGCCCTGTGGACTGGGCCCGTGGGCGGAACAGCCAGGACCGTGCGGGCGGGATTGAGGTCT 
GGAGGGTCCTGTGGGCAGGGTAGTAAGGGTGGGGAAGGTCGGAGAGGGCTTGTGGGCGGGGCGGGTCCTTGGG^ 
GGCGGGGCTTCTCTGCCCACCTCGGTCCCTTTCCTCTCTC^ 

GCCTGCGCCACAAGAACATAGTGCGCTATCTGGGCTCAGCTAGCCAGGGCGGCTACCTTAAGATCTTCATGGAGGAAGTGCCTGGAGGTACCTGCCC
TGTGTGGGATGGGAATGAAACCGAGGTGTGAAGCATGGGACCGAGGCAGGCAGGGGTAGAGAGCCCTAGGGGGAAATGCACCCTAGTGCGTGTGGGT
GGTGTAGCCTAGAATTAGCTGAGGAGTGGCGCTGGGTGATGGAGTTGGGCTACAAGATGGGTGTGTCTGG^ 

GCCAGTGGGTGGGAAGAATGAACTTGGACTGTCCCCACCCAGGCAGCCTGTCCTCCTTGCTGCGGTCGGTGTGGGGACCCCTGAAGGAO^ACGAGAG 
CACCATCAGTTTCTACACCCGCCAGATCCTGCAGGGACTTGGCTACTTGCACGACAACCACATCGTGCACAGGGACATAAAAGTAAGCCCTGGGGCT 
GGCCCGCCCTCATCTGCTGAGGGTGGCGGGGAAGGAAACATGGTCCCAGACGCAGGCTGGC^ 

AGGGAATTCATCAGGGAGGCTCCTTAGAGAAGGAGGGGAAAGGTTCAGCTCTGAAGCCAAACCAATTGTCAGCTCCCAGAGGGGCATATAAGTGGGA
TGGCAAGGGGAGAGTGTTGACAGAGGCAGGACCAGGCTGGGATCTCCAGGAGATGGGATCCCAGAAAGAGGGTCTAGTTCTGCTGGCAGGGGCCCAG
ACCAGTGTCCACTAATGGCTGTCAATCAGGGGGACAATGTGCTGATC^ 

GCTGGCAGGCATCACACCTTGCACTGAGACCTTCACAGGTAACAAGGTGTGGGGGTGGTGAGTGGGGATGCTGACCTGGGGCAGAAGATCTCTTGG 
ACTTTCCAGGGAAGGGGGTTTCTCCTAGGCATGGAGTTAGAGTCCTAATGGGGAATGAAACCCAGGGTGAGTGATTGGGCCAGGGCTGAGGTGGTGG
CATCTGGAACTGGAGCTTGGAGAAGTGATGAAATTGACTTTAGCACTTGAGGAGTGTTGATGCAGGCCAAGGTGGGAGTTGGAGGAAAAAGCAGGTG
ATGGAGTTCAGGGGGACGAGGCAATCCAGGATGTGTGCTGGAGCACTACTGTGGGTGATGGAGTCTGTGAATGTGGTGGAGCCCAGGATGAATGACG
GAAGCCTTGGCAGGCGATGGAGCCCAGGGTGGTGATGGAACTCAGGATAGATGTTTGATGACCAGGATAGGTGTTGAGGTATATTTTGGATGATAGG 
TTCAGTATGGGTGCAGTCGGCCTGAGGATGACGGCATTCCCCTCCCTGCTCCAACCCCATGATGAAGCCTAGATGGGCATTGGTGCTCCTGTTTCTG 
TCCTAGGAACTCTGCAGTATATGGCCCCAGAAATCATTGACCAGGGCCCACGCGGGTATGGGAAAGCAGCTGACATCTGGTCACTGGGCTGCACTGT
CATTGAGATGGCCACAGGTCGCCCCCCCTTCCACGAGCTCGGGAGCCCACAGGCTGCCATGTTTCAGGTGAGACCTCTTTGGGCCTGGCCCATGGAA
GTGGCATAGGGCCAGCCTGGGCCTTGGACACCTCTGAATGTCACCCTTGTTCACCCTCCCCCCACAGGTGGGTATGTACAAGGTCCATCCGCCAATG
CCCAGCTCTCTGTCGGCCGAGGCCCAAGCCTTTCTCCTCCGAACTTTTGAGCCAGACCCCCGCCTCCGAGCCAGCGCCCAGACACTGCTGGGGGACC 
CCTTCCTGCAGCCTGGGAAAAGGAGCCGCAGCCCCAGCTCCCCACGACATGCTCCACGGCCCTCAGGTGCTTGCGGGTGGCGGGATGGGCACAGTGG 
AGGGCAGAGGAGTTGTGGAGCTGTGAGGGTGGAGGAGAGCCCAGCTGATGGCTCrGTCCCTCCCTCAGATGCCCCTTCTGCCAGTCCCACTCCTTCA 
GCCAACTCAACCACCCAGTCTCAGACATTCCCGTGCCCTCAGGCACCCTCTCAGCACCCACCCAGCCCCCCGAAGCGCTGCCTCAGTTATGGGGGCA 
CCAGCCAGCTCCGGTGAGACCAAGGCTCCTGGGATGGGTTCCTTTTGCCTGGTTTTGCCTTTTTCTCCTCTCCTACAAAAGCCCTCCCTGGGGACCT 
GGGCACCTTGACCACTTGTGAATGTGGCATTTGGTGCCTCCCCATGCCCTGTTCTGGCAACTCTAACTTGGACTTGTTGTGTCCTCTCAGTGCGGGG 
TTTATCTATGTTCATGCCTGTGAGGTAGGCATAAAAGTAGGAGAGCCAGGATTCAAACCCGGCAACAGAGGTGACTGGCCGGTTGCTGCCCGTCCTC 
AATCAGCAGCTCGCCCTTGTGCCCAGGGTGCCCGAGGAGCCTGCGGCCGAGGAGCCTGCGTCTCCGGAGGAGAGTTCGGGGCTGAGCCTGCTGCACC 
AGGAGAGCAAGCGTCGGGCCATGCTGGCCGCAGTATTGGAGCAGGAGCTGCCAGCGCTGGCGGAGAATCTGCACCAGG

GGGCGGCCCACTCTAGCGCCCCCTGGTGGTCGCAAAGGGTCAGGACACCTGTTGATGTCCCCGCCCCCTACCTCCGCGGCAGCGTCTCTTTTGGCTG
GAGCACTCGCTGACAACCCCTCTCTTCCTTCTCAGGGGGCCCGTCTGGGCAGAAACCATGTGGAAGAGCTGCTGCGCTGCCTCGGGGCACACATCCA
CACTCCCAACCGCCGGCAGCTCGCCCAGGAGCTGCGGGCGCTGCAAGGACGGCTGAGGGCCCAGGGCCTTGGGCCTGCGCTTCTGCACAGACCGCTG
TTTGCCTTCCCGGATGCGGTGAGGGCGCCTTCTGGTAGTTAGAGCTGCCCAACAACAGGTAGTGAGTTCCCCGTGCCTGGTGAGGTGCCAGGACTTG 
ACGTTCCCAGATTCCCTGTCTCCTCTGTGAAGATCAGGCCTGCAGCAAAAAACCGTCTGCCCAGTCCTAGCTTCCCAGGCCTTCCCTGGACCTCAAG 
CCCTGCCCACCNNNNI^NlWNiraNNNNNNlTONl^

AAGCAGATCCTCCGCAAGCGCCAGATCCGTCCACACTGGGATGTTCGTTCTGGACTCACTGCTCAGCCGTGCTGTGCGGGCAGCCCTGGGTGTGCTA
GGACCGGGTAGGAGGGGAATGCCACGGGCCTCCTGCCAGCGAGGGTGTGGTGCTTGCTGGGCGCGGTGAACATCGAATGGGGCCGGTGGGGGCCAGG
CGGGACAGGGGCAGGCAGGCGCCGGTGGGGGCGTGACTAAATGGACATCTCCCATACCGGCACCG 

CCCCGCACTTTGGGGTCTGGGGTTCCCAAACGGGCGTTGTAGCCTTGCCTGAGTTCACCGCACTGGGCTGCAGAGGTGGAGAAGGAGGCGGTCTCAC 
CGAGGTCAGAGGAGCTGAGTAATGAAGGGGACTCCCAGCAGAGCCCAGGCCAGCAGAGCCCGCTTCCGGTGGAGCCCGAGCAGGGCCCCGCTCCTCT 
GATGGTGCAGCTGAGCCTCTTGAGGGCAGAGACTGATCGGTAAAGCCCCTTGGAATCGCTCATAGCCTCTGCCGGGCTTCAACCCACTCACCACTCA
CCGCTCCATTCTACTCTTCTGCGCCCCACAGGCTGCGCGAAATCCTGGCGGGGAAGGAACGGGAGTACCAGGCCCTGGTGCAGCGGGCTCTACAGCG 
GCTGAATGAGGAAGCCCGGACCTATGTCCTGGCCCC^GAGCCTCCAAGTGAGTGGGCCTTGGCTGGAGAACATTATTGGTGGGTTCAAGGCAGCTTC 
TAGCACAAGGCACCCTTGGGCTCATCACCTGACTTGGGCTCATC^CCTGAGCGGGCCTTGGCGGGAGAACATTATTGGTGGGTTCGAGGCAGCTTCT 
AGCACAAGGCACCCTTGGGCTC^TCACCTGACTTGGGCTCATCACCTGAGTGGGCCTTGGCTGGAGAACACTATTGGTGGGTTCGAGGCAGCTTCTA 
GCAGGAGGCACTCTTGGGCTCATCACCTGACTGCTTCTTGACAGCTGCTCTTTCAACGGACCAGGGCCTGGTGCAGTGGCTACAGGAACTGAATGTG
GATTCAGGCACCATCCAAATGGTGAGTCGGGGAGTGCTTGGCTATCCCTGACCTCAC<^GCCAAAGCCT^ 

ACTGTGCCCTCTGATGCAGAGAATCTCACTACCAGAAGAACACTTAGTATTCTCGTCCCATTGTATGGATGAGAAAACTGAGGCCCAGAAGGGTGTG
ATCTGCCTCATTATTTTGGCAACAGAGCAGAGCTAGTACCTGGGTCTCGACACATCTTTCTGTAATTTCTTATTTCCCTCTTTCTGCGGGTCTTTGA 
CCACATCCAGCTGTTGAACCATAGCTTCACCCTCCACACTCTGCTCACCTATGCCACTCGAGATGACCTCATCTACACCCGCATCAGGTACATCCTG 
GGCCCCCCAAGTGTGAACCATGCATATGGCCTCATCCTGGCCAACTGCAGGCCT^ 

TAGGGACAGAGGGGGCACTGGGCAGACAGTGTGTGCAAGATAGAGTCCAGGGCCACCTTCTGGCTGAGCCACACTGAGTCCATTACCCCCAGGGGAG 

GGATGGTATGCCGCATCTGGAGGGCCATCTTGGCACAGCGAGCAGGATCCACACCAGTCACCTCTGGACCCTGAGAGCTGAATGAGGGCATCATAGG 

CC^GACAGGCCCCAAGGATGGATGAATGGAGAGGACAAAGGCAGCTTCTGACACACCAGCCCCAGGACCTGGGGCGACTGGAGGAAGCCAGGCGAG 

GGGGCCCAGGACTGGTTCCAGTGAGAGAAACCAACCACAGGCACCCAAGCACTACCAGACAAAGCGTA 

TGGCAATTTTGGAGGTGTATGGTGGGAGGGGTATCCTGGGAAGAGCCCCCTAGGTCTTT 

ATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGCAGATCACGAGGTCAGGAGATCAAGACTATCCTGGCTAACATGCTGAAACCC 
ACTAAAAATACAAAAAAATTAGCTGGGTGTGCTGGCGGGCACCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGTGTGAACCTGCGAG 
GCGGAG CTTGC AGTGAGCCGAGATTGCG CC ACTGCACT C CAGCCTGGGC AAC AGAGCG AGACTTTGT CTCAAAAAAAATAAAT AAATAAATGAAAAA 
AAAATGTTTTTAGGCCGGGTGCGGTGGCTCATGCCTGTAATCCCCGCACTTTGGGAGGCCGAGGTGGGTGGATCATGAGGTCAGGAGTTCGAGAAGA 
GCCTGACCAACATGGTGAAACCTCGTCTCTACTAAAAATGCAAAAATTAGCTAGGCGTGGTGGTGCGTGCCTATAATCCCAGCTACTCAGGTGGCTG
AGGC AGGAGAAT CG CTTGAAC CC AGG AGGCAGAGGTTGCAGTG AG CCGAGAT CG CGCCATTG C ACTCC AGAGCGAGACTCCGT CT C AAAAAAAAAAA 
AAAAAAAAACCTTTTTTAGAGACAAGGTCTTACTCTGTCATCCAGGCTGGAGTGCAGT 



AGCATCTCGCTTTGTGGCATAGCTCACTGCAGCCTTGAACTCCTGGGCACAAGCAATCTGCCTGCCTCAGCCTCCTGAGTAGCTGGGACTACAGGCA 
TGTGACACCACCACCTGCAGCTAATTTTTAAAATTTTGTCGAGATGGGGTCTCATTATGTTGCTCAGACTGGTCTTGAACrCTTGGGCTAAAGCAAT 
CCTTCTGCCTCAGCCTCCCAAAGCATTGGGATTACAGGCGTGAACCACTGTGCCT
ATCTGGCCCCTCTTACTTCCCTGGCTCTTTCCTAC^TCACCTTTCTTCCT^ 

TTATTGACTTTAGCCAAGGGCAGGCCCTGAGATGGGGGTCCAGAGAGAGAGGCTTGGTGGGGCTACGTCCTGGGGGCCAGGTTGGTTCTGAGGGGTA 
GAAGGCCATCCACCCACTCGCACGGCTGCTCCAGGAGGGCTTGCCACAGCTGCTTCTCCTCAGGTGTGGAATCCATCCAGGGCACCTGCAGCCCATA
GCTGCTGCCTGGATGTGGGTGGGCAGGGGTTGAGGGCATGATCACACTGGACACCTTGGGGCCCCCAAACACCTGCCTGGTCCCAGATTATGCACrG 
TCCCCCGTTACC^CCCTCAGTTCTCAGTAGATATCAAAGCTC!AGGAAAGGACAGAGAGGTATCC^GGGCCACACACTGAGCCCCCATCTCCCAGGGC 
CTGTCCCAGGGCCTGTCTCAGGGTGCTGTCCCACTTACCGGTGCCCAGGCTGAGGCGTGTGCCCCCCAGCTGGCGGTTGGCCAGGGCCCCATGGTCC 
CAGAGGGAGAGCTCGGCACAAGCCTGGCGCIAGGTCAGCAGGCCCAAAGCCATCGTACACCATGGTGTGATTGAACACAGGGCTGAGGCTGCGTCGCA 
CAACCCTTGTACGCTGGCGGCTGGCCCGGCTGTCATCAGGCAGCACGAAGCTGTGCGGGAGGTGGCCTGGCATCAGGTGACCCCCCTCACACTCCCT 
AGAGTCATCCAGTCACCACGTCCCACTCATTCTCCCTCACATATGTCCATGTCCCCTGGCTCCTGCCTGCCTGGCCTTAGACCCTCACTGCCCTGGG 
ACCTTTGCACCAGCCTCCTCTCAGGTCTCCTGGCTTTAAAAGTTACAGCTCCTGGCCGGGCGAGGTTGCTCACGCCTGTAATCCCAGCACTTTGGGA 
GGCCGAGGTGGGTGGATCACCTGAGGTCAGAAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCTGG
GCATGGTGGCATGCACCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGTTGGAGCCCAGGAGACTGCGGCTGCAGTGAGCTGAAATCA 
C AC C ACTGC ACTCC AGCCTAGGC AAC AG AGTG AGACTG CATCT TAAAAAAATTAATAATAGCCGGG CGCGGTGG CTC ATGCCTGTAATC CC AGC ACT 
TTGGGAGGCCG AGG CGGGCGGATCACGA GGT CAGGAGATTGAG AC CATC CTGGTTAACATGGTGAAAC CCCGT CTCTACTAAAAAT ACAAAAAAATT 
AGC TGGGCGTGGTAGCGGGCGCCTGT AGTCC CAGCTACTC AGGAGGCTG AGG CAGGAGAATGGTGTGAACC CAGGAGGCGGAGCTTGCAGTGC ACCA 
AGATGGGGCCACTGCACTCCAGCCTGGGCAACAGAGCGAGANNNNNNNNITONN^^ 

AGAGCAATGTCTCTGAAATGCAGTTCTGCTATTCTCGCTGGCTCAGTGACTTTTATTATCTCCCTGTTGCTCCACCACCACCCCTCCTCAAGCAGTT 
TCCAAGGTTTTTTTTTATTTTTTGAGATCGAGTCTCGCTCTGTTTTGCCCAGGTTGGAGTGCAGTGGCGTGATCTTGGTTCACTGCAACCTTCTGGG 
TTCGAGTGATTCTCCCGCCTCAGTCTCTCAAGTAGCTGGGATTACAGGCACCCACCACCACGCCCGGATAATTTCTTGTACTTTTGATAGAGATGGG 
GTTTCAC(^TGTTGGTCAGGCTGGTCTCAAACTCCTGACCTCAGGTGATCCACCCGCCTCAGCCTCTTAAAGTGCTGGGATTACAGGCATAAGCCAC 
CGTGCCTGGCCTGTTTCCAAGGTTTTCAAAGCCTGGGAACCCCCTCTGCAAAATACAGTGTTGCTCAGAAGTACAGTACCTCTGTACTGCAGCCTCT 
GGCTGCATCAGAGTGGACACCCAGAAGTGACCCGCCCACCTCCATTTCACTCTCTAACAGTGGGGGCAGCACCTGGAGGCACCCCTAGAAGTCCCCT 
AGAAGCTTACACTGAACTATAAGTTCCCAGCCAAGCCCCACAGCCTGGCCTCCCAGGTTCTCTATGGAAGGACCCTTGCAGCCCTCAGCCTCAGCTT 
TTCAACAAGTCCTATAGGCGCCAGTCACGCTGAAATGCAGCTCCTCAAAGGTGTCCTCAACTTCCCTGACTCCAGGCCTTTGCTCAAACCGTCCCCT 
TCACCCCTGCTCAGTGGTGAAAGCCCAGGCTGCGTGATCAAAGGTTGATTAGTAAGGGTTTGTGGAGGTTGGGCACTCCACTGCAGGCAAAAGAATG 
GGAAGCCCGGAGGGCCAGCACTCCTCACCATTGTACGTAAGTGTCCAGGGATCCTGCCCGCAGCGGCAGGAGGTCCCGAGCCTCCTTCACCCAGAAG 
TGCAGCTCCCCGCTCGGGGGCAGTCCTGCGCCTGCGGAGAGGCTCGGCTCAGGCCCGCACACGGATGGAGTCCACCGCCCCCAAACCCGGGGACTGG 
AGTCTCCCGCTTTTCAGGGCGTGTCCAGCTTGGCCTCTCCGGCTGTCACTCACCCTCGGAGCCGGCGGGGACGTACTTGAGGGACAGGGCGAGTAAC 
CCGCGGCTCGGAAGGTCGTCGGGAGAGGGTGGGACCTGCGAGAGGCGGGCTTATCGTGGAGCAGGGGCTAATGGAAAGGGGCCTCAGGAGGGTGGGG 
TCTGCAGACCCGGATGGGGTGCGGCTAGAAGAGACGGCGGGGCCTGCAGGGGCAGAACCTAGGGGTCCTAGCGCCCTCGGGGAAGGGGCCGTGGCCG 
GCCGGGCTTCGCCTGGGGCCATGAGGTTAAGAGGTTGAGGGGAGGTTCTGTCGAAGGCGGGGTGTGAGGGGGCTGCGGGAGAGCCTGGTGGGGTAAA 
TGCAGGAGACCCGGGCCGCAGGTCTCCCCACGCGGCCTGGCTGCCTCACCCGGGGCTGCAGGGGGAGCCAGGTGGGCTCAGAGCCCCAGTCCCACGT 
GTCCAGGGGCACTTCAACTTCGCCCAGAAAGATGTTGCGACCCAGGCTTTCGCGGTGCCACACAGACAGGCTCAGCACGCGGCCCTGAAGCTCGGCC 
TGCGGGACGGAGTACTGAGGACAGGAGGTGAGCTTTAGTGGAGCACTGCTGGTGGCCCGCGTAGATGCTGCGCGAGTTGGCTGAGGCAGGTCCTACC 
GCGTTCCTACCATGATGGAATTCTGGGGAGCCCCTAAGGGCATTAATGGGGACCTGCACTATCACCACGCCCACCACGAGGCTGACAACACTGGATA 
ACGAGTTAATGGGCCCCCTCCGCCGATCCTCCGGGCAATGACGTTACTGGCTGCCCTTGCCACCTAGCCCCGCAGGAAGGGGGCGTCCAGTTCATTA 
ACGGGGAACCGCATCGTGGTCACAGCCTCACCCGGAGAGTCTCGTTGAAAACCGGATTCAGATTCCGTTTCTTCACCGCCGTCTTGCGCTTGCTCTG 
CTTATCCGGGAGGAGGTAGCTTTTGACGTAGCTGGGGAGGTGAGAACCCGAGGTGAGGAGGGGCC^ 

CCCCAGCCACCATCCCGAACACGTGCCAGGCCTCGTGCTAAGCGCTTTACAGGGATTGTCTCGCTTAATCGCACGCCTCCCAATAACGCTGTTGTTC

CCATTTTGAAGATGAGGACGAGGAGGCCGAGACCCTGTGAATGGTGGGCTGGGCTGTGACAGCCGCGCCCCGCTTGGCCGGCGGGGCACTCACGGGT 

CCGAGCGGCGGCGCCGGGCGGCGGCCAGGCCCTGGCACTGGATCACGTGCACGCGCAGCTCGGCGGCGCCCGGCTCGTAGTGCAGCGCGAAGTGCAC 

GGAGCCGCGGACCTGCACCGCCTCCGCGTCGCCTGACAGGCTCATCTGGCTGCCGCTCAGCTGCGGACGGGAGTGGGTGGGAGTCAGGGCCGCTCGC 

AGGGCTCACGTCGGGGCCGCGGGCAGCGCGGAGGCTTCAGACCCGGGTCATAGGTGGCAGGCTGGGGGGCTGCAGCGGTGGGTCCCGCGGCGAGGGA 

GGGGCAGGCAGGGGCTACGCATCGGAACGGGTCCACCCCCCGGCCGCCCGGGTCCCCTCCCTCCCGCCTCACCGTGGAGGAGTTAAGGCTGGACACC

GAGGAGCTGCTGCTGAGCATGCGGTCGAGAGAGGGGTCGGGCCCCGGGGCCTCCTCCCCATTCTCCAGGATCTGGGACGCGGCCTTGGTCTGCAGAG 

GGGGCAAGTGACGGGGCGCCCCTTCTTTCCCCACCCCTGCCCCGAGCGCCCTGGGTCGCGGGCGGCTCGCAGCCCTGGGCAGGAGCGCTCCCCGCCC 

GGAGTCCGCTCCGGATGTTGAGAGCAGCCACGGGCCACTCCCGCCTACCTGGGCTTGCTGGGGCCGCGGCTCCTGCTCTCCCCCCGACGCGGGCTCC 

AGCTCCGGGTCAGCCTCCTCGGCACAGACCTGTTGGTCTCCTTGGCCAGGATCTGGGGTGGGGCACGAGATGCTAAGCGAGCCCTCTCCTGCCCTCC 

CCTAAGCAAATTTGTAGGAACTGGTTTACAACCCCAGCGAGCTGCAGATTCAGGCCCCGGTGCTGCCCTGGGATCGTCCCCACCTCCGACGCGGGGT 

GGGAGTGGGGGTGACGAGGGTGAACACAGGTGGTTGGGGCCCCTTAGAGCTGCCTCTGACCCGCGAAACCTCTCCGTGCTCCCCAACAGGCGCTCCC 

ACTCACCTTCCTGGGCCTGGGACGCCTCCTCAGGATCTGAAGCCTTTAGGGGGACAGAAGGCGATGGGAAATCAGGTCCCTGAAGGAAGAAAGGGGT 

AGAGGTCACAGTACGGGGGCTGGTGAGGAAGAGGTCAAAGGTCATATTGGGGTCAGAGGTCATAGGGCTCCAGCCAGCTACCCC^GTTTCAGGTTTA 

GGGCAGCCCTCCGGGCAGGGCTGGCAGCCAGCTGCAGGAGCGGTGAGACCAGTGCTTCACAGCCCACCCCGCAGGGCTTGTGCCGCCTGACTCCCTA 

CTCCCATTGGTGGGGTCATTCGTACCTGAGTCCTCCACCTCCGCCAGGCCAGGCCAGTCCCATTCCTTTCTCACTCCTTTTTTTATTTTTATTTTTT 

ATATTGACCCATTGGTAGACAACCTTTTTTGCTTTTTTTTAATITONNNNHNNN^ 

NNNNlJNNNNNNNNNNl^NNNNNNlsnSN 

NNNNNNNNNlWNlTONNNNNNNNNNNmNNNNN'NNNNNNNNNNNTTCAAGCGATTCTCCTGC 
CCMCATGCCCAGCTAATTTTGTATTTTTAGTAGAGATGGGGTTTCTCC^^ 

TCCCAAAGTGCTGGGATTATAGGCGTGAGCTGCCACGACCGGCCTTTTTTTTTTATTTATTTTTTATAGACATGGGGGGCATTTTGCTATGTTGCCC 
AGGCTGGTCTTGAACTCCTCGGCTCAAGCGATCCTCCCACATCAGCCTCCCAAAATGCTGGGATTACAGGCATGAGCCACCGTGCCTGGCCCCACTT 
CACCCCTTAATAGCCAGATTCAGAAGTTCCCTTAGAGGCTGGGTCCATATACCACTCACTGTTCACCACCGTTCACCACTGTTTACCACCATTCACC 
ACCGTTCCCCACCGTTCACCACTGTTCCACCCAGGAGGTGATGAGGTGCAGACACAGGAAGGAACATTTCCAGGGTCTCCCCATTCCCAGGCTGGGG 
CTCTTTCCCCCCTGCCCCCTGCACATTCCACCTCCCCTCCAGGGACACTGTACTCACATGTTGGCTCATTCACrTACCTCAGTCTCCCTGAGCCTCT 
CCTGAGGGGCCTCATCAATGGTGAGCCTAGGAGTAGGGAGAGGACGCTCTCAGAGGGAGGGACCCTGGTGACTGGTGGGCACCATCTCCCCCTTCCT 
GAGACTCCTCCTC^CCTGGGCTCTGGCCCCrCTTCCTTCTCTTTCACAGCAGCCTCAGCCTCCCTGTCGTGGCCTGG 

GGTGAGAAGTTCTGTGAGCCCCCATGCTCAGGAAGCCTGGCTGTGACTCGGGAGAGGCCCTGGGAGATACCCACCCACCCCATGCCCACGGAGGGGC 
ATTGCCCACTGAAAGACACACAGTACTGGGGGCAGCCAGGACTGGAGCCCTAGTGGTCCTCTCTCCCTGTCCACATACAGCCAGCGCCCTGGACGGC 
ACATGGCTTGCCCTGCCCCGAGGATCTCCTCT(^CCCCTGGTGCTCTTCTTCCTGCGCATAGACGCTCGGACAAGGTCAGAGCCGAAGTGGGCATTG 
TGGTGCCGCTGGGAGCGTGCTTCCTGGAACCAGTCCCCTGTCAGGATCTTCAGCTGCCCAGGGTCTGCCACTGAGGCCCGGAGCTTGCTGGAGGAAG 
GGGGAGGGTGTGGCTGTCATGACTCACCCAAGCAACATTCCCGCCACCTACATTGTTAACAGGCTTGGGACCAGGGGCCAGGGACCCAACAGCAAAT 
CAGACTTGATCTGTGCTGTGGAGGAGCTCGGGCTGCTGGGGAGACCAAGTCCAGAAGAGACAAGTACACTGTAGTGAGGTGGATGATGTGATAGAAG
TAAGAAGAGCCAGCTCTGCCTGTGAGAATCCAGGAGGGCTTCCCAGAGGAGGTGACATTTGAATTGCCTTTATTTATTTATTTACTTAATTATTTTG
AGATGGAGTCTCCCTGCTATGCCCAGGCTGGAGTGCTGTGGCGCAATCTC^^ 

CAGCCTCCTGTGTAGCTGGGAOTACAGGCGTGTGCCACCATGCCCAGCTAATTTTTTATATATTTTTTTGTAGAGACGG 
AGGCTGGTCTCTAACTCCTGGACTCAAGCAATCCACCTGCCTCAGCCTCCCACAATGTT^ 

GACAAATTGGAACTCTCCAGGTAGAGAAGGATAATAATTATTATACTTTTAATTTTATTATTTTTTTTTGAGACAGGGTCTTGCTCTGTTGCCCGGG 
CTGGAGTGCAGTGGCATGATCATGGCTCATTGCAGCCTGACCTCCTGGGTTCAAGCAATCCTCTCACCTCAGCTGCCTAAATAGCTGGGACTACAGG 
CGCACACCACCAOTCCAGGCTAATTTTTTATATTTTC^ 

CTCCCGTCTCAGCCTCTCAAAGTGCTAGGATTACAGGCGTTGCCACTATCTCrGGCCAATTCCTCTTAAATCAGTGAAGTAGGACTCCTTAGCCCCA 
TTTTACAGACGACX5AGCATGGTGCTCAAGGAGGTCAGGTGAGTTGCCAAGGCTACGTAGCTAGGAAGTGTCTGTCCCCAAACATTGCTTGTAACGCC 
CATGCTCTTCTACAACTGTGCTCTTCCAGGAAGAAGTACCAGCATAGGCATAG 

GAGGGGTCACTTTGTGTCTGTGTGTCGGGTCGGAGTTATGGTGAAAGAGGAGATGGTGGGGGCTGGCAAGGACCGACCAGGGAGGCCCAGTGGACCA 
TGCCTAGGAGCTTGCCCTTGATACTGGGGAGCCAGGGGAAGTTTTCAGCGGGGATATTAAGGACGGATCTGGCCAGGCACGGTGGCTCGCGTCTGTA
ATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCATCTGAGATCAGAGGTTCAAGACCAGCTTGGCCAACATGGCGAAAACCCATCTCTACTAAAA
ATACAAAAATTAGCTGGGCATG^TGGC^TGTGCCTATAATCCCAGCTATTTGGGAGGCTGTGGCAGGAGAATCGCTTGAACCCGGA 
TGCAGTGAGCCAAGATTGTGCCACTTCACTCCAGCCTGGGCGAAAAAATGAAACT 

CACCAGCTGCAGGAGGTGGGCCCAAGAGGCAAGCCTGGAGGCAGGGAGGCTGGCGAGAAAGCTGGTGCAGTCACCCCTGGAGAAGTGATGAGCTGTG 
AGCTGGCCAAGGCCAGAGGAGACTGAAGGGCAAAACCAAGAACCACGCAGACATCTGACAGGCAGTGGTGTCGGTAGGGAGCAGGATGGGGGCCACT 
CAAGCCTCTTGGTGCTCCCCGGCCAGGGTCTGCCCTGCCTTACCTGACCCGCCCCTCCTCCAGCTGGCGCAGGCGGGCATCTCGTTGGAGGACGCCA 
GCAATGGCCTCCTGCTCCTCCTCTGTCAGGAAGCTGAGGTCCAACAGTCCTTCAGTCTCAGGCTTTGGCCCATGCGCCATGGGGAGGGAGGGCAGAG 
CCCAAAGCCCCTCTTGCGATGGGTGGCCCCTCTGGG^ 
¥ A GTCAG 

HUMAN SEQUENCE - mRNA 

GAATTCGAGGATCCGGGTACCATGGGGCTGGAGTGGGGACCGAGGCCCTGCTCACTCCCCTGGTGGGCCGGCTTGCCCGCCTGCTGGAGGCCACACC
CACAGACTCTTGTGGCTATTTCCGGGAGACCATTCGGCGGGACATCCGGCAGGCGCGGGAGCGGTTCAGTGGGCCACAGCTGCGGCAGGAGCTGGCT
CGCCTGCAGCGGAGACTGGACAGCGTGGAGCTGCTGAGCCCCGACATCATCATGAACTTGCTGCTCTCCTACCGCGATGTGCAGGACTACTCGGCCA

TCATTGAGCTGGTGGAGACGCTGCAGGCCTTGCCCACCTGTGATGTGGCCGAGCAGCATAATGTCTGCTTCCACTACACTTTTGCCCTCAACCGGAG 
GAACAGGCCTGGGGACCGGGCGAAGGCCCTGTCTGTGCTGCTGCCGCTGGTACAGCTTGAGGGCTCTGTGGCGCCCGATCTGTACTGCATGTGTGGC
CGTATCTACAAGGACATGTTCTTCAGCTCGGGTTTCCAGGATGCTGGGCACCGGGAGCAGGCCTATCACTGGTATCGCAAGGCTTTTGACGTAGAGC

CCAGCCTTCACTCAGGCATCAATGCAGCTGTGCTCCTCATTGCTGCCGGGCAGCACTTTGAGGATTCCAAAGAGCTCCGGCTAATAGGCATGAAGCT 
GGGCTGCCTGCTGGCCCGCAAAGGCTGCGTGGAGAAGATGCAGTATTACTGGGATGTGGGTTTCTACCTGGGAGCCCAGATCCTCGCCAATGACCCC
ACCCAGGTGGTGCTGGCTGCAGAGCAGCTGTATAAGCTCAATGCCCCCATATGGTACCTGGTGTCCGTGATGGAGACCTTCCTGCTCTACCAGCACT
TCAGGCCCACGCCAGAGCCCCCTGGAGGGCCACCACGCCGTGCCCACTTCTGGC^

TGCCCAGGGCGACCAGTGCTTGGTGCTGGTCCTGGAGATGAACAAGGTGCTGCTGCCTGCAAAGCTCGAGGTTCGGGGTACTGACCCAGTAAGCACA
GTGACCCTGAGCCTGCTGGAGCCTGAGACCCAGGACATTCCCTCCAGCTGGACCTTCCCAGTCGCCTCCATATGCGGAGTCAGCGCCTCAAAGCGCG
ACGAGCGCTGCTGCTTCCTCTATGCACTCCCCCCGGCTCAGGACGTCCAGCTGTGCTTCCCCAGCGTAGGGCACTGCCAGTGGTTCTGCGGCCTGAT
CCAGGCCTGGGTGACGAACCCGGATTCCACGGCGCCCGCGGAGGAGGCGGAGGGCGCGGGGGAGATGTTGGAGTTTGATTATGAGTACACGGAGACG
GGCGAGCGGCTGGTGCTGGGCAAGGGCACGTATGGGGTGGTGTACGCGGGCCGCGATCGCCACACGAGGGTGCGCATCGCCATCAAGGAGATCCCGG

AGCGGGACAGCAGGTTCTCTCAGCCCCTGCATGAAGAGATCGCTCTTCACAGACGCCTGCGCCACAAGAACATAGTGCGCTATCTGGGCTCAGCTAG 
CCAGGGCGGCTACCTTAAGATCTTCATGGAGGAAGTGCCTGGAGGCAGCCTGTCCTCCTTGCTGCGGTCGGTGTGGGGACCCCTGAAGGACAACGAG

AGCACCATCAGTTTCTACACCCGCCAGATCCTGCAGGGACTTGGCTAC^^ 

TGATCAACACCTTCAGTGGGCTGCTCAAGATTTCTGACTTCGGCACCTCCAAGCGGCTGGCAGGCATCACACCTTGCACTGAGAC^ 

TCTGCAGTATATGGCCCCAGAAATCATTGACCAGGGCCCACGCGGGTATGGGAAAGCAGCTGACATCTGGTCACTGGGCTGCACTGTCATTGAGATG 

GCCACAGGTCGCCCCCCCTTCCACGAGCTCGGGAGCCCACAGGCTGCCATGTTTCAGGTGGGTATGTACAAGGTCCATCCGCCAATGCCCAGCTCTC 

TGTCGGCCGAGGCCCAAGCCTTTCTCCTCCGAACTTTTGAGCCAGACCCCCGCCTCCGAGCCAGCGCCCAGACACTGCTGGGGGACCCCTTCCTGCA 

GCCTGGGAAAAGGAGCCGCAGCCCCAGCTCCCCACGACATGCTCCACGGCCCTCAGATGCCCCTTCTGCCAGTCCCACTCCTTCAGCCAACTCAACC 

ACCCAGTCTCAGACATTCCCGTGCCCTCAGGCACCCTCTCAGCACCCACCCAGCCCCCCGAAGCGCTGCCTCAGTTATGGGGGCACCAGCCAGCTCC 

GGGTGCCCGAGGAGCCTGCGGCCGAGGAGCCTGCGTCTCCGGAGGAGAGTTCGGGGCTGAGCCTGCTGCACCAGGAGAGCAAGCGTCGGGCCATGC7 

GGCCGCAGTATTGGAGCAGGAGCTGCCAGCGCTGGCGGAGAATCTGCACCAGGAGCAGAAGCAAGAGCAGGGGGCCCGTCTGGGCAGAAACCATGTG 

GAAGAGCTGCTGCGCTGCCTCGGGGCACAC^TCCACACTCCCAACCGCCGGCAGCTCGCCCAGGAGCTGCGGGCGCTGCAAGGACGGCTGAGGG 

AGGGCCTTGGGCCTGCGCTTCTGCACAGACCGCTGTTTGCCTTCCCGGATGCGGTGAAGCAGATCCTCCGCAAGCGCCAGATCCGTCCACACTGGAT 

GTTCGTTCTGGACTCACTGCTCAGCCGTGCTGTGCGGGCAGCCCTGGGTGTGCTAGGACCGGAGGTGGAGAAGGAGGCGGTCTCACCGAGGTCAGAG 

GAGCTGAGTAATGAAGGGGACTCCCAGCAGAGCCCAGGCCAGCAGAGCCCGCTTCCGGTGGAGCCCGAGCAGGGCCCCGCTCCTCTGATGGTGCAGC 

TGAGCCTCTTGAGGGCAGAGACTGATCGGCTGCGCGAAATCCTGGCGGGGAAGGAACGGGAGTACCAGGCCCTGGTGCAGCGGGCTCTACAGCGGCT 

GAATGAGGAAGCCCGGACCTATGTCCTGGCCCCAGAGCCTCCAACTGCTCTTTCAACGGACCAGGGCCTGGTGCAGTGGCTACAGGAACTGAATGTG 

GATTCAGGCACCATCCAAATGCTGTTGAACCATAGCT 

GGGGAGGGATGGTATGCCGCATCTGGAGGGCCATCTTGGCACAGCGAGCAGGATCCACACCAGTCACCTCTGGACCCTGAGAGCTGAATGAGGGCAT 
CATAGGCCAGACAGGCCCAAGGATGGATGAATGGAGAGGACAAAGGCAGCTTCTGACACACCAGCCCCAGGACCTGGGGCGACTGGAGGAAGCCAGG
CGAGTGGGGCCCAGGACTGGTTCCAGTGAGAGAAACCAACCACAGGCACCCAAGCACTAC^ 
AAAAAAAAAAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGAACTTGCTGCTCTCCTACCGCGATGTGCAGGACTACTCGGCCATCATTGAGCTGGTGGAGACGCTGCAGGCCTTGCCCACCTGTGATGTGGCCG 



ACAGCTTGAGGGCTCTGTGGCGCCCGATCTGTACTGCATGTGTGGCCGTATCTACAAGGACATGTTCTTCAGCTCGGGTTTCCAGGATGCTGGGCAC 
CGGGAGCAGGCCTATCACTGGTATCGCAAGGCTTTTGACGTAGAGCCCAGCCTTCACTCAGGCATCAATGCAGCTGTGCTCCTCATTGCTGCCGGGC 
AGCACTTTGAGGATTCCAAAGAGCTCCGGCTAATAGGCATGAAGCTGGGCTGCCTGCTGGCCCGCAAAGGCTGCGTGGAGAAGATGCAGTATTACTG 
GGATGTGGGTTTCTACCTGGGAGCCCAGATCCTCGCCAATGACCCCACCCAGGTGGTGCTGGCTGCAGAGCAGCTGTATAAGCTCAATGCCCCCATA 
TGGTACCTGGTGTCCGTGATGGAGACCTTCCTGCTCTACCAGCACTTCAGGCCCACGCCAGAGCCCCCTGGAGGGCCACCACGCCGTGCCCACTTCT 
GGCTCCACTTCTTGCTACAGTCCTGCCAACCATTCAAGACAGCCTGTGCCCAGGGCGACCAGTGCTTGGTGCTGGTCCTGGAGATGAACAAGGTGCT 
GCTGCCTGCAAAGCTCGAGGTTCGGGGTACTGACCCAGTAAGCACAGTGACCCTGAGCCTGCTGGAGCCTGAGACCCAGGACATTCCCTCCAGCTGG

ACCTTCCCAGTCGCCTCCATATGCGGAGTCAGCGCCTCAAAGCGCGACGAGCGCTGCTGCTTCCTCTATGCACTCCCCCCGGCTCAGGACGTCCAGC 

TGTGCTTCCCCAGCGTAGGGCACTGCCAGTGGTTCTGCGGCCTGATCCAGGCCTGGGTGACGAACCCGGATTCCACGGCGCCCGCGGAGGAGGCGGA 

GGGCGCGGGGGAGATGTTGGAGTTTGATTATGAGTACACGGAGACGGGCGAGCGGCTGGTGCTGGGCAAGGGCACGTATGGGGTGGTGTACGCGGGC 

CGCGATCGCCACACGAGGGTGCXSCATCGCCATCAAGGAGATCCCGGAGCGGGACAGCAGGTTCTCTCAGCCCCTGCATGAAGAGATCGerCl^ 

GACGCCTGCGCC^CAAGAACATAGTGCGCTATCTGGGCTCAGCTAGCCAGGGCGGCTACCTTAAGATCTTCATGGAGGAAGTGCCTGGAGGCAGCCT 

GTCCTCCTTGCTGCGGTCGKSTGTGGGGACCCCTGAAGGACAACGAGAGCACCAT^ 

CACGACAACCACATCGTGCACAGGGACATAAAAGGGGACAATGTGCTGATCAACACCTTCAGTGGGCTGCTCAAGATTTCTGACTTCGGCACCTCCA
AGCGGCTGGCAGGCATCACACCTTGCACTGAGACCTTCACAGGAACTCTGCAGTATATGGCCCCAGAAATCATTGACCAGGGCCCACGCGGGTATGG 
GAAAGCAGCTGACATCTGGTCACTGGGCTGCACTGTCATTGAGATGGCCACAGGTCGCCCCCCCTTCCACGAGCTCGGGAGCCCACAGGCTGCCATG 
TTTCAGGTGGGTATGTACAAGGTCCATCCGCCAATGCCCAGCTCTCTGTCGGCCGAGGCCCAAGCCTTTCTCCTCCGAACTTTTGAGCCAGACCCCC 
GCCTCCGAGCGAGCGCCCAGACACTGCTGGGGGACCCCTTCCTGCAGCCTGGGAAAAGGAGCCGCAGCCCCAGCTCCCCACGACATGCTCCACGGCC 
GTCAGATGCCCCTTCTGCCAGTCCCACTCCTTCAGCCAACTCAACCACCCAGTCTCAGACATTCCCGTGCCCTCAGGCACCCTCTCAGCACCCACCC 
AGCCCCCCGAAGCGCTGCCTCAGTTATGGGGGCACCAGCCAGCTCCGGGTGCCCGAGGAGCCTGCGGCCGAGGAGCCTGCGTCTCCGGAGGAGAGTT 
CGGGGCTGAGCCTGCTGCACCAGGAGAGCAAGCGTCGGGCCATGCTGGCCGCAGTATTGGAGCAGGAGCTGCCAGCGCTGGCGGAGAATCTGCACCA 
GGAGCAGAAGCAAGAGCAGGGGGCCCGTCTGGGCAGAAACCATGTGGAAGAGCTGCTGCGCTGCCTCGGGGCACACATCCACACTCCCAACCGCCGG 
CAGCTCGCCCAGGAGCTGCGGGCGCTGCAAGGACGGCTGAGGGCCCAGGGCCTTGGGCCTGCGCTTCTGCACAGACCGCTGTTTGCCTTCCCGGATG 
CGGTGAAGCAGATCCTCCGCAAGCGCCAGATCCGTCCACACTGGATGTTCGTTCTGGACTCACTGCTCAGCCGTGCTGTGCGGGCAGCCCTGGGTGT 
GCTAGGACCGGAGGTGGAGAAGGAGGCGGTCTCACCGAGGTCAGAGGAGCTGAGTAATGAAGGGGACTCCCAGCAGAGCCCAGGCCAGCAGAGCCCG
CTTCCGGTGGAGCCCGAGCAGGGCCCCGCTCCTCTGATGGTGCAGCTGAGCCTCTTGAGGGCAGAGACTGATCGGCTGCGCGAAATCCTGGCGGGGA 
AGGAACGGGAGTACCAGGCCCTGGTGCAGCGGGCTCTACAGCGGCTGAATGAGGAAGCCCGGACCTATGTCCTGGCCCCAGAGCCTCCAACTGCTCT 
TTCAACGGACCAGGGCCTGGTGCAGTGGCTACAGGAACTGAATGTGGATTCAGGCACCATCCAAATGCTGTTGAACCATAGCTTCACCCTCCACACT
CTGCTCACCTATGCCACTCGAGATGACCTCATCTACACCCGCATCAGGGGAGGGATGGTATGCCGCATCTGGAGGGCCATCTTGGCACAGCGAGCAG
GATCCACACCAGTCACCTCTGGACCCTGA

TABLE III



MOUSE NOMENCLATURE 
ICSGNM Fosb 
Celera mCG484 8 

HUMAN NOMENCLATURE 
HGNC FOSB 
Celera hCG20725 

MOUSE SEQUENCE - GENOMIC 

CTCACTCCGCCAGTCTGAAGCATCCGGGCTCTGTCTGTGAAACAGTCCGCGAACTGGGAGGGTCTACCCACTGGCCCCAGCACCCCTCCTCCCAAGG 
CCCCCCATGCTGCTGCTTCATGTGACCAGGGCTGGGCCGAATCTCTGAGCAGACAGAGAGAAGCATGGTGTGACCGCATGAGTGGAGTGAGTTCTGG 
GGACAGTGCCCAGTTGTGCGATAGACAGCAGATTCCGGAGGACCACTGTGCATGTGTGTGTGTTGTTGTATTTGGTTTGTGTTTGTGAGTATGTCTG 
TGTGTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGACAGACAGACAGACAGAGACAGAGACAGAGACAAAGAGAGTTGTGTGAGGA 
CCACTGTGCATGTGTGTGTGTTGTGTGAATGTTTGTGAGTGTGTGTGTTGTTGTATTTGGTTTGTGTTTGTGAGTGTATCTGTGTGTGTGAGAGAGA 
GACAGAGAGACAGAGAGAGAGAGAGAGAGAGAGAGACAGACAGACAGACAGACAGACAGAGAGACAGACAGAGAGAGACAGAGAGAGACAGAGAGAG 
AGAGAGAGTTGTGTGAGGGCCTCCCAGGTGATCTGGGAAGCTTGTGAATGTGAAAGTGCCTGTG 

CCTGTATATTAACTATGTGTGCTCAGACAATCCACTCAAAGATGCTCTGGGGCTGCAGCTGATAGAGCAGCTTACTTAGTGTGCACAAGGCCGTGGG 
TGCAAACACCCCAGAACTACAATCCCAGAACTCTCTCTGCGCACTGAACTCGACTTCCTCATTTGTTTCTTGTTTCTTTTTTTTGACGGAGTCTAAC 
ATTGTCCGGATTGCTTCAAACTTTATAGCGGAGGAAGATCGATGGCAATTCTTCTGCCTCCACCTTCTGAGTGCTAGGATTGCAGGCATGTACCCTC 
ATGCCCAGTTGAGACAGTATCAGTGATTGAACCCAGGGCCTCACGCCTCCTGGAGAAGCCCTGTACCATGGAGCTACAAGTCTAGTTCCTTTTCTTG 
TCCTTAAAAAAAATAAAAAGCTGGGTCTTCTTATAGCCCAGGCTGGCTTTGTACTCCCCATCCTCCTGCCTCAGCCTCCCAAGTGCTGGGATGATAG 
GCTTATGGCACGGCCATGATAGCACCTACAATTTCCAGAGTTGCCTGTCCTTTGCAGTGTACTATGCCTTGTGAGGGACAGGCTCTGAGGGTCTGCT
TGTGATCACACACCTGAGGACCTGTTTGCTGGGAGTTCTGTCCCCATTTCTTTGCCAGGCTGGAAGACACTGACACACCCACCATGGTGACAGTACC
CATGGACTTAAATGGTGTGACTCCCTCTGCTCTAATGGGAGGGTCTCAGTGCCCCGTATCTGCTGTTTCTCAAATGGGGTAGGTAATACAGAGATAG

0 GAGGGCTGACACAAAACATATCAAACACCAAGATAGCAAAGGCTAGAATGAGACCCTGTCTCAAGCTTTCCCATAGAGAAGGGGAGTGTATAATTGC 
S ACCCACTCACAAGACTGTTGGCCATAAACAAGTTTGTATGGAAAATGTTTACAAAGTGCTTGGCACATCCA.GGGGTGGTGGCGCACACCTGGATTCr 

CATACTTGGAGAGAAGAGCCAGGAAGATTAGGGACGGTTCCAGTCAGGCCTACATATTGAGACCTTATCTCAAAAACAAAACAACACAACACACACA 
4* CACACA CAC AC A CAC AC ACAC AC CTC AAAAACCTCTG GAAGTTCAC ATTTTGGGAAAGGC AT CTG AATGGTAAAGTGTAGGG ATT AAGAGTAGAGA C 

TAGAGT CCCTC CATGGG ACC GAG ATGTAG AAAAGAAAGAAAAGAG AGCTTGCTAAAG CAGGCCAG CCTAAGGGC AGACTCCTGAAAGGACTTG CTC C 
P AAATGATTGTTGAGGGTGGAGGACTCTAAGAAACTGGAATTGTGGGAAAAACTCTCGGGGAATGACAAAAGCCGGTTTGCAGGTTGGTGAGATTAGA 
|m4 AAGTAATGGCCACAATATAGGGCACAATATACAGGCTGGTCTTGAACTTGTAATCCTCCTGCCTCAACCTCCCAACTATTATAGGTATATGTCACCA 

1 s I GCTGGATGCTTTTGCCTGTCTTGTTTCTTTCTGTGCCCCAGGGCTTCAGTGAATAGGGATGGCTGCCGATTAAATAACTGAATTGCTAGCAGGGTGG 

GTGAGTCAGGGCTTTAATCCCAGCCTTCAGGAGGCAGAGGCACACAGATCTTTGTGAGTTCGAGGCTAACCTGGTTAACAGAGAGAGTTCCAGGACA 
GTC AGAG CT ACACAG AG AAACCT TGG CT CAA AAA ACAACAAAATATCTGAAT GGC TG ATTGAATG ACTGGG CAGATGGACAG ATAGAC AG ATGGACA 
yk TGTACXSATCCACAGACATGTACGATCTTTGGGATGGTGGGGAAGGCTTGGTAGCTAACACAGGCTGGCTC^AGACCTGCTTCTGGGAACCCTTTCTT 
^ TCCTTTGTATCCTCAGCTCTGGGTCATGAAGGGTGTGGAGTGGGCCGGGAAGGGAGTGTCAGGCTCAGTGGCTGCTTGCCTTGGATTCTGGCCCAGA 
M GTGTCTGGAGGAAGAAGAGGGTATCTCGGCATCTGAGTCACCTTAGAGAAACCCTAGCCACATACCTGACTTCTGACATCACACAACCAGGGCAACC 
fl| CTGGTGGCCGAGACAAGGAGACTGAATCATGGACTGAAATC^GTCACCTTGGGCATGGGCACGTGGCCGCTAGTGACCCCCTGAGACACCCCCGAGT 
| "| GGCTGTCTGGCTGTGTGGGCATGCAAACCACATCAAGGTCAGTAAAGGCTCTCCAGCCTGAGTC^GGGACAGGGTGGCCACTGCCAGAGACCACCGA 
5fjf CCACAGCATCTGCAGGACAGGAAGAGAAAGTTGAGGGAGAAAAGGCAGAGGGTAGGCCCCAGTCAGGGAGAGGGAAGTGCCTGTGGGAGAAAGGGGA 
Q AGCTGTGGCTTGCGGCCAGTAGCCTGGTGACCTGTGGCTGGGATGTGGAATGGGGTCTTAGTGTTTGCGCACAGAATGGCGAA 

rr TCAGAGGACAATGGAGTCAGCAGACAATGGGCTCTTACACTTGC^CACGCAGGCACAGGCGCACAAAAGCCAGCGTGGGAGCCCTACCCrCCAGCCA 
r * CCCCAGAGCTACAGCTTGCCACACACACACAAGACTGTCCCCAC^GACTCGCTTCCTTTACCCTATTAAGTGACATCCGTAATTCTCATTGTGAAG 
ATTGCAACT CT CGGTATTTTGAT GATCT CAT TTG ATATGAAT AAC TTAG AC ATTTGC AGTGC C AGG CA CTGTTGTAGACTATTTAGCTGTGCTAGGG 
ATTGAACCCAGGGCTTCTTGCATTCTGGGCAAACATCCTGCCACTGAGCTTGCTCCCAGACCCACGTGCCCATGGTTTGTCCCCAGACCCCTGTTTT 
ATATGTAATTCTGAGACAAGGTCTCACTGTAATGCCCAGAGACCCTTGAGCTCACTCTATTACCCAGCCTTGAACTCACTGATCCTTCTGCAGGGAT 
TGTAGGCATGACTAGCTAGACACAGCTAAGTATCTGTTAGAGCGTGTATGTGTACCTTGGGGCAGACATCTGTCCAACTGTATATGTCTATACATTG 
TTAGGTTTCAAA CAAAT GGC TGAGGTGC CTATTT AAC ATATC AAG AGG ACC TGGC C CCC AAT TTCTCC CTG CAATAACTC AG C TCCT ATATGCCCT C 
CTGGCTGGCATACCCCACCCGCTATGCTGAACTTCTCCAGCCCAGGGGCTGGGCTGTTCATCCCTATGTAATCCAACTATTTTAGCTCTCCCCTCTC 
TTTGTACCTTAAGGCTGCACCTGGCTCCCATGGCTCCCCTCCCTCTCTCCCCACACGGCTCAGGGTTAAGTCGACTCTGGACTATCCTAGAGGTCCC 
TGCCTCTGGCTATGCTCTCCCACATATCTATAATAAACTCTCCTCTGCATACCTAGGAACAGTCATGTTCCCTTCCTTTCTCTTTCTTAATTTCCTT 
TTCTTTTCATTCATACATACGTTTCTTTGGATTCTGTATTCTATTCTCCCTCCT 

AATCCCAGCACTCGAGAGGCAGACACAGGTGGATCTCTGTGAGTTCGAGACCAGCCTAGTCTACAGATTGAGTTCCAGGCCTGTCTGGCCTATACAG 
TGAGATCCTCAAAAAGAAAAATGTACACACACTACCCCCTTCAAGTCAGAAGACCCT^ 

CAGGTT GTTAC AAATGC CTGGAGTC AGGTCTG AC ACT C ACAC ACAAGGTCCCTGCCACCAGT TGATCCAGC AGCTA CATTCGAAGG AT CAGAGAGGT 

CCAGATAACTGTGCTGCCCTGTCAGACCTTATGAATGAAGACCTGAATGGAGGCTGATGTCTGTAATTCCAGCACTCGGGAGGCACTCAGGAGGCAA 

GGAGGATTGGAGGGAGTTAGCTGGAGGCCAGCCTATGCTATAGAGTAAGACTGCTTCAAACAAGAGAAATGAGAACTGAAAAGATTGCTCAGTGGGT 

AGAGGCGCTTGCCTGCAAGTCTAAGGAACTGAGTTCAATTCCTGACATTTGCATGATGGAAGAAAAGAAACTCCTTCAAGGTCACACAC 

ACACACACTCAAAGAAGTAAAAATACTTTTTAATTTAAAAGAACAGAGAAACTTGAACAC^ 

CCTTGGATGTATGTAGTCCTGGTTTTGACCCCACCATTATACAGACCAGGTGTGGTGATACA^ 

ACGATGGGAGAAGTTCTAGGTGGCTCATCCTTGGCTAAAAAAACATAGCCTGAGCTACCTGACA 

CTCACAAATAATTAAGTAAAAATATGGTAGATTCATGACGACAGCACCTACAGATTGCCTGTCCTGTGCTGGGACACAGGGCAGCTTCACACTAACC 
CTG ATGGGAAC ACCTGGG AGACG AC ATG CTTGTTGCAAGGCG CTGGCTCATCTG AGACCAAAG CGA CTTCAGGAGC C AAGTAG ACC AGAATCCGTGT 
GCTTCAGACAGTGGGAAGGGCATACAGAAACCTCAGGAACTGAGGCACAGCAGGGAGATGGGAGGCTGGGACAAGGGACCACTGACAGACACCATAC 
ATCATACCATCTTGTCCCTCTGAATCTCCATGGGCACCCCCTCTGGGAGGACTCCAGGACTGCAAGCAAAGGTTAACTGAGTCAGCCAATTCAACCC 
TGG CCTAGGGGGTGGAGGCAGGGGGTCT ATCAGCGCG AGG AG ACAGGACAAGGACAC TC ATGT ATTGCC AT AGGG ATGGCT CT CTTGT C AAACC ATC 
AGA ACAC CCAC AAA CAT C CCGGC TC ACAGGGGTCTGC AGGGGT AGGGCTGGG ATGTGGCCCX3 AGACTTGTGGG AACT GAGGAATGTAGTGGCAAAAG 
TTTGGCCAAGGG AAG AC C AG AGG ACTCTGTATGC ATG AAGGC C AGGGT GAT GAC ATTTC AGG AAAGAAAAG CAC AG AATCAAGCATGG TAGGTAGTT 
GGAAAGTGGAAGCGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNm 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNITONNNNNNNNNNNNNNN^

NNNNNNNNNITONNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNI^NNNNNNNNNIWNNNNNNNNNNNNN^ 

NNNNIWNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNl^Nl^NNNNNNNNNNNNI^NNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNJMN^ 

NNNNNENITOlOTNiraNNNNNNNNNNNNNNNNNN 

NNNNNNNNNIWNNNNNNNNNNNNNNNN^ 

NNNNNNNNNI^NNNNNNNNNNNNNNNN^ 

NNNNITONNNNNNNNNN^NNNNNNNNNNNNN^ 

NNKNNNNl^NNNNNNNNNNNNNNNNJTNN 

NNNNNNNNNI^NIWNIOTNNNNNNNNNNN 

TACTAAGCCCCTCCCTGCCAAAGTAAGC!AATAAATTATGGCTAAGAGTCTCCC^^ 

C AACAG ACC AGCTACTTGGAAGACT C AGAACC TAG CC AAGCTGCCTGCAAG AGG TTTAG ACCAAC TAAGACATCTGGAGAGAGCA CCCTCCAATCTG 
TGGAGCAGTCTTCAGGCTCTGCAGGGTGCTCCAGGCTCCCAGCTTTGTAAGCTGTCACTCATGCTGGGGCGGGCTTGGGTGACGCAGCTGTCTTTGA 
GTGACTTCTCCCGTAAGTCAC CC CT C AC CCATGC ACAC TCCTGTAAGTAACT CT AAT AGAAAATTC ATCGGTT CAGCAAATTGGA CTTTGGTC TCTT 
CTGAGCTGGCTCCCTGTCTAGGGTGAATGGACATGTGTGTGTGGCATCTCCCCTGGAAAAGTTTTCTTTCCCTAGGCTTGGCCTGGGGTCTCCCAGG 
CTTTTTGCAGTTGCTGGTGGCATCCTTCTTAGTGGGTACTTAAGAATCAGGACCCTGGCCAGCGATGCAGCAGCCGTTGGCAGAGGGCTCCCCTAGC 
ATG C AG GAGGCC CCTGG ATT ACATC C CC AGTAAT AAACCAGGC CTGGCGAC AAT ATAGAGGTG AG AGC AAG ATGTTCATTGT CATC CTTGGGTACAT 
AGTAAATTCTAGTCTTTGTTTTAAAAACAAACAAACAAACAAACAAGCAGGACATAAGATCAGATAT^ 
CAACACTCAAQAGGTAGAGGTCATCCTGGCCTACAGAATGAATTCCAGGCCAAAGGTGGTA» 

GGAGGAGGGCAAAGGGT AGAAAGAGGAGGGG ATG TTC ACAAT AGG GGG AGGGGATGTTC ACAATACGTGTATAT AT ATT CTT TCCTTCTGG ATGTGG 
ATAGAACCCAGGACTTCACACATGCTAGACATTCATTCCACTTTAAGCTACATTCCCGCCCTCACATCCCCTTTTGTTTAAATCATTAATATTGATA 
TTATTATTATTATTATTATTATTATTATTACTACATATGAGTACTGGTTGTTTTCTGTCTTTGCATGCATGTGGAGATCAGAGAACAATTTTTAGGA 
GTCATTTTTCTTCTTCCACCGTGAGTCCCAGGGATGGGACTCAGGTTGTCAGGTATGTGTAGCAAGCTCCTTTACCTGTGAACCATCTTGCTGGGGC 
TGATACCTGCTGATTGCTAAACACATGCCACCGATTATATTAATGCACTAGTATTTTATTTATTTATTTTTCGAGACAGGGTTTCTCTGTGTAGCCC 
TGGCTGTCCTGGAACTCACTCTGTAGAACAAGCTGGTTTCGAACTCACAGAGATCTCCCTGCTTCTGCTGGGATTAAAGATGTGTGCTGCTGCCGCA 
GCTGCCACCACCACCAGACACCACCAATTATTATTGTTCAATTATTATTGTATACACACTTTCTTTTCTGTCTCCTGAGCTAAGTGGGAAGGAGAGG 
CTTCCAACACCATCAAGCCTGAAGAACTCACCGGGTGTGGGTTCCTACCTGCTCCAATTCTGC^ 

CTATCCTTTGCTTAGCTATGAGTACTGAAGTTTTAGATCAGGGAGGGAGCACATTTTCCAAGCTCACCGAGTCACACAGAATCCGTGACAAAGCTAG 
TGGGGAGGGGACTGCATCCTTACCCATCCCTACACCTGGGGATATCACAGACTCTCCAATTCCCCCTGAGCCTCAGTTTCCCCACCTAGACTGATTA 
TACCCCTCTCGCCAGAATCTGCACTGGGGACCTTGCTCTTTTTCACCAGCAGGGGCGCTGCACCGGTTTGGGGAGGGTGGGGTCCCAGGGGTATAAG 
CAGACCTGGGATCTGGAGTTGCACCTTCTCCAACCCGGGTCAGCAGGGGCCTTCTGAGGGAGTTTAGGCGCCTGTCAATCTCAGCCTCCGGGACAGC 
GTGGAACTGCGCAGGCGCGGGCGGGTTCCGCACAGCGCAACAGCGGGCGCGCGCGGGAGCGAGGGATTCCCTCTGACGTAATTGCTAGGATACCAAA 
CAAACACTGGGCCGCGCTGGCCGAGCTCCTTATATGGCTAATTGCGTCACAGGAACTCCGGGGAGGGCGGGGCGGGATCCCCTCCCGCGAAGCCCCT 
CAG AACGC AGCCTTGGGGAC C CC CGAGA C CC CCAGGGTCACA CTATGGGCAGGTGGCAGG TG CACGAGGCC CTGGAGGGCGCCACAAGG AG CT CAGT 
CGGCGGGAAGGGAGAGTTTGGGAGGTTTGTGCATGGAGTTGCGGGTGACGCAAGCGCGGGGGGCGGGTCCCGGAGGCATAAATTCAGCCCGGCAGCT 
CCCGGTTTCATTCATAAGACTGGAGCGGCTACGCCGGGGACACGCGGAGCCGGGGCTTGCTGGACTTGACTACTGGTGACTCTTTCTTTTTCTTCTT 
C TTGGAGGC CGTGAAAAATTT AT AT ATATAT ATTTAT ATATATTTTTAAACTGG AG AG AAAG TT TTGTGGGTTTC CTT TTTGCAAGACT CTTTCT AG 
ATCGTTCTTCCGATTCGTCCGGAGGATAGACTTTCTTTCTGTGGGCTTCTGGATCGCGCCCCTCTGCTCTCCCTCATTTCTTGGGACGCTTCGACAG 
GATTGGAACTGCATCTGAAGCGCACGCTGCACCGCGGCACTGCCCGGCGGGTTTCTGGGCGGGGAGCGATCCCCGCGTCGCCCCCCGTGAAACCGAC 
AGAGCCTGGACTTTCAGGAGGTACAGCGGCGGTCTGAAGGGGATCTGGGATCTTGCAGAGGGAACTTGCATCGAAACTTGGGCAGTTCTCCGAACCG 
GAGACTAAGCTTCCCCGAGCAGCGCACTTTGGAGACGTGTCCGGTCTACTCCGGACTCGCATCTCATTCCACTCGGCCATAGCCTTGGCTTCCCGGC 
GACCTCAGCGTGGTCACAGGGGCCCCCCTGTGCCCAGGGAAATGTTTCAAGCTTTTCCCGGAGACTACGACTCCGGCTCCCGGTGTAGCTCATCACC 
CTCCGCCGAGTCTCAGTACCTGTCTTCGGTGGACTCCTTCGGCAGTCCACCCACCGCCGCCGCCTCCCAGGTAAGTTCTAGATCGTAAAGATTCTAC 
TTTAGTGGTTGGGGGGGGGTGTTTCTTTCAGGCTAAAGTGTAGATTGAGCATCCTCTCAAATAGAGACGCGCCAAAACCCAGGATCTGGGATTGCAA 
TAGTTCGGTTTGCACTTTGGTTTTTTGTTGTTTGCAAACTGCAAAGAATGGAGTGATGTTTGCAAAAGGTTATTTGCGCGGAGCGCGGGAAAGGAAT 
GCAGCTGGGCAAACGTTGGCGATGCCCGGTGCAAAGTATATACCCGGTGGTTAGCAGAAGCTGAGAACTTTTAGCCGAAAGCCGGCTCCCTAAGCCG 
AAGCTAGGCAAGTAGGGGAAGAAAAAGAAACAAAAAATTCC^GAGAAGCTTCCAGGAGCCTCCTCCTCTTCCCTCTTCCTTCAAAACGCGGACTGC^ 
AGTCCGCAGTCACCCTCCACCCAGCAAGAGTTAGGGCCTCGAACCCCGGTCACGCTGCCTCCGCCTCCTGCGCGGAGACGTAACGGGGGACCCGTGC 
GTAAAGGCTGACGCGCTGGAATCCTCCGTCTGACGCGGGGCACGCACAGCGCGCAGCGCCCCCTCCGCCCGCCCCGCCCCTGACGTCCCGGGCACGT 
TCTATTTTGGAACGCCGAGGCCACGTTGCTAAGGGAGGGGGCAGCGTGGCTTTGTGATTGGCTGTCGCGGCGCGAGCTTTAGCCAATCAGCGTTCCC 
TTCCTATTTGTAGAGCGTAGCTCCCTTCCTTGCTTTTTGTGGTTCTTCCCGTGCTGGGGGTCTCCAAGAGGGAGAGCTAGGGGATTCTTGTCGCGAT 
CGGGGGACTCGTTGTCACCCCATGGGTCTGCGAGGACCTTGTGTGGACCTGGTCCTGTTGTCATAAGCTAGAGGCTTTTGGCTGAGTGTTAGCGCCT 
CTAAGGGGGAACTGAAGGCCTCATCCCTTCTCCAGGCACACATATAC^ 

GATGAGCTAGCCTACGGAGAGGCAGCCAGGTGGTCTCTAAAAGGTCCGCCTTCCCTTAGTTCCCAGGGCTCTGATTGGCCAGGGATTCAGCCCTTCC 
CTCGCCACGCCCCCTAGAGTAGTTAAGCCTCTAGGATTCCACTTGCGGGAAGGGGGGGGGGGGCGTGATGGACGCTTCTTGGGTTCGGGGACGCAGA 
TCCTATGTCACCOTATCCCCTGCAAGACAGTCTGAGAGATTCTCGCTGTCACTTTTCTCTGCCTATCAGTTCACTGAAACCTGTCAGTCTCACTGGG 
GAAGAGACAGACACTCGGAAGGGATGCTCTCAACTCTTAGGCCGGTCCCCCAACACCGTTGGAACTGGGATCTCCGCCCCTGCGGGAGCCCTCATGC 
AGT GGGGGGTGTGT TTGTGT GTG AGTGGAGG AGAGGAAGGCTTGGGCT AAGGCCT CT CCCT C TC CCTACCTACTGTGGTGGGGGTGGGGTGTTTTGG 
C TGTATGTGTGTGTG ANNNNNNNNNNNNNNNNNNNNCTGACTTGAAGGGGG TAGTGTGTGTA CATTTTTCT TTTTGAGA CAGG ATC TC AC TGT ATAG 
G CC AGACAGGC C TGGAACTC AAT CTGTAG ACTAGGCTGGT CT CGAACT CAC AGAG AT CC ACC TGT GTC TGTGG CTCC AG TCC CGGGATTAAAGGTTT 
GCATCACCAGGTTCTGTCCAGCCTCCGTCTCCCACCCACCCCCCCACACCT 

C AACCTTTC CCCGAGGAAGAAGGAGGAGGTAGAAGGCAGTTG AACAGAATCTCTCATTAACCACTGCGTCA CGGTGTAGTGGAAGGGT GGGTGTTGT 
GGCTTTTTGCCTGTGACACACACATCCACACCCGCTCACCCTGTGCTCACTCACAGGGGTCGGTGTGTGTTATGTGTGTTGGGCGTGTGTGTGTCGG 
TGGCTTTGTTTGTGTGTCTACGCCTGTGTGTGTATGTCTCACCCCGTAGGAGTGCGCCGGTCTCGGGGAAATGCCCGGCTCCTTCGTGCCAACGGTC 
ACCGCAATCACAACCAGCCAGGATCTTCAGTGGCTCGTGCAACCCACCCTCATCTCTTCCATGGCCCAGTCCCAGGGGCAGCCACTGGCCTCCCAGC 
CTCCAGCTGTTGACCCTTATGACATGCCAGGAACCAGCTACTCAACCCCAGGCCTGAGTGCCTACAGCACTGGCGGGGCAAGCGGAAGTGGTGGGCC 
TTCAACCAGCACAACCACCAGTGGACCTGTGTCTGCCCGTCCAGCCAGAGCCAGGCCTAGAAGACCCCGAGAAGAGACAGTAAGTATGAGGCCTCAG 
GAGTTGGGATGGAGGAGCCTAGCTAGGGATGTGGGCTCAGTTTGT^ 
GGTTATGCAGACCTGTAACCCCAGCTCTCAGAAGGTGGAGGCAGGAGGAGCAGGAGTTCGAGGCCAGCCTGTGCTACTTATGGAGTCCAGCCTGCAC
TGCAAGAGATCATTATTTTCAAAAGTTGGCCTTGGGGGGAGGTGGGTGAGGG 

TTCCTCTGAGGCCTCAAGTCTGAAGGAACTTTACCATTCTGGCCAGTGAGGAGTAGGGGTTATTATTTGGGGTTCAGGAGGAAGGAGTTTTCTTAGG 
GCTGATAGAGGTACCCCCAGATCTCATGGTCCTTATCTCTGACTCAGCTTACCCCAGAAGAAGAAGAAAAGCGAAGGGTTCGCAGAGAGCGGAACAA 
GCTGGCTGCAGCTAAGTGCAGGAACCGTCGGAGGGAGCTGACAGATCGACTTCAGGCGGTAAGGAGGAGTCTGGGGGTGTCTTGAGGCCGTGCTGGG 

GTGGCTCCATATGCATGCTCAGACCCATGCCCACTTACTTT^ 
CTCAGCCTAAGGAGACAAGCTGGAGGAGGTAATTCTCTCACC^ 

CCTGAG CTGGGGATCT ACCTGTGT AGTT CAGCCCT C CT C CCCCAACTTG ATAGCCTCAAGTTTCAGCCC TTGGCTGAGATGCCATCATC CTGACTGG 
CTCTGG CTGGAAACTATTTTGTGCTAAGTCAATT C CTTACCTG CT ACTCCAGCT ATCTACAGTTC TGCCGAACT TGAGCTGGTGG CGCCC ACCAAGC 
CCACTTCTTTCTCTCTTCTACCTCAGTGCAACCCCCACACACACACACACACCTCATGCCTGCCCCTTGAAACCAGGGTGTGTCTCTGATTTCCCGT 
CGGGAGGCTGAAGGAGATGGGTAACAGAACCTCATTAAAAACAACACATAAGCATTACCTACTGACTCAACAAACTGTAGTGTTTTTCTTTTTTCCT 
CTCAAAAAATTATTTCGTTTGTTTATTTATTATTTGCTTATGTTTGAGTGAGTGCTGGTGCACCACAGCACACATACGAGGTCAGAGGGAAATTTTC 
ATAGTTTGTTCTCTCCTTCCGTGTTGTGGGTGCTTGCTGGCAATCTCCTTCACTCAGTGAGCTACAATGCCCCCTTCTGCCCTTTAAGGCAGAGTAC 
TCCTTAGTACAGGGGGACCCTTTCCTCGGCCTCTCAAAGTTGAGATTACAAATGTTCACCATCACACCAGGCTTGGAGTTCTTGCCTATCAGTGACG 
TCCACTCCTGCCTAGCTTCTTCCCAACCATCTCTTAG^ 

GTCCCCTCAGTTGGGAGGGAGCTGTCCAGCCCCCTGGATCAGCAGCAAGAATGTATGAGTGTGGGGTTGGGCGGGTGAAGCTACTCTGTGTGGTCGC 
TGACCAGCAATTCTCCTTTCTCTGTCTCCTATGACCTGGCCCTGCTGGGATCCATTAGGAAACTGATCAGCTTGAAGAGGAAAAGGCAGAGCTGGAG 
TCGGAGATCGCCGAGCTGCAAAAAGAGAAGGAACGCCTGGAGTTTGTCCTGGTGGCCCACAAACCGGGCTGCAAGATCCCCTACGAAGAGGGGCCGG 
GGCCAGGCCCGCTGGCCGAGGTGAGAGATTTGCCAGGGTCAACATCCGCTAAGGAAGACGGCTTCGGCTGGCTGCTGCCGCCCCCTCCACCACCGCC 
CCTGCCCTTCCAGAGCAGCCGAGACGCACCCCCCAACCTGACGGCTTCTCTCTTTACACACAGTGAAGTTCAAGTCCTCGGCGACCCCTTCCCCGTT 
GTTAGCCCTTCGTACACTTCCTCGTTTGTCCTCACCTGCCCGGAGGTCTCCGCGTTCGCCGGCGCCCAACGCACCAGCGGCAGCGAGCAGCCGTCCG 
ACCCGCTGAACTCGCCCTCCCTTCTTGCTCTGTAAACTCTTTAGACAAACAAAACAAACAAACCCGCAAGGAACAAGGAGGAGGAAGATGAGGAGGA
GAGGGGAGGAAGCAGTCCGGGGGTGTGTGTGTGGACCCTTTGACTCTTCTGTCTGACCACCTGCCGCCTCTGCCATCGGACATGACGGAAGGACCTC
CTTTGTGTTTTGTGCTCTGTCTCTGGTTTTCTGTGCCCCGGCGAGACCGGAGAGCTGGTGACTTTGGGGACAGGGGGTGGGGCGGGGATGGACACCC
CTCCTGCATATCTTTGTCCTGTTACTTCAACCCAACTTCTGGGGATAGATGGCTGGCTGGGTGGGTAGGGTGGGGTGCAACGCCCACCTTTGGCGTC
TTGCGTGAGGCTGGAGGGGAAAGGGTGCTGAGTGTGGGGTGCAGGGTGGGTTGAGGTCGAGCTGGCATGCACCTCCAGAGAGACCCAACGAGGAAAT
GACAGCACCGTCCTGTCCTTCTTTTCCCCCACCCACCCATCCACCCTCAAGGGTGCAGGGTGACCAAGATAGCTCTGTTTTGCTCCCTCGGGCCTTA 
GCTGATTAACTTAACATTTCCAAGAGGTTACAACCTCCTCCGGGACGAATTGAGCCCCCGACTGAGGGAAGTCGATGCCCCCTTTGGGAGTCTGCTA
ACCCCACTTCCCGCTGATTCCAAAATGTGAACCCCTATCTGATTGCTCAGTCTTGCCCTCCTGGGAAAACTGGCT(^GGTTGGATTTTTTTCCTCAT
CTGCTACAGAGCCCCCTCCCAACTCAGGCCCGCTCCCACCCCTGTGCAGTATTATGCTACGTCCCTCTCACCCTCACCCCCACCCCAGGCGCCCTTG
GCCGTCCTCGTTGGGCCTTACTGGTTTTGGGCAGCAGGGGGCGCTGCGACGCCCATCTTGCTGGAGCGCTTTATACTGTGAATGAGTGGTCGGATTG
CTGGGTGCGCCGGATGGGATTGACCCCCAGCCCTCCAAAACTTTCCCTGGGCCTCCCCTTCTTCCACTTGCTTCCTCCCTCCCCTTGACAGGGAGTT
AGACTCGAAAGGATGACCACGACGCATCCCGGTGGCCTTCTTGCTCAGGCCCCAGACTTTTTCTCTTTAAGTCCTTCGCCTTCCCCAGCCTAGGACG 
CCAACTTCTCCCCACCCTGGGAGCCCCGCATCCTCTCACAGAGGTCGAGGCAATTTTCAGAGAAGTTTTCAGGGCTGAGGCTTTGGCTCCCCTATCC 
p= TCGATATTTGAATCCCCAAATATTTTTGGACTAGCATACTTAAGAGGGGGCTGAGTTCCCACTATCCCACTCCATCCAATTCCTTCAGTCCCAAAGA 
fl CGAGTTCTGTCCCTTCCCTCCAGCTTTCACCTCGTGAGAATCCC^CGAGTCAGATTTCTATTTTTTAATATTGGGGAGATGGGCCCTACCGCCCGTC 
2t CCCCGTGCTGCATGGAACATTCCATACCCTGTCCTGGGCCCTAGGT^^ 

i'U CCCAAAAAGCACTTATATCTATTATGTATAAATAAATATATTATATATGAGTGTGCGTGTGTGTGCGTGTGCGTGCGTGCGTGCGTGCGAGCTTCCT 
| s I TGTTTTCAAGTGTGCTGTGGAGTTCAAAATCGCTTCTGGGGATTTGAGT 
■J? CTCCTCTGGCTGTTGGAGACAGTCCCGGCCTCTCCCTTTAT^ 

U CTCTGATCTCCGGTCTGTCTGTTAATTCTGGATTTGTCGGGGACATGCAATTTTACTTCTGTAAGTAAGTGTGACTGGGTGGTAGATTTTTTACAAT 
|,-». CTATATCGTTGAGAATTCTGGGTGGAAATGTCTGATCAGGAGAAGGGCCTGCCACTGCCGACCACAATTCATTGACTCCATAGCCCTCACCCAGGCT 
GTATTTGTGATTTTTTTCATTTTGTTTTTTTGTATTTTGCACCTGACCCCGGGGGTGCTGGGGCAGTCTAGCACTGGGCAGCTCCCCTCCCCCCCTT 
GGTTCTGCACTGTCGCCAATAAAAAGCTTTTAAAAAACTGTATTCTTCAGGTCAAAGTGTCTGTTTTCCCTGGACATCTACTACATGGCTTCCTTTC 
AGAAAAACGGAGTTTGGATTGCTAGGGAGGTCTTGCTGGCACTTAGTGGGACGCCTAATGAATCAGAACCTACAACGGGACTAAAAGGAAGTGGAGA 
CTTGCTAGGTTTTCCCATGTTCCCAGGCTGGGCCACCTACTTGAAAAAATAAGGGGCGGAAAAGTGTAAGGTACAAATTTGGTGAAGGGTCTGGAAG 
ACTTCATGATCGGAAAAGAATTTATTCACCTTGGGTGTGCAATGACTTCAGCAACAGCTAAGGGCAAGGTGTAAAAGCTGGGCACACTTGTAAATCC 
TAGCATTTGAGAGGTGGAGGCAAGGGGATCACTGGTGGAGTTCAGTGTCATGTGGATCGTAGATACCAAGCGCAAAGATCTGCTATGGGGAGAGGGC 
TTGGTACACCAGGGGAGCCAGAAGTTCGTGGTGAGGGTAGTGGAGGGCAGGTGGAGAGTGAGAGTTAGCCTCAGGGAGATTCTACAGGCAATGATGC 
AGAGTTCAGACGCTCCTTTGAAAGCACTAGAGAGCCGCAGCAGGTTTTGAGCAGAGAAGGTTAGCGTTAGGTGGTCTCTTCTAGCCCATCCCAGGCT 
GAGGAGGACGCTGAGGGTTTCAAGAAGGATCGAGAATGGAAAGCAGAGGAGAAAAAGGATCCAAGAGGCATGGAGGAGGCAGAACACATTTCTCTTC 
TTT AAT AGC AAGCC TGGAAAGGATAACTTGCTGCAGG AGG AGATG CTCACCAGTCGGGTGGTCTAGGGGGT TCTTGGAAAAG AGAAGGCATTTGCTC 
AAGCCTCGGTTCCCCCATTCTCGCTCTTCTGTCAGCTTGTCTTCCATTAAGTGTGTGTCTCAAGGCCACCCTGCTCAGGACTCCTTGTGAGACGACC 
TTCTATGCTCGAGTT(^TTAAAAACACAATTGCCTGGTGCCCTGCTCTCTCCACTGGCTCAGTTACCTCAAAAGACCAGGGCTAAAGGTGTGATCAC 
AACTCTATCCCCATTACTGCTCCAACGCAGAGACAGGACTGAGCCGGAGTGAACAAATGAACAAAAATGACTAATAATGCATGCGTGATTAAATACA 
TAAAAGAGCAGATGACTGGATGAGCAAATCGTTTAAGGAGAGACAGCAAGATCCTAGAATTTTGGAGACTAATTTAAATCCATCTTTGAGATGTATT 
TGGTCGGAAATTCCTGGGAGGAAAAAAAGTGTAAGTATGAAGAGAGAATAAATGAGAATAGGGGTGGCTTCAGAGAGGTTAACTGCGCGCTGGTCGC 
TTTTGTACAAGAATGTGAATTGCAGGGAGCAAAATGGGATAGATACTCCCGCCCGAA^ 

TTGAAGCCTGCACCCCAGACCACTGAGGATCATCCGGGCGAAAGGAGCTATTTTCGGTTAGTTATATAAAGGCCAGATACTACTACTTTTTTCA 



TGTTCCGTGAATGACTCTACTAACTAAAAAGTAAGTAGCTTCATTTGCATAGC^ 

ACTAAAAGCAGAATTTTTTGCAAAGTGAAAAGTCAGTTTTATTTTTGTTTGTTTGTTTGTTTGTTTGTTTTTAATGGAAAAACTTCTCACGCGGCCC 
ATT CGT AGCAG AAT TCGAGATTTTCTGC AAGCGAGACCGT AG GGT CTGACGGCACGCGGCCGCAGAGCG ACACCTG CCGTTGCTTT AT AG AACTGCA 
AGT ATGT AGGG AAT CTACTG AGT CC CTAAGGTGATGG AGTTG ACAACCAACTCCCCrTGCGTTTAGACGCTAAAAA CCAT CC CTTTTTATATCTATG 
TGATTAGCCCAGGGAAACTAAGGCTCAGACATGGATAATACCAC^GCCGAGTTCCTGTAGCCCAACTCCCTAGGGGAAATGAAACCTACAGTTGTGG 
TTTTAATATGCTTGGCCCAGGGGCAGTGGCCCTATTGGCAGGAGTGGCCTTATTAGAGGAGGTGTACCTTGTTAGAGGAAGTGTGTCACTTGGAGGC 
GAGGTTTTGAGGTACGTATGCTCAAGTCTGGCCAGTGTGATCCTGGCTGTCTGCAGAACGCGGTCTCCTGGCTGCCTTCGGATCAAGGTGTAGAACT 
CTCAGCTCCTTCTCCAGCACCATGTCTGCCTGCTTAATGCTTTGCTTCTTTCCATGACGATAATGAACTGTGCCTCTGAAACTGTAAGTCAACCCCA 
ATTACACAATTACATGTTTTCTTTTATAAGAGTTGCATATATATATATATATGTCACACTCTTTAGCTCTGGCTGGCCTGAAATTCACTATGTAGCC 
CAGGATTGCCTGAACTTTGAAGCAATCTTCCTGCCTCAGCCTCCCAATGGTATTACAGGCATGAGTCACAACAAGCCATTTAAATCTTATGATGACT 
TATAAGAAGACAGAAAATCAGAGTTCCTTTACCTAGTTCACAGATCCCTACAATCTAACCTCGTTCGCTCCATAAACAGCCCTACCCCACCCTCCTG
GAACTGCTTTGAGGAATGCTGCAGGCTCTCACAGGCACACTCCTCCTTGGTTAAT^ 

CAAAGCCTCTCATCCTGTTCTCAAATACCACTAGCTAGTAAGGCCTCCCCGACCTGACCCaSGTTTAAATATTAGAAAAGGGTCACTTTCTCCCCTG 

CCACAGACAACCAAACCACCATATGCTTGTCACTTACTACCTGACTATGAAGGTGAATAGATGTCTTCACAACCTTTCTCTGAGCCTCAGTTTCCCC 

ACCTGCATAATGCATCTGAGAGA CAGAATTC CCT AGAGCTGTGGT T CT C CT C ATT CCTAGTGCTGGGACCCTTTAATAC ATTTCCTCATGT TGTGGT 

GACTCCACCACCACCATAAAATTATTTTCATTGATACTTCATAACTGTAATTTTTTCTATTGTTATGAATAGTAATGTAAGCATTTGTGTTTCCCAG 

TGATCTTAGATGACCCH^STGGAAGAGTCATTCC^CCCCCAAAGGGGTCCCC^ 

TGAATTACACTTTGGGTCGACTTTTGGGCTGGCCTCTGGGAGGCGNNNNNNNNNNNN 

NNNKNNNNNNNNmiNNNNNNNNNNNNJJNNNNNNNNNNNNNNNNNNNNNNN 

NNNNTTONNNNNNNNNIOTNNNNI^NNNNNNNNI^^ 

NWNTWNWNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNMnraNNNNN^ 

NNNNNNNNimJNNNNITONNNNlWNNNNNNNNNNNNNNNNNNNNN^ 



NNNNICTNNN^NN^^NNNNNNNNNNNNl^ 
NMtfN^NI^2^NI^NNNNNNNKNNNNNNNNNNNNNN^ 



:::: ni^nnnni^nnnnnnnnnnnnnnnnnnnnnn 

M NNNNlsntfNNNNNNl^NNNNNNNNNNNNNNN^ 

JZ NNNNIWNNNNNNNNNI^NNNNNNNNNNNNOT 

s\" NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

f** NNNNNNNNNNNNNNNITONNNNNNNNNNNNNNNNN 

lair NIWNNNNNNNNNNNNNNNNNNIWNNNN^ 

J „ ? NNNNNNNKnSNNNNNNNNNiTONlWNNNlWNNNNNN^ 

^ NNNNNNNNNNNNNNNNNl^NI^NNNNNNNNNNN^ 

35 NIWNNNNNNNNNNNNNNNI^NNNI^ 

NNNNNNNNNNNNNNNNNNNNNNNIWNIWNNNNNNNI^ 

W CAGGCGTTGATGGCTATTAAACCCAAGCCTTGTATATACTAGGCAAGTACCCATCCACTGAACTACAGCCCTAGCATGGACCTCACAGGGCAATGGC 
flf CTGACATCTGGGAAACACTATCAC^GAGTATAATAATAAAACCGTAGCATGCTTT^ 

TTATTG T TATC ATGAGT ATGTGTGC ATAC ATGTGGAG GTG AG AGG ATG ACGT CAT AGAG ACAGCTCTG TCTCCCATTCTTTATTG CCAAT ATTATTT 
W TATGTTTTGTCTGCGTGTATGTTTGTATACCAAAGTGCATGCAGTGCCTGTAGAGACCAAAAGGG 

pjj TGAACCACTCACTCTGTGGGTGCTGGGAATCAAACCCCGGTTCTCTGGCAGAGCAACTGGTGCTCTTAACCACCGAATCAACTTCTCCAGCCCTGCA 

*~ TTGCTTTAATCTTTTGGTCGATTCCCCCAATCCAACTCATACCG^ 

TTTTCCTTTCTTTTTCTTTCTTTTTTTCTTTTTTTTTTTGAGTCAGGTATGATTATTTAGATGGCCTCGAACTCACAGAGATCCACCTCCCTCTGTT 
ATGC CAGGCTT TTAAAAAACAACAT TTA TTT TTGAATGCATA T AT AGGGCTGTG CAAACACAG CCTGAAGGAGTCGTTTCTG ATTGCTCACAACGTG 
GTTCCAGGGATGGAACGCGGGTGACCAGCTTGAAGGTAGGTGCTTTCACCA^ 

AAATGACAC AT GTT C ATTGTGGGTAGTGCAGGT ATG CGATGGAGCAC ATGTAG AGGTG ACATTACTGCAGGCGGT CTCT CT ACC ATGGGG ATTCAAC 

TCAGACCACTGCTTCCTTTACCAGATCATCATTCTTGCATCCCCCAATCATTTCTTTTTGTTGATGGTTTTGTTTGTTTGTTTTAGTTTTGGGGGGT 

TTTTAAAGTTTTTTTCGAGACAGNNNNNNNNNNNNNNNNNNNNNN^ 

NlTONNNNNNNNNNNNNNNNNNl^NNNNNNNNNNNNNNN^ 

NNNNN1^NN]WNNNN1TONNNNNNNNN1^ 

NIWNNNNNN^NNNNITONNNN^NNKNNNNNNNNNNNNNNN^ 

AAC CGGC CTTT CTG CCTTATTTGTTTCG AAACGG AGT CTC ATGTAGCCTGAGGTGGCCTC AAACATGCAATGTAGCTGCAGC TGGG AT TACAT ATGT 
TCACCACCACAACCTGTTTTATGCACTGCTAAGAGATTAAACCTGGGCCrTCGTGCTAGGTAAGGACTCTATTAACAGAGCTACAACCTCTGCCT^ 
TTTTTAGTTTTTC^GACAGGGCTTTGCGACATAGCCCTAACTGGCCTTG 

GATGCTACACCCACATATCTCAGACCTGAGGGTTGGCAGCTTGGGCCATGCTCTGATTTGGGAGTGACATATTAACAGTAGTTTAATACGAATTTGA 
ATCCACCTTACTTTTTTGGTGGCACTGACATTGAACTCATGGCCTTCCACAGGCTTTTACGTGAGAAATGTTTTACCTCCCTTCCTTGAGCCATGAG 
CGAGCCCCrCACTGGTGGAGTCTATGaVGCTGGTCTGAAACTCATACCATCTTCCTGCCCCACTCTCCGAGTGCTGAGTGTCTACAGAGCTCCTGAC 
TCTGCCTGGGACGTGATGTCTGTGGGACCTTGGCTATAGAAGGGAGG<^GTCACACCACCCATTTTTGGCTTCCATGTCGGCAATGGAGAGCTTGCC 
ATACTCACCTGATGTTGTCTGTAC^GAAGGGGGACGGTAAACAAAGCAACCACTCCTACAGATGCAGAGAGAGAGATGGGGGATGGGGGGTGTCAGG 
TTCTCCTGGCAAAGGGTCAGCTCCGTTGAGGGCTTTTCCACTCC^ 

CTTGGGCTAGCTTGGGCTTGTTCCCTGACAAGCCCCTATCTC^TTTAACCCGTTTATTCTATTCTATGAATGCCACATGGCTGGTTACCrCGTCrCA 

GTTTCATGAGACCCTCTCATCAGCACTAGGGCAAACCCTGATTCTATTACTCTCTTCCTGCAGGAGCTCGGGCCCCCTATTTCCT 

AGCCCIAACAGCTTTTGATTGACAGGTGATGCTTCCACACAGTGCACACGAAGATTCTCT 



NNNNNNN1TNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
NNNNNNNNNI^NNNNNNNI^NI^NNNNNNNNNN^ 

G CTGAGGTC AAGTT TAAG ATT AGGA AAG AGAATCAAG AATTATCAAGGATAAAGGGAGAGGT CAGGATTTACAGGGGAGCAGGGT TCATGGTC AG AA 
TGAAAATGGATGAGAGAGGGCAATGCGTGAGCTGAGGTCAAGGGTCACAATGGACGGTCCGTCGGCGGGGGCTGGGCAGGGCCAAGCATTCTCACTG 
GAAAGGGTTGGTGCCGTCGCCCCGGTGCACGGCCTGCAGCACTTTGCGGTAAACCCTGAGAGAGATGGTGGCGCAGAGACCCAACAGGGCCAGGTGC 
G CAGCCACGGAC AC GAT GCT AAAGTGCAGGAGGC AGAGGAGAGAGGCCATG AGGCCCGTG AAGACCGCTCCCGACGT CC TGGTGT CCT TCC AGT ACA 
ACAGGTCTGCCACTGTGGAAAAGAAGACAATGTT6GGACCCGACCTGACCAAGCTCACTCCCTTTCTGTTTGTTTTTTAGAGATAAGGTCTCACTAT

GAAGCTAAGGCTGACTTCAAACTATGAAACCCTGGCTGGTCTTGAACTCCTGATTATTCTAATTCATCCTTTCTTTTTCTTAGACTGAGTTAAGTTT 

GACCGGCTTCAAACTCAAGACAAATACTCCTGCCTCTACCCTTTAGTGCTTC^ 

AAAACCCAACTGAGCTACTTCCCTGAC 

MOUSE SEQUENCE - mRNA 

ATAAATTCTTATTTTGACACTCACCAAAATAGTCACCTGGAAAACCCGCTTTTTGTGACAAAGTACAGAAGGCTTGGTCAC^TTTAAATCACTGAGA 
ACTAGAGAGAAATACTATCGCAAACTGTAATAGACATTACATCCATAAAAGTTTCCCC^ 

GCAAACTAGTGTAGCATAGAAGTCAAAGCAAAAACAAACCAAAGAAAGGAGCCACAAGAGTAAAACTGTTCAACAGTTAATAGTTCAAACTAAGCCA 
TTGAATCTATCATTGGGATCGTTAAAATGAATCTTCCTACACCTTGCAGTGTATGATTTA^ 

TAGAGATATTAAAATGAAAAGGTTTGCTAATAGAGTAACATTAAATACCCTGAAGGAAAAAAAACCTAAATATCAAAATAACTGATTAAAATTCACT 
TGCAAATTAGCACACGAATATGCAACTTGGAAATCATGCAGTGTTTTATTTAAGAAAACATAAAACAAAACTATTAAAATAGTTTTAGAGGGGGTAA 
AATCCAGGTCCTCTGCCAGGATGCTAAAATTAGACTTCAGGGGAATTTTGAAGTCTTCAATTTTGAAACCTATTAAAAAGCCCATGATTACAGTTAA
TTAAGAGCAGTGCACGCAACAGTGACACGCCTTTAGAGAGCATTACTGTGTATGAACATGTTGGCTGCTACCAGCCACAGTCAATTTAACAAGGCTG 
CTCAGTCATGAACTTAATACAGAGAGAGCACGCCTAGGCAGCAAGCACAGCTTGCTGGGCCACTTTCCTCCCTGTCGTGACACAATCAATCCGTGTA
CTTGGTGTATCTGAAGCGCACGCTGCACCGCGGCACTGCCCGGCGGGTTTCTGGGCGGGGAGCGATCCCCGCGTCGCCCCCCGTGAAACCGACAGAG 
CCTGGACTTTCAGGAGGTACAGCGGCGGTCTGAAGGGGATCTGGGATCTTGCAGAGGGAACTTGCATCGAAACTTGGGCAGTTCTCCGAACCGGAGA 
CTAAGCTTCCCCGAGCAGCGCACTTTGGAGACGTGTCCGGTCTACTCCGGACTCGCATCTCATTCCACTCGGCCATAGCCTTGGCTTCCCGGCGACC 
TCAGCGTGGTCACAGGGGCCCCCCTGTGCCCAGGGAAATGTTTCAAGCTTTTCCCGGAGACTACGACTCCGGCTCCCGGTGTAGCTCATCACCCTCC 
GCCGAGTCTCAGTACCTGTCTTCGGTGGACTCCTTCGGCAGTCCACCCACCGCCGCCGCCTCCCAGGAGTGCGCCGGTCTCGGGGAAATGCCCGGCT 
CCTTCGTGCC^^CGGTCACCGCAATCACAACCAGCCAGGATCTTCAGTGGCTCGTGCAACCCACCCTCATCTCTTCCATGGCCCAGTCCCAGGGGCA 
GCCACTGGCCTCCCAGCCTCCAGCTGTTGACCCTTATGACATGCCAGGAACCAGCTACTCAACCCCAGGCCTGAGTGCCTACAGCACTGGCGGGGCA
AGCGGAAGTGGTGGGCCTTCAACCAGCACAACCACCAGTGGACCTGTGTCTGCCCGTCCAGCCAGAGCCAGGCCTAGAAGACCCCGAGAAGAGACAC
TTACCCCAGAAGAAGAAGAAAAGCGAAGGGTTCGCAGAGAGCGGAACAAGCTGGCTGCAGCTAAGTGCAGGAACCGTCGGAGGGAGCTGACAGATCG
ACTTCAGGCGGAAACTGATCAGCTTGAAGAGGAAAAGGCAGAGCTGGAGTCGGAGATCGCCGAGCTGCAAAAAGAGAAGGAACGCCTGGAGTTTGTC

CTGGTGGCCCACAAACCGGGCTGCAAGATCCCCTACGAAGAGGGGCCGGGGCCAGGCCCGCTGGCCGAGGTGAGAGATTTGCCAGGGTCAACATCCG
CTAAGGAAGACGGCTTCGGCTGGCTGCTGCCGCCCCCTCCACCACCCCCCCTGCCCTTCCAGAGCAGCCGAGACGCACCCCCCAACCTGACGGCTTC
TCTCTTTACACACAGTGAAGTTCAAGTCCTCGGCGACCCCTTCCCCGTTGTTAGCCCTTCGTACACTTCCTCGTTTGTCCTCACCTGCCCGGAGGTC
TCCGCGTTCGCCGGCGCCCAACGCACCAGCGGCAGCGAGCAGCCGTCCGACCCGCTGAACTCGCCCTCCCTTCTTGCTCTGTAAACTCTTTAGACAA
ACAAAACAAACAAACCCGCAAGGAACAAGGAGGAGGAAGATGAGGAGGAGAGGGGAGGAAGCAGTCCGGGGGTGTGTGTGTGGACCCTTTGACTCTT

CTGTCTGACCACCTGCCGCCTCTGCCATCGGAC^TGACGGAAGGACCTCCTTTGTGTTTTGTGCTCCGTCTCrGGTTTTCTGTGCCCCGGCGAGACC
GGAGAGCTGGTGACTTTGGGGACAGGGGGTGGGGCGGGGATGGACACCCCTCCTGCATATCTTTGTCCTGTTACTTCAACCCAACTTCTGGGGATAG
ATGGCTGGCTGGGTGGGTAGGGTGGGGTGCAACGCCCACCTTTGGCGTCTTGCGTGAGGCTGGAGGGGAAAGGGTGCTGAGTGTGGGGTGCAGGGTG

GGTTGAGGTCGAGCTGGCATGCACCTCCAGAGAGACCCAACGAGGAAATGACAGCACCGTCCTGTCCTTCTTTTCCCCCACCCACCCATCCACCCTC 
AAGGGTGCAGGGTGACCAAGATAGCTCTGTTTTGCTCCCTCGGGCCTTAGCTGATTAACTTAACATTTCCAAGAGGTTACAACCTCCTCCTGGACGA 
ATTGAGCCCCCGACTGAGGGAAGTCGATGCCCCCTTTGGGAGTCTGCTAACCCCACTTCCCGCTGATTCC^AAATGTGAACCCCTATCTGACTGCTC
AGTCTTTCCCTCCTGGGAAAACTGGCTCAGGTTGGATTTTTTTCCTCGTCTGCTACAGAGCCCCCTCCCAACTCAGGCCCGCTCCCACCCCTGTGCA 
GTATTATGCTATGTCCCTCTCACCCTCACCCCCACCCCAGGCGCCCTTGGCCGTCCTCGTTGGGCCTTACTGGTTTTGGGCAGCAGGGGGCGCTGCG
ACGCCCATCTTGCTGGAGCGCTTTATACTGTGAATGAGTGGTCGGATTGCTGGGTGCGCCGGATGGGATTGACCCCCAGCCCTCCAAAACTTTCCCT
GGGCCTCCCCTTCTTCCACTTGCTTCCrCCCTCCCCTTGACAGGGAGTTAGACTCGAAAGGATGACCACGACGCATCCCGGTGGCCTTCTTGCTCAG
GCCCCAGACTTTTTCTCTTTAAGTCCTTCGCCTTCCCCAGCCTAGGACGCCAACTTCTCCCCACCCTGGGAGCCCCGCATCCTCTCACAGAGGTCGA
GGCAATTTTCAGAGAAGTTTTCAGGGCTGAGGCTTTGGCTCCCCTATCCTCGATATTTGAATCCCCAAATATTTTTGGACTAGCATACTTAAGAGGG
GGCTGAGTTCCCACTATCCCACTCCATCCAATTCCTTCAGTCCCAAAGACGAGTTCTGTCCCTTCCCTCCAGCTTTCACCTCGTGAGAATCCCACGA 
GTCAGATTTCTATTTTTTAATATTGGGGAGATGGGCCCTACCGCCCGTCCCCCGTGCTGCATGGAACATTCCATACCCTGTCCTGGGCCCTAGGTTC 
CAAACCTAATCCCAAACCCCACCCCCAGCTATTTATCCCTTTCCTGGTTCCCAAAAAGCACTTATATCTATTATGTATAAATAAATATATTATATAT
GAGTGTGCGTGTGTGTGCGTGTGCGTGCGTGCGTGCGTGCGTGCGAGCTTCCTTGTTTTCAAGTGTGCTGTGGAGTTCAAAATCGCTTCTGGGGATT 
TGAGTCAGACTTTCTGGCTGTCCCTTTTTGTCACCTTTTTGTTGTTGTCTCGGCTCCTCTGGCTGTTGGAGACAGTCCCGGCCTCTCCCTTTATCCT 
TTCTCAAGTCTGTCTCGCTCAGACCACTTCCAACATGTCTCCACTCTCAATGACTCTGATCTCCGGTNTGTCTGTTAATTCTGGATTTGTCGGGGAC 
ATGCAATTTTACTTCTGTAAGTAAGTGTGACTGGGTGGTAGATTTTTTACAATCTATATCGTTGAGAATTC 

MOUSE SEQUENCE - CODING 

ATGTTTCAAGCTTTTCCCGGAGACTACGACTCCGGCTCCCGGTGTAGCTCATCACCCTCCGCCGAGTCTCAGTACCTGTCTTCGGTGGACTCCTTCG 
GCAGTCCACCCACCGCCGCCGCCTCCCAGGAGTGCGCCGGTCTCGGGGAAATGCCCGGCTCCTTCGTGCCAACGGTCACCGCAATCACAACCAGCCA 
GGATCTTCAGTGGCTCGTGCyU^CCCACCCTCATCTCTTCCATGGCCCAGTCCCAGGGGCAGCCACTGGCCTCCCAGCCTCCAGCTGTTGACCCTTAT 
GACATGCCAGGAACCAGCTACTCAACCCCAGGCCTGAGTGCCTACAGCACTGGCGGGGCAAGCGGAAGTGGTGGGCCTTCAACCAGCACAACCACCA 
GTGGACCTGTGTCTGCCCGTCCAGCCAGAGCCAGGCCTAGAAGACCCCGAGAAGAGACACTTACCCCAGAAGAAGAAGAAAAGCGAAGGGTTCGCAG 
AGAGCGGAACAAGCTGGCTGCAGCTAAGTGCAGGAACCGTCGGAGGGAGCTGACAGATCGACTTCAGGCGGAAACTGATCAGCTTGAAGAGGAAAAG
GCAGAGCTGGAGTCGGAGATCGCCGAGCTGCAAAAAGAGAAGGAACGCCTGGAGTTTGTCCTGGTGGCCCACAAACCGGGCTGCAAGATCCCCTACG
AAGAGGGGCCGGGGCCAGGCCCGCTGGCCGAGGTGAGAGATTTGCCAGGGTCAACATCCGCTAAGGAAGACGGCTTCGGCTGGCTGCTGCCGCCCCC 
TCCACCACCCCCCCTGCCCTTCCAGAGCAGCCGAGACGCACCCCCCAACCTGACGGCTTCTCTCrTTACACACAGTGAAGTTCAAGTCCTCGGCGAC 
CCCTTCCCCGTTGTTAGCCCTTCGTACACTTCCTCGTTTGTCCTCACCTGCCCGGAGGTCTCCGCGTTCGCCGGCGCCCAACGCACCAGCGGCAGCG 
AGCAGCCGTCCGACCCGCTGAACTCGCCCTCCCTTCTTGCTCTGTAA 

HUMAN SEQUENCE - GENOMIC 

CACTAAGCACTCTCACCACCCAATGCCTGGAGTGGTTGTAGTCAGTGAGTGACACATTGCACAGTGTGTGCCCCCTGGATTGGGGGTGGGTGAGAGA 
CAGCCCCCACAGTGAGTGGCACCCCTGGCCAGGGCCTGGGACAAGTCTGTATCCAAGGGTGGCTCTCrGCrTAGGTCTGTGTTTGTACCTGGGTGTG 
TCTGTGTCTGCCCTACTCTGTGCATACTCATATATGTGAACCCGTGTGTGTGTGTGTAGTTGTGAGTGTGTGTGGAAGAGGCTGCATGGCAGTGGAA 
GCTAGGTGTGTGATTCGTGTATCTGTGTGCCTAGAGTGCCTTGTTCTATAGATTTGGATGCCACTCCAAGCAAGTCAGTGGGTCTTTTTGTTTTTTG 
TTTTTTGAGATGGAATCTCACTCTGTTGCCCAGGCTGGAGTGCAGTGGCACGATCTTGGCTCACTGCAACCTCTACCTCCTGGGTTCAAGTGATTCT 
CCTGCCTCGGCCCCCCGAGTAGCTGGGATTACAGGTGCCCACCAATACACTCAGCTAATTTGTTGTATTTTTAGTAAAGATGGGGTTTCACCATGTT 
GGCCAGGCTGGTCTCAAACTCCTGACCTCATGATTCTCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCGCGCCCGGCCAAGT 
CAGTGGGTCTTTAGGAGCTGTTTCGTACGGTGTGACTGTGAGTGAACCTTCGCACACGTGTCTGTACGGATATCTAAGAAATTCTTCAAGTAGGCCG
GCCACAGTGGTTCAGGCCTGTAGCCTTAGCACTTTGGGAGGCCCAGGTGAGTGGATCGCTTGAGCTCAGGAGTTCGAGAACAGCCTGGGCAACATAG
TGAGACTTCGTATCTATAAAAAATATGAAAATTAGCCAGGCATGACAGTGGGCACCTGTAGTCCCAGCTACTACTTGGGGGGCTGAGGCAGGAGGAT
CCCTTGAGCCTGGGAGGTGGAGGCTGCAGTGAGCTGAGATCGAGCCACTGCACTCCAGCCTGGGCGACAGAGGGAGACCCTGTCTCAAAAAAAAAGA
AAAGAAAAGAAAAAAAGAAGACATTCTTCAAGTCCACAACTCTGAGGGTATCACTGTGAGAGCAGGGGTCCTAGCTCTACCCATTTGTGTGTGTGTG 
GAGATGTGTAAGTCTATGTAGACAAAAGTGTGTGTCATTTACTGTGTTTGGGGTGTGAACACCTATGTGATGTGTTTGCACAACTGCACGTGTTTTG 
TTCTGTGTGTGTGATCGTGTGTTCAAATAAGTCATCTTGTCGCCGGGCGTGGTCGCTCATGCCTATAATCCCAGCACTTTGGGAGGCCTAGGCAGGA 
GGACCATTGAGCCCAGGAGGTCCAGACTGCAGTGAGCCGAGATTGCGCCACTGCACTCCAGCCTGGGCGACAGCAAGACCTTGTCTCAAAACAAAAG 
CAAAAA CAAAAATAAACAAATAGTC ATC AGGTGC CTG C ACAG ACAAAGGTGAAAGGTGT CTGCCTGTTG AG AT CTGTGG ATAGGGTGTAT ATATGG A 
CATCTCAGCCTGTCTACGTGTGTATCTGTCTCTGTCCTCACGGCAAAAGAGAGGTTGGCCGGGTGTGTGGTGGCTCACGCCTGTAATCCCAGCACTT 
TGGGAGGCTGAGGCGGGCGGATCACCTGAGGTGAGGAGTTCAAAACCAGCCTGGCCAACATGGCGAAACTCCATCTCTTCTAAAAATACAAAAAAAT 
T AAC CGGGCGTGGTGGCACACGCTT GTAATC CCAGCTACT CAGGGAGGCTG AGGCATTAGAATTGCTTGAACCCAGGAGGTGGAGGTTGC AGT GAG C 
C AAGATTGCGCCAC TGGACT CCAGC CTGGGC AAC AGAACAAG ACTCCGTCT CAAAAAAAAAAAAAAAAAAAAAAAG AGG CTC AGAT ATGTTTCTGT C 
TGAGAGTCTGTCAGAGTTTAGGAATTAGTAAATGAATGAATGG1XSAAGATCCATTTACTCAACAAACATTTATTTATTTATTTATTTATTGAGACAG 
AGTCTCGCTCTGTCACCCAGGATGGAGTGCAGTGGCGCAATCTCGGCTCACTGAAGCCTCTGCCTCCTGGGTTTAAGAGAGTCTTGTGCCT 
CCAAGT AGCTGGGATTACAGGTGTG TGCCAC TGCGCCC AG CTAATTTTTGT ATATTT AGT AGAGATGGGGTTTC ACCAT GTTGCC C AGGCTGGT CTC 
GAACTCCTGGCCTGAAGTGATCTGCTCACCTCAGTCTCCCAATATGCTGGG^ 

TATTATATTGTTTTTATTTTAAATTTTTTTGTAGTGACAGGGTCTTACTATGTTGACTGGGCTGGCCTCAAACTGGCCTCAAGTGATTCTCCTGCCT 

CAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGTGCCTGGCTTGAACAAATATTTAATAACCACCTATTGGGTACCACGTGCTATGCTGG 

GGACAAGGCAGTGACOSGGATGGTTTTGGCCGAGCTTTC^CCCAGCTCACAACCCAATGAGGGAGACAAACCTATCCCCAGACAATGATGAGCCCAG 

ATTGGCAGAGTTGGAGGGAGGGACCCCAGGAGAGGGGGCCTTGACTCAGCCTGGAGATCAGGGAGGGCTTCCTGGAGGAGAGGTTATGGGAGTTAAG 

A CTTGG AGG AAAAG ACT ATG AGC CA C AGG AAGCG AAGGGAAGAGTGACCCAGGC AGAGGGAAAAG CACCTG CAAAGGCCTGAAGT C CAGG GAG AGAG 

GCC AGG CAT CTGGAACACAACGGGG AGAGGAGAG ACGAGACTAGGGCA C CCTCGGAGCAGGTCGT TCAGGG CC TCAGGAGCC ATGGGGAGGAGTAT A 

ATGCAGGATGCAGGGGTACTGGGAGGGTCAAGCTAACAGTGTCCAATCTGGGTTACTCTGCTTCCCTGAAACTCCAGGGACATCTCCTCCAGGAAGC 

CCTAGGGATTGCTCAGTGTGGAGCGGCTCAGCC^CCCAGTCACACTCACTATTCAATGCTGAGCACTGAGGAGGCAGCCGTTACTAAATCAGCCTGG 

ATTTGCCCTCCCAGAGGTCACAGTCACAACAGTCAAACACACACCTGCACGTGCACACTCATGAACGTGCCCCTCCCCTCCTCTGGAGCCTCAGAAA 

CCCACAGACCTTGTTCCTCCCAGGGGGTGTCCCTGGGGCCAAAGCTTCTGCACTACCAGGGGGCTTGTTGGAAATGCAAAAGATGGCTGGGCACAGT 

GGCT CAC AC CTGTA ATCCCAGCAGTTTGGGA AGC TGAGGC AGGCAGAT C ATTTGAGGTC AGG AGTTCATTTGAGGTTGGGAGTTCAAGAC C AGC TTG 

GCCAACATAGCGAAACTCTGTCTCTACTAAAAATACAAAAAAATTAGCCGGGCGTGGTAGCACACGCCTGTAATCCCAGCTACTTGGGAGGCTGAGG 

C AGGAG AAT CTTTTG AACCCG AG AGGTG AAGGCTGCAG TGAGCCAAGATGG CAC CACTGCACT CCAGC CTGGGTGA C AG AGTGAGTCTTAAAAAAAT 

AATAATAAAATAGGCTGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCAGGCGGATCACCTGAGGTCAGGAGTTCGAGAC 

CAGCCTGACCAAAATAGTGAAACCCCGTCTCTACTAAAAATACAAAAAAATTTAGCTGGGCGTAGTGGCTAACGCCTGTAATCCCAGCTACAGG 

AGGCAGGAAAATTGCTTGAACCCAGGGGTGCAGAGGTTGCAGTGAGCCAAGATGGCACCATTGCACTCCAGCCTGGGCAACAACAGTGAAACTCCGT 

CTCAAAAAACAAAATACAAAAATTAGCCAGGCATGGTGGCACACACCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCCG 

GGAGGCAGAGGTTGCAGTGAGCTGAGATCGTGCCACTGCACTCAAGCCTGTGTAACAAGAGTGAAATTCCGTCTCAAAATAAAATAAAATAATAAAA 

ATAAAAAGATGACCACAGCAGATCTGCTGGCTCTCCAGGCAATTCGTGATCACCACGAATTGAGAAGCGCTGCTCTCAGGCTTAACCCCCCTGTACC 

TCAGTTTCCTTTTCTGTACAAGGAGATCAAAACAGAATCCATGTAGAGGACTGACGTGGAGTGAGGCTTAACTGAGATAATGAGTGTGAAAGTGCTA 

AGCACAGCTTCCTAAGTGCTTGTTTTCAGCAGCTCCTCTTCCTCTTTTCTATTTTTTTATTTTTATTTTTTTTGGGGGGGGAACAGGGTCTCGCTCT 

GTTGCCCAGGCTGGAGTATAGTGGCAGAGTCATAGCTCACTGCAACCTCAATCTCTCAGGGCTCAAGTGATGCCCGTGCCTCAGCCTCCTGAGTGGC 

TGGGACTACAGGTGTGTGCCACCATGCCCAGCTAATTTTTAAATTTCTTGTAGAGGTTGGGGGTGGGTCTCACTATGTTGCCCAGGCTGGTCTTGTA 

CTCCTGGCCTCTGGCGATCCTCTTACCTAGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCACACCCAGCCCTCCrCCrCCrCTGTATTAT 

TAAATTTCAGAGCTGGAAAGAAATTTCGAAGACCGCCATCACTCAACCTTCTTGCTCTACAGATGGGGGAACTGAGGAACAGAGGAGCAGAGGTTTG 

GCCAGAGGAAGGTGGGAAGGACGTGGTGTGGCTGGCCTTCACAGACTCTGCCGGGGGCTGTTTACAGGATACTCTGAGGAAAGCCAGGATGGGCACT 

GATACCCACCTCCACCCTGCGGGAGGCAGAAGGCTAAGGCTCCCCCAGCCGTCTCCCCCAAGCAGAGCTGAGCCCTTGGGAGCCAGGGCAGAGATGC 

AACCCTCTCCCCCATGCTCAGGTCCTCCTCCTCCTGGAAGCTCTCCTTCCAAGAGGAGCCTCTTATTTTATTTTTTATTATTATTATTTTTGAGACA 

AGGTCTCACTTTGTCACCCAGGCTGGAAGTGCCGTGGCATGATCATGGCTCACTGAAGCCTTGCCCTCCCCAAGCTCAAGTGATCCTCTCACCTCAG 

CCTCCCAAATAGCTGAGACTACAGGCACACGCCACCACCCCTGGCTAGTTTTTGTATTTTTAGTAGAGATGGGGTCTTGCCATTGTCCAGGCTGGTC 

TTGAACTCCCGGGCTCAAGTGATCTGCCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGACACTGAGCCTGGCAAGGAATGAGCCTTAT 

GGAGAAGAACTTGATTTACCCACTCCATGCTCTGCAGGCAGGTGTGTCATGTCTGTGAGTGCAGGTGTGTGTGTGTGTGTGTATGTGTGTGTGTGTG 

TGTCTCCAGGGCCAATTTCTCCCTTAGGCCCAGGGGCACAGTGCCTAGGGCCCATGATAAATTTAACCGACCCACGGATATGGGTCAGGAGGGTATA 

CATGTATAAAGTTCAGGAAGACTTTGTTTTAGTAACAAAACGAGCATGTACAATCTGTCAGGAGCCTTATGATATCTTCCCACCAGTCTCATGAGTG 

GGCACTACATCTTGTTCTCCTTTGTCGGGTAGGGAAACTGAGGCCCCGGATGTCCTAGCTCAAGTTCACCAAGCTACTTGGTGATAGAAGCGGAACA 

TAAGCCTAGGCCACCTAGCTTCTGGGTCTGCGATCGTGTGAGGATAAATGCCCAGTGTGTTCGTGAACATGAGCATCCCTGTGTGAATAAATTGACA 

TACATAAACCCATTTAATAAATTTCAAAGCCAGGCGCCGTGGCTTACTCCTGCAATCTCAGCGCTCTGGGAGGCTGAGGTGGGAGGATCGCTTGAAG 

CCAGGAGTTTGAGATCAGCCn'GCAACAAAGTGAGACCCTGACTCTAAAAACATTTTTTTGAATAAAAAAATTAGGCTGGACACAGTGGCTCACGCCT 

GTAATCCCAGCACTTTGGGAGGCCAAGGTGGGTGGATCACTTGAGGTCAGGAGTTTGAGACCAGCCTGGTCAACAGGGTGAAACCCT 

AAAATAGAAAAATTAGCCAGGCGTGGTGGCACATGCCTGTAATCCCACCTACTGGGGAGGCTGAGGCCGGAAAATTGCTTGAATCTGGGAGGTAGAG 

GTTGCAGTGAGCCAAGATCATGCCATTGCACTCCAGCCTGGGCGAAGAAACGAGA 

GGTACGCACCTGTAGTTGTTAGTTACTTGGGAGGCTGAAGCAGGAGGATTACTTGAGGCCAGGAGTTCGAGGTTACAGTGAACTATGATTGCATCAC 
TGCACTCTAGCCTGGATGACAGAGCAAGATCCTATCTCAAAATAATACTAATAGTTCAAGGAACAGACTTTGAAGCCTGACACCCTGCATGGTGTTA 
TTCCAGCTTTGCTACTTACTTGCTGTGTGACTCTGGGTGAATAACTTAACCTCTCTGGGCTTCTGTTTCCCTTCCTGTAAAATGATCATTTGTACCT 
C AC AGG AGTGTTGTG AGAATTAAAT AAGTTAATATAAG CTCTTGGGAAGAATTAGCTCTTGTTATGGTTGTGTGAAGAGC TTT AC ACG AGTCC CTCT 
GCAGGATGCATGGTTGTTGGTCTGTGTGTGTGTATATATATATATGTGTGTGTGTATGTGTGTATCTATGTGTATATGTGTATATGTGTGTGTTGCT 
CTC TTG ATCTCTG AGTGTTG TGTGTGTGTCAATGTGTGAGATCATATATGCACCTTT AGT AAAGATCT AGGGGT CTCTGTGT GTT TTGGGGGTCTGT 
ACCTGTCCT TGGCTGTACAC CTG AA CCCTACGTCTTCTTGTGTGTGTAGGG ATGTGTGTCATGTGTGT TGATAACCATAT GAC AGTGTGTGTTTCC A 
CATGCCAGTGTGTGTCTGTGAAGCTGTTTCTATGTGGCTGTGGGTGATTGCCCACGCGTGACCTGGTGAGCGGCTGTGTGGAAGTCTGTGTTGCCTG 
TGTCGGGTGTGGAGTCTCTGGCCAGCCAGGGTAGGTCTTTGCATGTTTCCTTTATTCCATGGAGGAGGAGATGAGGGGCTTGGATGAGACAGAAGAA 
TGAGGGGTCACTCCCTGTCTGATG^TGGGGCCGAGTCACrGCCTACTAGTGGCTGCTGTGTCATCTCCCAGTCTCTGTCCCTCCCTCCTGGCTGGTG 
GCAGACAGAGGGGGCGTGGAGGAACAGGGCAGGGGGTGGGCTTGAGGCTGGACTTTTGTCTGGAAACTTGAACCTCCACCTATGCCCCCCACGTCCC 
TCTAGTCTTCCACATCTCCCTACACCCTCCCTACTATTGGGGCAGGGAATCAGGACTCCCTGAAAATTAAGAACCATCGCCTTGGCTCTGCCACAGG 
CTTGCTGTGTAGCCCGGAGACAGGCCCTGCCCCTCTCTCAACCTGTGAAATTCAATGTTTTAAAAGTGCTTCTTCCTTTGTGCCCAGGGCTGGGCCA 






GGC TGTGCACTGGGAGGGAAGTG CTGGGTTCTCTGAGGTTTGTGAAAT AGCTGG TAG CACCCTTCCCAGTGGGTGCTTAAGAGTTGGGGG CTG TAGC 

ATCTGAAATACCCAAGTCAGTGCAGGCATAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGACCGAGGCAGGCGGATCACCTGAGGTTGGGAGT 

TCGAAACCAGCCTGGCCAACATGGTGAAACCCCCATCTCTATTAAAAATACAAAAGTTAGGCCGGGCACAGTGGCTTACGCCTGTAATCCCAGCATT 

TAAGAAGGCCAAGGCGGGAGGATCACCTGAGTTTGGGAGTTTGAGAGC 

CTAGCCAGGCATGGTGGCACATGCCTGTAATCCCAGCTACTCGGGAGGCT^^ 

CGAGATCGTGCCATTGCACTCCAGCCTGGGCAACAAGAGCAAAACTCTGTCTCAA 

CGCACACCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATC^CTTGAACCCAGGAGATGTAGGCTGCAGTGAGCCGAGAT 
ACTCCAGCCTGGGCAACAGAGTGAGACTCTGTCAAAAAAAAAAAAAAAAAACGTC^ 

TCACCTGTACCCCTGTGAGCCAAGATTGCAACAGTACATGCTAAGAGCAGTAAGAAGGAGTTTGTAAAGCTTAGAAAGACCTGACAGGCCAGGCGCT 

GTGGCTCACGCCTGTAATCTCGACACTTCGGGAGGCCGAGGCAGGCGGATCACCAGGTCAGGAGTTCAAGACCATCCTGACCAATATGGTGAAACCC 

CGTCTGTACTAAACATATAAAAATTAGCTTGACGTGGTGGCCCGCGCCTGTAGTCCAGCAACTCGAGAGGCTAAGGTGAGGCGGAGGTTGCAGTGAG 

CCGAGATCGCCACTGCACTCCAGCCTGGCGGCAGAGCAAGACTCC^ 

ACTAATATTGCCATTATCATTTAAAAAAACCCCAGACACAGGTTTTTCAGGGA^ 

AAAGGGT AGGAAAT CTGGAGTTT GAC AATTCTGAGTTTGAATTTCTTGTGATGGGAT CTTGG GCAAGT CATTTAACC TC CCTG AGTAT CATTTTTTT 
CTTT TATAAAATGAAGATTTTTCTCTCT TAACCT TCCAGAGCTGTTTTAAGGAT TAC AAATCTTCTAC TGAAAGGCGTAGCAC AGGAGCTGTG CTTG 
GCAAGTGCCAAATACAAGGCATTAATTATTATTA TTAG AATT AATAAT AAT ATC CCCTC CCT C TTACAC ATTC TTTGTC TCCGGGTGGATTAAAAGG 
TGGAAGGAGAGGCTACCAACACCATCAGAAGAGAGGCTTCTCTCTAAGTTTCATTTCCCATCTCCTCCAAATCCGCATCCCTCCCAAACGCCGGACC 
TGTAAGGCCAGGAGGGTCCAAGAC^CACATCCTTTGCCCAGCGGGGAAGATTAAAGCTAAA 

CGAATCAGGACAGAAACCAGGACGAGCCTCGGAATCCCTCCATTACCTCCACTTTCACCTGAGCATCACAGCCCGCTCGGGACTCAGTTTCCCCACC 
TACGTGACCACACCACACTAATCAGGGTCTCCTTTTGGAGATCTGCTCTTCTTCTCGAATGGGGGCGCTGCACCATCGGTAGAACAGGGTAGGTGGG 
GGCGCCAGAGGTGAAGGGGACCTGCAGGCTGGGGTCTTCCCCGCCCGGGTCAGCGGGGTCCCTGCGGGGCTAGTCTAAGCGCCTATTATTACCAGCC 
CCCGGGGCGGCGTTGCACTGCGCAGGCGCGGGCGGGGCGCGGGCGCGCGCGCGAGCGAGCGAGGGATTCCCTCTGACGTCATTGCTAGGATACCAAA 
CAAACACTCCGCCGCGCCGGCCGAGCTCCTTATATGGCTAATTGCGTCACAGGAACTCCGGGAAGGCGGGGCCGGGATCCCCTCCCGCCGAGTGGCC 
CGGAACGCAACCCCCGAGACCCCCAGGGCCCCGAGGGTCATG^ 

TTTGGCGGGGCCGGAGATTTGGGGAAGTTTGTCCAGCAAGGGGCGGGTGACGTAAGCAGGGGGGCGGGTCCCGGGCATATAAATACAGGCTGGCGGG 
TCTGTGCTTCATTCATAAGACTCAGAGCTACGGCCACGGCAGGGACACGCGGAACCAAGACTTGGAAACTTGATTGTTGTGGTTCTTCTTGGGGGTT 

TTTTGGTCTTCTTTATTACTCCCCTCCCCCCGTGGGACCCGCCGGACGCGTGGAGGAGACCGTAGCTGAAGCTGATTCTGTACAGCGGGACAGCGCT 
TTCTGCCCCTGGGGGAGCAACCCCTCCCTCGCCCCTGGGTCCTACGGAGCCTGCACTTTCAAGAGGTACAGCGGCATCCTGTGGGGGCCTGGGCACC 
GCAGGAAGACTGCACAGAAACTTTGCCATTGTTGGAACGGGACGTTGCTCCTTCCCCGAGCTTCCCCGGACAGCGTACTTTGAGGACTCGCTCAGCT 
CACCGGGGACTCCCACGGCTCACCCCGGACTTGCACCTTACTTCCCCAACCCGGCCATAGCCTTGGCTTCCCGGCGACCTCAGCGTGGTCACAGGGG 
CCCCCCTGTGCCCAGGGAAATGTTTCAGGCTTTCCCCGGAGACTACGACTCCGGCTCCCGGTGCAGCTCCTCACCCTCTGCCGAGTCTCAATATCTG 
TCTTCGGTGGACTCCTTCGGCAGTCCACCCACCGCCGCCGCCTCCCAGGTAAGTTTTTGATAGTAGGGGTGCTGCTTTGTAGGTTTTATTTTTTAAG 
TCAAGGGTGAAAAGAATAAA CCC C A ACC C CC ACA AAAAGGCGC ATC AG AACC CT AGATCTGAG ATGGAAAAGG CTCACAGCGCACTTTGCAAA CTGC 
AAAGAGTCGGGAGATGTTTGCAATTGGTTGCGTGCGTGGAGCGCAAGGAGGGAACGCGGCAGGGAGGGTAGGCTTTGGGGCGAGGTGGGGGTGGGGT 
GGGTAATGCGCTGCTCAATGCAACGTGTATGCGGTAGCGGGGCTGAGAACTTTGAGCCGGCCCCGGGACTGCCCCCTGCTCGGGTCCCAGACCTGAA 
GCTAGCGCAGTTAGGCAGGTGGGGGAAATCCCGGGGAAGCTTCCAGCAGTCTCTTTTCCTTTCCTCCTCCTTCGGAGCGCCCACTTCGGTGCCGGGT 
CGCCCTCCACCCATCGGGAAGAGGGGCCTCGAACCCTCAGCCGTO^ 

GACGCGCTGGAATCCTCCGTCTGACGCGGGGCACGC^CGGCGCGCGGCGCCCCCTTCGTCCGCCCCGCCCCTGACGTCCCGGGAGCGTTCTATTTTG 
GAACGCCGGGGCCACGTTGCTAAGGGAGGGGGCAGCCCGGCGTTTCGATTC 

GTAGGGTGCATTTTCCTTCCCCCCTCTCTGTCCCCGGAACCCGTGGTTCCTTGGCGGCTGGGTCTCTTTTCGGCGCCTCTAGAGGCAGAGGGAGGGG 
ATCCCTGTCGTGACAAGAGCGCCTGTCTGCGACCCAATGGATCTGCGAGGCCCTTGCGGGGATCTAGTCCCTGGGCTCTCAGGAGAAGGGGGTGTCT 
GCTTGTGTGCTGGCGTTTCTTGGAGAGATACGGTGGCTGTCACCCTCTTCTCCAGGCACACACAGACACATTCCCACTCCCTCTGCTCCTCATGCCC 
GGTTCCTCCGGTGTGTCCCAAGACAGGACTAGAACCGCGAACCGAAGGGCAACC 

GACTCTGATTGGTCGGCGAGCCAGCCCTTCCCTCACCACGCCCCCCGAGAGAGTAGTTAAGCCTTCAGAGCAGTTCCAGGAGTCCATTTACGGGAGG 
GGGGAGATGAGCGCTGCTGAGGCTTGGGGGCTCAGGTCCCGCACCATTCCCCCTCCGCGACAATCTGAGAGAGCTCCAGTGGTTACTTTTATCTACC 
TGTCCGTTCACCCTAAACTGTCACTCGTCAGTCTCACTCTGAGAAGAGACAGTAACTTGAAACGTTGTTCTAAACTCCTAGGCCCGTCCCCCAAACA 
CCCTTTTGACTGGGACCCCCGCCCCTGCATGGGACCTCGCGCAGAGGGGGGTGTGTATGTGTGTGAGTGTAGAGGAAGGCTTGGCCTAAGGCCTCTC 
CTTCTCCCTCCCCTTGCCTCTGGGGTGGGGGTGGGGTGTTGTGGCTGTGTGTGTGGCTGTGGCTCCGTCCCGGGGGTTCTGTCACCCGGCTGTGTCC 
AGCCTCCTCTCCACCCCCCATACCTAAGAGTCACCAACCCGGGGTGTGATTCACCACCCGCTGGAACCGTGCAACCTTTCCCCGAGGAAGAAGGAGG 
AGGTAGAAGCCAGTTGAGCAGAAATCCTCTCATTAACCACTGCGTC 

TCCACACTCGCTCACCCTGTGCTCACTCACGGGGTCGGTGTGTGTTATGTGTGTTGGGTGTGTGTGTGTCGGTGTCTTTGTTTGTGTGTCTACGCCT 
GTGTGTGTATGTGTCACCCCGTAGGAGTGCGCCGGTCTCGGGGAAATGCCCGGTTCCTTCGTGCCCACGGTCACCGCGATCACAACCAGCCAGGACC 
TCCAGTGGCTTGTGCAACCCACCCTCATCTCTTCCATGGCCCAGTCCCAGGGGCAGCCACTGGCCTCCCAGCCCCCGGTCGTCGACCCCTACGACAT 
GCCGGGAACCAGCTACTCCACACCAGGCATGAGTGGCTACAGCAGTGGCGGAGCGAGTGGCAGTGGTGGGCCTTCCACCAGCGGAACTACCAGTGGG 
CCTGGGCCTGCCCGCCCAGCCCGAGCCCGGCCTAGGAGACCCCGAGAGGAGACGGTGAGTAAGGGACATCAGAACTTGGCCTGGGTAGGGGGAAGCA 
AGAGAGGCAGGAAGTTTCTTATGAATGGAGGGGGGCTCCACTAAGGCCTCAGTGTTACAGAAA 

GGCTTGTTTTTTGGCTCTTGGGGGTTCTGAAAGAAGTAGAGGTTCGGGATGGGTGGAGGAGTGCTGTATCCCCAAATCTCATGGCCTCTATCTCCCT 
GACTCAGCTCACCCCAGAGGAAGAGGAGAAGCGAAGGGTGCGCCGGGAACGAAATAAACTAGCAGCAGCTAAATGCAGGAACCGGCGGAGGGAGCTG 
ACCGACCGACTCCAGGCGGTGAGGACAGGCCCTGGGGTGGGAGAGGGGATGTTGAGGGGAGCTCTCTCCCCATTCTCTGCCCCCTCTCCACCT 
CCTTATCCTGGGTTGAGAACTAGACGTTCCACACATGGAACTAGGTACTGCTC 

ACTTAAGTCAACTCCTGGTCCCCCCCCCATTTCCTGACCCCACGGACTACTCTCCTAGCCTTCATTTCATCCCAAGGGGCCACATGGGGCCCTTGAG 

GAGAGGGGCGCCCCCATCATTTCTCCCATCTGGTCTTCAGCAAGTAACAACCCATTTTGCCTCAGTTTCTCCATCTCTGCAAACCCTCATCA 

CCCGGGCTCTTTCTACCCTTAACACTCTGGAAAGCCTGTGAAATGAA^ 

CATCCCCCAAAACCCACTCCCTTCCTACGTCCTCCCTTGGTC^ 

GGG AATTGT CAACTTTCCTT CTC TA CTTTTT CTCTTC CCCGGG AGGTAGAG AGGGAGGGGTAATAG AAGGG AAC ACATTAAAAACACATAACAG TGG 

CTCATGCCTGTGATCCCAGCACTTTGAGAGGCCAAGGCAGGAGGATTGCTTGAGCCCAGGAGTTTGAGACCAGCCTGGGAAACATAGGGAGGACTTG 

TCTCTACCAAGAAAAAAAAAAATTAGTTGGGCATGGTGGTGCACACCACTGTGGTCCCAGCTACTATGGAGGCTTTAGTGGGAGGATCGCTTGAGCC 

GAGGAGGTCCAGGCTGCAGTGAGCCATGATTGCACTATTGCAC^ 

ACATAACGATTGTTCATTCATTCAACAAACCTCCTGCAC^ 

AAACTGAGGTTCCTGGAAGGCAAGACCCTTGCCCCAACTCCCATGGCCACCAGGACTCCATCTCAGGAGGTGTTTTTTCTCAGCCCGGGGCTGCCTT 
CC^GCAGCAAGTCCAAGGGAGCCATGACCCGAGTTTCCCGGTCACTGACATGCTTTTTTCTCCTTCCTCTCTCTCTTCTGTGACCTGGCCTCCCTGG
CCTCAGGAGACAGATC^GTTGGAGGAAGAAAAAGCAGAGCTGGAG 

TGGCCCACAAACCGGGCTGCAAGATCCCCTACGAAGAGGGGCCCGGGCCGGGCCCGCTGGCGGAGGTGAGAGATTTGCCGGGCTCAGCACCGGCTAA 
GGAAGATGGCTTCAGCTGGCTGCTGCCGCCCCCGCCACCACCGCCCCTGCCCTTCCA^ 

TTTACACACAGTGAAGTTCAAGTCCTCGGCGACCCCTTCCCCGTTGTTAACCCTTCGTACACTTCTTCGTTTGTCCTCACCrGCCCGGAGGTCTCCG 
CGTTCGCCGGCGCCCAACGCACCAGCGGCAGTGACCAGCCTTCCGATCCCCTGAACTCGCCCTCCCTCCTCGCTCTGTGAACTCTTTAGACACACAA 
AACAAACAAACACATGGGGGAGAGAGACTTGGAAGAGGAGGAGGAGGAGGAGAAGGAGGAGAGAGAGGGGAAGAGACAAAGTGGGTGTGTGGCCTCC 
CTGGCTCCTCCGTCTGACCCTCTGCGGCCACTGCGCCACTGCCATCGGACAGGAGGATTCCTTGTGTTTTGTCCTGCCTCTTGTTTCTGTGCCCCGG 
CGAGGCCGGAGAGCTGGTGACTTTGGGGACAGGGGGTGGGAAGGGGATGGACACCCCCAGCTGACTGTTGGCTCTCTGACGTCAACCCAAGCTCTGG 
GGATGGGTGGGGAGGGGGGCGGGTGACGCCCACCTTCGGGCAGTCCTGTGTGAGGATTAAGGGACGGGGGTGGGAGGTAGGCTGTGGGGTGGGCTGG 
AGTCCTCTCCAGAGAGGCTCAACAAGGAAAAATGCCACTCCCTACCCAATGTCTCCCACACCCACCCTTTTTTTGGGGTGCCTAGGTTGGTTTCCCC 
TGC ACTC CCGACCTT AGCTTATTGAT CCC ACATTTCCATGGTGTGAGATCC TCTTTACTC TGGGCAGAAGTGAGCC CCC CCCTTAAAGGG AATTCG A 
TGCCCCCCTAGAATAATCTCATCCCCCC^ 

TCTGATTGGAATTTCTGGCCTCCTAAGGCTCCCCACCCCGAAATCAGCCCCC^^ 

CCAGCCGACCCTCCTGGCCTTCCTCGTTGGGCCGCTCTGATTTCAGGCAGCAGGGGCTGCTGTGATGCCGTCCTGCTGGAGTGATTTATACTGTGAA 
ATGAGTTGGCCAGATTGTGGGGTGCAGCTGGGTGGGGCAGCACACCTCTGGGGGGATAATGTCCCCACTCCCGAAAGCCTTTCCTCGGTCTCCCTTC 
CGTCCATCCCCCTTCTTCCTCCCCTCAACAGTGAGTTAGACTCAAGGGGGTGAC^^ 

CTCTCTCTCCTCAGGACCCTCAGCCCTGGCCTTTTTCTTTAAGGTCCCCCGACCAATCCCCAGCCTAGGACGCCAACTTCTCCCACCCCTTGGCCCC 
TCACATCCTCTCCAGGAAGGGAGTGAGGGGCTGTGACATTTTTCCGGAGAAGATTTCAGAGCTGAGGCTTTGGTACCCCCAAACCCCCAATATTTTT 
GGACTGGCAGACTCAAGGGGCTGGAATCTCATGATTCCATGCCCGAGTCCGCCCATCCCT^ 
CGCTTCATCTCATGAGGATTTCTTTATGAGGCAAATTTATATTTTTTAATATCGGGGGGTGGACCACGCCGCCCTCCATCCGTGCTGCATGAAAAAC
ATTCCACGTGCCCCTTGTCGCGCGTCTCCCATCCTGATCCCAGACCCATTCCTTAGCTATTTATCCCTTTCCTGGTTTCCGAAAGGCAATTATATCT 
ATTATGTATAAGTAAATATATTATATATGGATGTGTGTGTGTGCGTGCGCGTGAGTGTGTGAGCGCTTCTGCAGCCTCGGCCTAGGTCACGTTGGCC 
CTCAAAGCGAGCCGTTGAATTGGAAACTGCTTCTAGAAACTCTGGCTCAGCCTGTCTCGGGCTGACCCTTTTCTGATCGTCTCGGCCCCTCTGATTG
TTCCCGATGGTCTCTCTCCCTCTGTCTTTTCTCCTCCGCCTGTGTCCATCTGACCGTTTTCACTTGTCTCCTTTCTGACTGTCCCTGCCAATGCTCC
AGCTGTCGTCTGACTCTGGGTTCGTTGGGGACATGAGATTTTATTTTTTGTGAGTGAGACTGAGGGATCGTAGATTTTTACAATCTGTATCTTTGAC
AATTCTGGGTGCGAGTGTGAGAGTGTGAGCAGGGCTTGCTCCTGCCAAC<^CAATTCAATGAATCCCCGACCCCCCTACCCCATGCTGTACTTGTGG
TTCTCTTTTTGTATTTTGCATCTGACCCCGGGGGGCTGGGACAGATTGGCAATGGGCCGTCCCCTCTCCCCTTGGTTCTGCACTGTTGCCAATAAAA
AGCTCTTAAAAACGCATTCGCCAGGCTACAGTGTTTATTTCCTCCTAACACCCATTGCCTGGCTTCCTTTTAGTAAGAGAGGATGAGGTTAGATTGT 
CGAAGGACTACACACACACAGAGATCATACACCCATGCACACACACACACAGACACACACCTATGCAGGAACTCCTTCTAGTCAATGGGTCTACAGC
AGGATAAAAGGAAGGCAGACAGTCAGGAGGACTTCCTGTACTCGCAGGCAGTGCCTCCT^

TTGCGCCTGTAATCCCAGCATTTTGGGAGGCCAAGATGGGCGGATCACTTGAGGTCAGGAGTTTGAGACCAGCCTTGCCAAGATGGTGAAACCCTGT 
CTCTACAAAAAATACAAAAATTAGCCTGGTGTGGTGGTGCACACCTGTAATCCCAGGCTACTTGAGAGGTTGAGGCAGGAGAATCACTTTAACTTAG 
GAGGCAGAGGTTGCAGTGACACAAGATTGCACCACTGCACTCCAGCCTGGGTGACAAAGTGAAACTGTCAAAAAAAAAAAAAAAAGTAAGG

AAG GTTAGC CTGC AGGT GTGGGATC ACAGAGGTCT CCT AGTGATGG AGCTTGT ATG CTCT ATGG GTT AAA AAC AGACGCTAAGG AGATAAACT ATAC 
ACAGAAGAAGCTTAATGGGCTGTGCACAGTGGCTTGCACTTGCAATCCCAGCTCTTTCGGAGGCCGAGGTAGGAGGCTAGGAGTTCGAGAACAGCCT
GGGGCAACATAGTGAGACACCCCCCACCCCAACCCATCTCATTATGTTGAGAAAAAAAAAAAAAAGAGGATCTTGAGAAACT
TAAAGACCACTCAGGCTAGGAATGAGGCGT<^TCTTGGATTTTTTACAAACT^

TTGTTTTCCATATGACCACTATTGGGGCCAACCCAAGACTGACTGAAGCAAATATATGACCATTTTTTTTTTTTACACATAGAGTGGAGCCATCCCC
ATTGCyU\AATCCTAGCCGTCTCTTATTTG<^TAACAGCAAGTCCTCTGGGAATTGAGGTGAGGGTCCAGGGGTCGCTCAGACGCTTTGGTAAATCAA
T AAG CAAGGGAATTTGTC ATGGTGG AAT C CATTT ATTAATTC ACT C AA AAAAATGTGCTAAGCACCTACCCAACTGTGC TGGGAGTTGGGGAC ACT A 
TAGGGACTGAGCCTCAGGTCTTGCCCTCCTGGGGCTCACAGTCCATTGGGGGACACACACCGTTGCCAGTCAGGGACAACCTAAAGTGGATTGGGTT 
TGGGAAG CCCATGGGGCTAGGGAAG CACAGCAGGGACACC TAGCTTGGA CTGGG AGTCGGGAAAAACTTC CTGGCGG AGGGG ACAGCTGAGCT GAGT 
CCTAGAGGAGGATAGAGAGTATAGAAAGAAATAGAAAAAGGCCAAGGCCAAGAAAATAGCACAGCCATAGATCTGTTGGTGGGGAGAAGGCTTGATA 
C ATTGAGAACG CTT AGAGGGT CGGGTGCGAT GGCTCATGC CTGTAATCC CAGCACTTTG GGAGGCCGAGGTGGGTGGAT CAC CTG AGGTCAGG CATT 
CAAGACCAGTCTGGCCAACGTGGCGAAACCCCGTCTCTACTAAAAAATATATATATATATATACAAAAAATTAGCTAGGCATGGTGGTGGGCACCTG 
TGATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGAAAGTGGACATTGCAGTGAGCTGAGATTGTGCCACTGCACTCCAGCCT 
GGG C AAC ACAG CGAGACTCTGTCTC AAA AAAAAA AAAAAAAG AAAGAAAAAAAAGAG AAAACTCAG AGATT CGTGG AG ACTGG AAC CACGGGTGT GG 
AGAGAGGGGTTAGTAGAGACCAGATTCTGCAGGTACTATAATGACATTCCCAGGCTAAGGAGTTTAGATCTTTTCTTCAGGGCACTGGGGAGCTATT 
GC^GGTTTTGAACACGAGAGGGGCAGGGGCAGGTTTGTGATTTAGGAAAGACCCCTCTGGCCCCAGACTGAGCGTGACTGGAAGGGAGAGGAGTGAG 
ACTGAGCAGGAGACCAGGGAGGAAGTGGGGTGGGGACCCTGCAGGGTGGGGTGAGGAGGAGTCCAGAACAGGGCCAGGGACATGGGACAGAGAGGAG 
GGGAAGGATTCAGGAGACACTCGGGAGGCAGGACAGTTGGGCTTGGTGAGCAGCCGGCTTGGGAAGGAAAATACAGGGCAGTAATAAATAGCATGGG 
T CAAGTGGTCTGAGTGAGGCTGGGAG TGG TGG CACGCACCTATGGT CCCAG CTA CTC AG G AGGCTG AGGTGGGAGGATTGCTTGAGCC TGG AAGTTT 
GAGGCTGCAGTGAGCTATGATCGTACCGCTGCACTCCACTTTCGGTGACAGAGTAAGACCCTGTCTCAAAAAAAAAAAAAAAAAGGAAAAAAAAAAA 
AAGACACTGCCTATGAGTTAACCCTGCTCTACAAGGAGCAGTTTTTAAAGTAAAATTAAAAA 

G AG AGAAAAAAGAAAAAAGGCTGGGTTG CCT TGG ACAACAGACTT C ATCTC CCTG AGCCTCCATTTCCTCATCTG TAGAATGGGGGCTGTTAAGAGG 
AGT TGC AAGGCTTGTGC ATGC CAGC AGT AAGTGC AGAGTGACGGTGCAATTATCATTACCCCCATCAT CTTTATTGGGGTCAG CCTGAAC CCT CGAT 
ATCCCATAATATTCACCCCCATCCTTCAAGGGTCTGCCCTAATGTTCCCATGACACCCGACCAGCTCAGCTCTCCTTATGAGAGGGCCTCACTTTT 
T ATT CC CTTTG CAG C CGTTAT CCCC TTTGTAGTTGTTAATTAAGTG TGT AATTACAT GATGGCTTAATGTT TGT C TCCCCC ACTGGGCTGACTGCAC 
CAGGAGACCAAGGCCAGGGCTGTCACCCACCGCTGCGTCCTCAGCAAATGCTTGTTTGATGAGTGAATGACAGATGAACAAATGGGCAAGTATTTGA 
CTGATTAATCACGGCATGCX3TGATTAAATAAATGAGTGAGCAAACGACTGAATGAGTA 

G CC AAT AT CCCAGAACTGGGTAG ATGACTTTTTCTCCGTC TTTTGG AGTGGGTTTTTATA CTCCGGAGGGGGAAATAAGT AA CGAC AATC AGAG AT C 

GGAGTG ACAGAGAATGAGTTTGCGATGCGGAAGGC CGCAGCGACCT CTC CAGGTGCAGGC TGGAGG AGCGC CTCCCGGGCAACAAGCAAAATAAGTC 

CGGTTCGATAAGTAAGATTCAAGGCGCTTAGTTACTACCGCCCGAAAGGTGGAATTGAACCACTCTGTCGCTAGACAGCTACAGGTTTGAAGCCTGC 

ACCCCAGACCACTGAGGATCATCCGGGCAGGACAACCTTCTTCCCGCGCAGCTATATAAGGATCGCAAAAGCTTACCTTTTAGAGTTATGGTCGCGT 

CCTTTGGAAAGGTGTGAGATGGTTGACTTCAAGGGCTATAAGTTCTCCCCACTCATTTATCGTGAGAGATGTCCTTGTAAAACGTT 

ATATAGTATGAATCATCCTCGCTCCCCAAACCGTCTCATTTACATAGCGCCACGCATTCTGGGAAGCAGCGTTTGAGCACCTGGCAAGCTGTTAAAG 

GGCCCAGGACCCCCTTTTTCCAAAATGAAAAGTCAGCTTTTTAAGAGGAGAGATTCATGATTCGGCATTCAGAAGCGCCTTCTCGGAAAATTCCGAA 

TTCTCTTTCTGCGGAGGAGTGGTGAGTGGGTTCCTGTTAGAGTCTGATTCACCCTGGCCCCAAAGGCGCTGAGCAGGCGGCTATAGGGCTGGTGCGA 

CACCTGCTGTTCTTTTATGGAATTGCGTAGGCCGATGGAGCTCCGCGTTCATTCAAACGTTTATTGAGCGTCTACTGAGTCCCAGGAACTGTCTCGG 

TGCTGGTGACGACAGACATCCTGTCCCGATTGAGTTTAAACACTAGACATAACCAATTTCTGCACTTGCGTTTTATGGTTGGAGAAACTGAGACAGA 
CTCAGAAACGGATAGTACCATATTCCGGTTAAPJ^CAGCCCATCCACTTAGGGAAATCGAACTTGTGTTACAATTTTTATTACAAAAAAGGTGATTCT

TTTTTCTTAAATTTTTACATATTTAGGAGGTACAGGTGCCGATTTCTACATGCATATGTTGATTGCATCGTGGTGAAATCTGGGCTTTTAGGTGTAC 

TCATCACGAAAAATTATTCAATATTCTAAGTCAGTGATTCTTTGACAATACC^ 

AGTCAAAGTACTCACCCTGCTTTACAAATCCCTGTAATCCAGCCCCT^ 

GGATTTAGGCACACAGATGTCTTGCTGCACACATGGATCATGCTACCACCACA 

CATGTTCCCATGGCTCATCGCCTCCCTTCACTCT^ 

CTGATCCCAGTTTAAAATAGCGAGTGGCGGCCGGGCGCGGTAGCTCACGCCTGTAACCCCAGCACTTrGGGATGCCAAAGCGGGTGGATCACGAGAT 
CAGGAGATGGAGACCATCCTGGCTAACACGGTGAAACCTTGTCTCTACTAAAAATACAAAACATTAGKZTGGGCGTGGTGG 

AGCTACTCGGGAGCCTGAGGCAGGAGAATGGAGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCTGAGATTGCTCCACTGCACTCCAGCCTGGGGGAC 
AGAGCG AGG CT CCGT CTCAATAAAT AATAAATAAATAAAATAGCT AGTGGCTGG GAT CGGTGGCTCAGGGCCTGTAATCCCAGAACTTTGGGAGGCC 
GAGGCGGGTGGATCACCTGAGGTCAGGAGTTCGAGGCCAGCCTGGCCAACATGGTGAAACC^ 

AGT GGCTC ACGC CTGTAATCCCAGCTAC TCGGG AGTTTGAGGCAGGAGAAC CGCTTGAACCGGGG AGG CAGAGGTTGC AGTGAG CTGAGATGGTGCC 
ATTGTACTCCAGCCTTGGTGAAAAGAGCAAAACTCCATCTCCAAATAAAAGTAATAATAATAATAAATAAAATAAAAATTTTTAAAAAGCGAGCTCT 
GTCACTCTATCCCCTTACCCTGCTAGACATTTCCCCACCACTCTGATCACCACCTGCCATTTTCTGTGTCTATTTGCTTACCGTCTGTCTGCTTCTG 
T AG AAT ATC AG C AC C ATGAAACT AT GCACTTT ATTTTTGATC ATTGCTGTATCTCTAGTGCCTAAAAAG TGCT TGAGAACATAGCAGATGTTCAGTA 
AATGTTTGTGGAATGAATAAAAGAATATCATCACTGTCTTTTTTTCATTCTTCTTCCAGACAGGATCTTACTTTGTCACCCAGGCTGGAATGCAGTG 
GCGCAAAAACGGCTCACTGCAGCCTCGACCTCCCAAGCTCAAGTGATCCTCTTGCCTCAGCCTCCCTGGGATGACAGGCATATACCATCACGCCCAG 
CTAATTTTAATTTTTTTTGTAGAAACAGGGGCCTCACTTTGTTGCCC^GACTGGCCT^ 

TAAAGTGCTGGAATTTCACGCGTGAGCCACAGCACCTGGCCTCACTATCCTTCTTTCAGCCTCAGTTTTCTCATTTGTATAACCAGACTAGTACAAC 

TGATCTCACTGGAGAAATCATGATATAAAATTCTGACACTGGCTGAGGCAACTGGGAGGAGCTCAGTAAAGGCTGTTTCTGCTGGGCACGGTGGCTC 

ACACATGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGATCACAGGAGATTAGAAGTTCCAGACCATCTGGCAAGCATGGTAAAACCC 

TACTAACAATTCAAAAAGTAGCCAGGCATGGTGGCTCAC^CCTGTGATCCCAGCTACTCGGGAGGCTAAGGCAGGAGAATCCCTTGAACCCAGGAGG 

CTGAGGTTGCAGTGAGCCAAGATTGTGCCACTGCACTCC^TCCTGGGCGACAGAGCAAGACTCTGTCAAAAAAAAAAA 

CTGGAATTACAGGCGCCTGCCCCCaCACCTGGCTAATTTTTGTTTTTTGTTTTTTTAGGAGAGACGGGGTTTCACCATGTTCACCAGACTGGTCTTG 
AACTCCTGACCTCAGGTAATCCAACTGCCTCAGCTTCCCAAAGTGCTGAGATTACAGGCGGGAGCCACTACACCTGGCCAATAAAGGCCGTTTCAGT 
CTTCAATCTGTTTTGAGCTTGGAGGCTTTAGTCATTCCCAGACCCAAAATCTCAATCAGACCCTCTTCCACCACTTTTTGTGATAGATCAATAAACA 
TTTTGTCTTATGGGAAGTTTAACTAAGAGTATCTTTAGAAAGTTTTGGACAGGCGCTGTAATCCCAGCACTTTGGGAGGCCGAGATGAGCGGATAGC 
TTG AGC CCAGG AGT TAG AGACAAGCCTGGGCAACATAG TGAGACT C TGT CT CAAAAAAAAAAAAAAGAAAG AA AGGAAAAGAAAAAAAGAAAAAAAA 
AGTATTCCCTCTGCTTGGCAAATCCAGATTCAAGATATATCCCCTAAACCCTCTTTGTTTTAATTAGATAGTTGCTGCTAGGCACCTCTGTATACAA 
ATTCTGAGGATGTAAGGACTCCTTGGAACACCGTTATCTGTCTCCTAATATGTGACGTGTGTATGACGAACAAGTGAGTTTTTTTTTTTTTTGTTTC 
TTTTTTTTGGTTTTTTGTTTTGAGACGCAGTCTCGCTCTGTCGCCCAGGCTGGAGTGTAGTGGCGCGATCTTGGCTCACTGCAAGCTCTGCCTCCCA 
GGTTCACGCCATTCTCCTGCCTCAGCCTCCCTAGTAGCTGGAACTACAGGCGCCCGCCACCACTCCCGGCTAATTTTTTTGTATTTTTAGTAGAGAC 
GGGGTTTCACCGTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCATGATCTGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTAAGCCA 

GTTCAAGCGATTTTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCATGCACCACCACGCCCGGCCAATTTTGTATTTTTAGTAGAGACATGG 
TTTCTCCATGTTGGTCAGGCTGGTCTCCAACTCCCGACCTCAGGTGATCCACCTGCCTTGGCCTCCTAAAGTGCTGAGATTACAGGCGTGAGCCACT 
GCGCCTGGCTGCACAAACATTTTAAACGTCAATATTTTTGTGTTTATTTTTACTATTGTCTTCTATTTCTGGCAAGCAATACTGGTTTCTGGTGGCA 
AGGTAACCTTTCTTCTTAAAATACATTTGTTATTATATTATTAAATATTAAACCAATTTTCAAGAAAAATATTAGATAAATAGCAGAACAGGTTGTA 
GAAAGATGAAGGTCATGTGGGGAATGACTGAGGTTTGGGAAACAATATTATAACTCCTTACTGTGCATTTATTATATGCCAGGCCCAATGCCAAGGG 
CTTTACATGTAGGTATAGTTATGGTTGACAGTTTTCTTCTTTTTCTTTTCTTTTTTTTTGAAACAGGGTCTTGCTCTGTCGCCCTGTCTGGAGTGCA 
GTGACGCAATCTCAGCTCACTACAACCTCCACCTCCTTAGTTCAAGTGATTCTCCTGCCTCAGCCTCCCGTGTAGCTGGTCTTACAGGTGCCCACCA 
CCACATCCGGTATTTTTAGTAGGGACGAGGTTTTACCATGTTGGCGAGGCTTGTCTTGAACTCCTGACCTCAGGTGATCCACCTTCCTTGGCCTCCC 
AAAGGCTGGGATTACAGGTGTGAGCCACTGCACCTGGCCTCTTTTTTTCTTACT^^ 

TCGAACTCCTGGCCTTAAGTGATCCTCCCTCCTTGGCCTTCCAAAGTGCTGGGATTGCAGACGTGAGCCATCGTGCCCGGCCTAATAAGTTTTCCAT 
TTGAGG AAA CTGAGGCTTTGAGAGATGAAGTCATATGCTCAAGGTCATACAGCGAAGGGGTAGTAGAGCCAGGATTCAAACCCAAGTCTGTGGATCT 
TTAGACCTCAGTGTTCTCCCAGTCTCCCCACGGGTCCCTTTGCTTCATCCTTTCCGAGTTTAGTTGTCTTTTTTGTTGTTGTTCTTGTTGTTGCTGT 
TTTTTGATAGAGTCTCGCCCTGTTGCCCAGGCTGGAGTGCAGCG 

G CCT CAGG CTC CCG AGT AGCTGGGA TTA CAGG TATGCGCCGCCAC A CC CAGCTAATTTTTGT ATTTTT AGT AG AGATGGAGTTTC ATC ATGTTGGT C 
AGGCTGGTCTGGAACTCCTGACCTCAGTGATCTGCCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTACGCCCAGTATGTAGTT 
TTTTTGTGTTTTGTTTTTGTTTTTGTTTAGAGACAGGGTCTCCCTCTGTAACTCAGGCTGAAGTGCAGTGGTGTGATCATAGCTCTCTGTAACCTCG 
AACTCCTGGCCTCAAGCAATTTTCCTGTCTTAGCCTCCCAAGTACCTGGGACTACAGGCATGCACCACCACGCCCGGATAAATTTTTTTTTTTTTTA 
GAGACGGGGGTCTTGTTATGTTGCTCAGGGTGGTCTCGAACTTTTGGCCTCAAGTGATCTTCCCACATTGGCCTCCCGAAGTGTTGGGATCACAGAT 
GTGAGTCATCATGCCAGGCCTCAGATTAGTGTTGAAACTGGAAGTCCAGGGAGGCCCTGTATTTGTTCCCGAGGCAATCAGGGTGGGGAGACGTGGG 
TGGATGAGAGAAGTTTGTGAGGCAGGATAGACACAGCATTGTGACTGATTGTTTGTGGGAGGTGAGTGTGTAGAAGTCCAGAGCCATACCTGGGTGT 
CTGGCCTGGTGACACCGTGGGCAGTGAAAACATCTCTGAGTTGGAAATCAGGTAGGATGAAGAAGACGTTTGGTTAGATTTAGGGTGGAAGGGGCAT 
CCAGAGGCCGTTGGACACGTGGGTTTGGGCTCAGTTGAGGTGGCTGCACAGTAGGCAGGGTGTGGGATCCATTAGTGATGTCTGCCTTGAGCACAGG 

AGAATCTCGCTCTGTOSCCGAGGCTGGAGTGCAGTGGCGCAATCTTG^^ 

CTCCCGAGTAGCTGTGATTACAGGCATGCACCACCACACCCGGCTAAATTTTGTATTTTTAGTAGAGATGAGGTTTCACCATGTTGGCCAGGCTGGT 
CTTGAACTTCTGGTCTCAAGTGATCCGCCCTCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCGCCCGGCCTCCCCTCAATCA 
T CATAA C CT ACACAG ATGGC AGAG 

HUMAN SEQUENCE - mRNA 

CATTCATAAGACTCAGAGCTACGGCCACGGCAGGGACACGCGGA^ 

TTCTTTATTACTCCCCTCCCCCCGTGGGACCCGCCGGACGCGTGGAGGAGACCGTAGCTGAAGCTGATTCTGTACAGCGGGACAGCGCTTTCTGCCC 
CTGGGGGAGCAACCCCTCCCTCGCCCCTGGGTCCTACGGAGCCTGCACTTTCAAGAGGTACAGCGGCATCCTGTGGGGGCCTGGGCACCGCAGGAAG 
ACTGCACAGAAACTTTGCCATTGTTGGAACGGGACGTTGCTCCTTCCCCGAGCTTCCCCGGACAGCGTACTTTGAGGACTCGCTCAGCTCACCGGGG 
ACTCCCACGGCTCACCCCGGACTTGCACCTTACTTCCCCAACCCGGCCATAGCCTTGGCTTCCCGGCGACCTCAGCGTGGTCACAGGGGCCCCCCTG 
TGCCCAGGGAAATGTTTCAGGCTTTCCCCGGAGACTACGACTCCGGCTCCCGGTGCAGCTCCTCACCCTCTGCCGAGTCTCAATATCTGTCTTCGGT 
GGACTCCTTCGGCAGTCCACCCACCGCCGCGGCCTCCCAGGAGTGCGCCGGTCTCGGGGAAATGCCCGGTTCCTTCGTGCCCACGGTCACCGCGATC
ACAACCAGCCAGGACCTCCAGTGGCTTGTGCAACCCACCCTCAT^

TCGACCCCTACGACATGCCGGGAACCAGCTACTCCACACCAGGCATGAGTGGCTACAGCAGTGGCGGAGCGAGTGGCAGTGGTGGGCCTTCCACCAG
CGGAACTACCAGTGGGCCTGGGCCTGCCCGCCCAGCCCGAGCCCGGCCTAGGAGACCCCGAGAGGAGACGCTCACCCCAGAGGAAGAGGAGAAGCGA
AGGGTGCGCCGGGAACGAAATAAACTAGCAGCAGCTAAATGCAGGAACCGGCGGAGGGAGCTGACCGACCGACTCCAGGCGGAGACAGATCAGTTGG
AGGAAGAAAAAGC^GAGCTGGAGTCGGAGATCGCCGAGCTCCAAAAGGAGAAGGAACGTCTGGAGTTTGTGCTGGTGGCCCACAAACCGGGCTGCAA 
GATCCCCTACGAAGAGGGGCCCGGGCCGGGCCCGCTGGCGGAGGTGAGAGATTTGCCGGGCTCAGCACCGGCTAAGGAAGATGGCTTCAGCTGGCTG
CTGCCGCCCCCGCCACCACCGCCCCTGCCCTTCCAGACCAGCCAAGACG 

TCCTCGGCGACCCCTTCCCCGTTGTTAACCCTTCGTACACTTCTTCGTTTGTCCTCACCTGCCCGGAGGTCTCCGCGTTCGCCGGCGCCCAACGCAC 
CAGCGGCAGTGACCAGCCTTCCGATCCCCTGAACTCGCCCTCCCTCCTCGCTCGGTGAACTCTTTAGACACACAAAACAAACAAACACATGGGGGAG
AGAGACTTGGAAGAGGAGGAGGAGGAGGAGAAGGAGGAGAGAGAGGGGAAGAGACAAAGTGGGTGTGTGGCCTCCCTGGCTCCTCCGTCTGACCCTC 
TGCGGCCACTGCGCCACTGCCATCGGACAGGAGGATTCCTTGTGTTTTGTCCTGCCTCTTGTTTCTGTGCCCCGGCGAGGCCGGAGAGCTGGTGACT 
TTGGGGACAGGGGGTGGGAAGGGGATGGACACCCCC^GCTGACTGTTGGCTCTCTGACGT 

GTGACGCCCACCTTCGGGCAGTCCTGTGTGAGGATGAAGGGACGGGGGTGGGAGGTAGGCTGTGGGGTGGK?CTGGAGTCCTCTCCAGAGAGGCTCAA 
CAAGGAAAAATGCCACTCCCTACCCAATGTCTCCCACACCCACCCTTTTTTTGGGGTGCCCAGGTTGGTTTCCCCTGCACTCCCGACCTTAGCTTAT 
TGATCCCACATTTCCATGGTGTGAGATCCTCTTTACTCTGGGCAGAAGTGAGCCCCCCCTTAAAGGGAATTCGATGCCCCCCTAGAATAATCTCATC 
CCCCCACCCGACTTCTTTTGAAATGTGAACGTCCTTCCTTGACTGTCTAGCCACTCCCTCCCAGAAAAACTGGCTCTGATTGGAATTTCTGGCCTCC 
TAAGGCTCCCCACCCCGAAATCAGCCCCCAGCCTTGTTTCTGATGACAGTGTTATCCCAAGACCCTGCCCCCTGCCAGCCGACCCTCCTGGCCTTCC 
TCGTTGGGCCGCTCTGATTTCAGGCAGCAGGGGCTGCTGTGATGCCGTCCTGCTGGAGTGATTTATACTGTGAAATGAGTTGGCCAGATTGTGGGGT 
GCAGCTGGGTGGGGCAGCACACCTCTGGGGGGATAATGTCCCCACTCCCGAAAGCCTTTCCTCGGTCTCCCTTCCGTCCATCCCCCTTCTTCCTCCC 
CTCAACAGTGAGTTAGACTCAAGGGGGTGACAGAACCGAGAAGGGGGTGACAGTCCTCCATCCACGTGGCCTCTCTCTCTCTCCTCAGGACCCTCAG 
CCCTGGCCTTTTTCTTTAAGGTCCCCCGACCAATCCCCAGCCTAGGACGCCAACTTCTCCCACCCCTTGGCCCCTCACATCCTCTCCAGGAAGGCAG 
TGAGGGGCTGTGACATTTTTCCGGAGAAGATTTCAGAGCTGAGGCTTTGGTACCCCCAAACCCCCAATATTTTTGGACTGGCAGACTCAAGGGGCTG 
GAATCTCATGATTCCATGCCCGAGTCCGCCCATCCCTGACCATGGTTTTGGCTCrCCCACCCCGCCGTTCCCTGCGCTTCATCTCATGAGGATTTCT
TTATGAGGCAAATTTATATTTTTTAATATCGGGGGGTGGACCACGCCGCCCTCCATCCGTGCTGCATGAAAAACATTCCACGTGCCCCTTGTCGCGC 
GTCTCCCATCCTGATCCCAGACCCATTCCTTAGCTATTTATCCCTTTCCTGGTTTCCGAAAGGCAATTATATCTATTATGTATAAGTAAATATATTA
TATATGGATGTGTGTGTGTGCGTGCGCGTGAGTGTGTGAGCGCTTCTGCAGCCTCGGCCTAGGTCACGTTGGCCCTCAAAGCGAGCCGTTGAATTGG
AAACTGCTTCTAGAAACTCTGGCTCAGCCTGTCTCGGGCTGACCCTTTTCTGATCGTCTCGGCCCCTCTGATTGTTCCCGATGGTCTCTCTCCCTCT
GTCTTTTCTCCTCCGCCTGTGTCCATCTGACCGTTTTCACTTGTCTCCTTTCTGACTGTCCCTGCCAATGCTCCAGCTGTCGTCTGACTCTGGGTTC
GTTGGGGACATGAGATTTTATTTTTTGTGAGTGAGACTGAGGGATCGTAGATTTTTACAATCTGTATCTTTGACAATTCTGGGTGCGAGTGTGAGAG
TGTGAGCAGGGCTTGCTCCTGCCAACCACAATTCAATGAATCCCCGACCCCCCTACCCCATGCTGTACTTGTGGTTCTCTTTTTGTATTTTGCATCT 
GACCCCGGGGGGCTGGGACAGATTGGCAATGGGCCGTCCCCTCTCCCCTTGGTTCTGCACTGTTGCCAATAAAAAGCTCTTAAAAACGC

HUMAN SEQUENCE - CODING

ATGTTTCAGGCTTTCCCCGGAGACTACGACTCCGGCTCCCGGTGCAGCTCCTCACCCTCTGCCGAGTCTCAATATCTGTCTTCGGTGGACTCCTTCG
GCAGTCCACCCACCGCCGCGGCCTCCCAGGAGTGCGCCGGTCTCGGGGAAATGCCCGGTTCCTTCGTGCCCACGGTCACCGCGATCACAACCAGCCA
GGACCTCCAGTGGCTTGTGCAACCCACCCTCATCTCTTCCATGGCCCAGTCCCAGGGGCAGCCACTGGCCTCCCAGCCCCCGGTCGTCGACCCCTAC
GACATGCCGGGAACCAGCTACTCCACACCAGGCATGAGTGGCTACAGCAGTGGCGGAGCGAGTGGCAGTGGTGGGCCTTCCACCAGCGGAACTACCA
GTGGGCCTGGGCCTGCCCGCCCAGCCCGAGCCCGGCCTAGGAGACCCCGAGAGGAGACGCTCACCCCAGAGGAAGAGGAGAAGCGAAGGGTGCGCCG
GGAACGAAATAAACTAGCAGCAGCTAAATGCAGGAACCGGCGGAGGGAGCTGACCGACCGACTCCAGGCGGAGACAGATCAGTTGGAGGAAGAAAAA
GC^GAGCTGGAGTCGGAGATCGCCGAGCTCCAAAAGGAGAAGGAACGTCTGGAGTTTGTGCTGGTGGCCCACAAACCGGGCTGCAAGATCCCCTACG

AAGAGGGGCCCGGGCCGGGCCCGCTGGCGGAGGTGAGAGATTTGCCGGGCTCAGCACCGGCTAAGGAAGATGGCTTCAGCTGGCTGCTGCCGCCCCC 
GCCACCACCGCCCCTGCCCTTCCAGACCAGCCAAGACGCACCCCCCAACCTGACGGCTTCTCTCTTTACACACAGTGAAGTTCAAGTCCTCGGCGAC

CCCTTCCCCGTTGTTAACCCTTCGTACACTTCTTCGTTTGTCCTCACCTGCCCGGAGGTCTCCGCGTTCGCCGGCGCCCAACGCACCAGCGGCAGTG 
ACCAGCCTTCCGATCCCCTGAACTCGCCCTCCCTCCTCGCTCGGTGA
TABLE IV



MOUSE NOMENCLATURE 
ICSGNM Cmkbr7 
Celera mCG15362 

HUMAN NOMENCLATURE 
HGNC CCR7 
Celera hCG1643840 

MOUSE SEQUENCE - GENOMIC 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGAGTTCCATTCTCAGCCCGC^ 

CTTAATTTCTCTAGTTCTGTTATTTCCTCATCTCTAGAATGGGCATCACGACACTGATTTCACAGAGTTGTTCCGAGATGAAATGATGTGATGGAGT
AAGTACTAGTTCCCAGCCTCTCAGATCCTGGAAATGGGTGTCCTGCCTCTCCCTCTGGAAGCTCCCTGAAGGCCAGGCTTTGGGTCCTTCATCCAAG 
CTTGGATGAGTATTTTAAAAGAGTCTCCGTGAGTTTTGCTCATTAGCCAAAACGGATGCTGGCCTTTGGAAGCGGGGCGGATGACTCTGCGTGGAAA 
TATTCTTCTCTGAGTTTCCCTCCTCTGCCTACTTCCTCGTGAGTTCTATGTGGCATCTATTTGAGGTCAAATCTATTGAAATATACTTCAGCTTAAC 
TCTAGGTGCCTTTTAGTTGAGCACCAAGAATATACTAAGTGCTATGTAAGAAATAGATAAAGGACCCAGTATGGAGGAGGAGAAATACCAAGCCAAG
CCTAGTACTCAAACCGCAGACAGTGATAGGGACAGCCGGATCGCTTTCTGTAGGCTTGAGGGTTTCTGAAATTCTTCAGGGAATTGGGAGCTCTTTG 
AGCTGGACTTTTTTAAGCGTATGAGTGTTTGGTCTCTCTCTATGCATCTGTGTCATGAGCATACCTGGTACCTATGGAGAACAGGAGAGGGTATCAG 
ACACCCTGGAACTACAGATAGTTGTCAGTTGCCATGTGGGTGCTGGGAAACCCACCCTAGGTCCTCTGCAGGAACAGCAAGTGCTGAAGGGACTACT 
GAGCTGTCTCTCCAGCCCCTTACGTTAAATCTTGAAGAACGGGTTAAAAGATGACAGAGCGGGGGAGAGGTACCGGAGAGAAAAATGTGTGAAAGTA
GTCAGGGCCTTGACCTGGAAGAGTTAGAGAACAAGGCAGATGCAGTTGGCAGGTCCGGAGCAACAGCTCCTGAGTGTTCAATGGGCCAAGTTGAGAG 
GGGCTTTGGATGCCATGAAAGGTTGCCGAGGAGAAGGAAGTAGAGAGTGAGTCCTTCAGCTACCACAGGGGAAGGGTGGGAAGGGGGTGAGGAGATT
TCCTAAATGTGCATCATTGGCCTTCTGAGGTCTTCAGAAGACGACCAGTGGGGTCCCCAAAGGGTGAGACCAGACAGAACAGACAGCCACTATTCCC 
TACTCTGACCCCCTACCTCAAGAGGAAAGGACGAAGAGAGAGCTGAGAAGAGACCCTGAGCCTCTTCACCACAGCAGACAGGACCAAGTTTCAGCAG
GCACCAGGACACTCAGGTGTCTACGTCAGTCGCTAGCTATGCACATGGGTACCCTGGGCATGGAAAGTACAGACTGGCTTTCTAACACATCATTTTG
CCGTGAGACCAATCCCAGCATTCCCTCTATTGTCCTCAACAGAATCAAACGTCCAAGCTAAGGGGCTTGCCACCTCTCCAAAATAAAGGTCACCTTA 
GAGGTCAAGAAAGCATTGACACAAACCTTGGGTAAGCCCTAAGACTGTAACTTGGGAGTTGTCATGACAACAGGGCAGTAGAGCCATCTCTACTCTC
TCCTAAATGGCACGAGGTTTAGATGAGTAACTCTGAAAGAACAGACCCCACCCCTGTTTTCCTATTTACCATCTTCAGACCCCTCCTCACCTGTGCC
CCAGCCCTCTCCTCTAGGACCTTAGCAACTTGAACAACATAGACACACACGCCTCCTCTCAGTGTCC^ACAATGAGCCCTCCCAGCAAAACAGGAAG
CCTCTTGCTGCCAAGGCATTGTGCTTGCCCTGTCACGTGTCAGGAAACCACTACCGTGCCATGGCAACAGGTGTTGCTGAGACCTTAGCCTTCCCGC 
CAACCTGTGCCCTGCCCTTCTCTCTCCTCCTTACTCCTAGAGAGTGTCCTCGTTGTGCAGTGTCCTTGACTCACCTGGACACTGCTGTGAGTGAGCG
TCCTAATTACCTTAACTTGGATGAGAAGACTAATTCACCCTATAAGTCAGTGGCCCAGGGCCAGGGCTTGGGCCTTAGACTGCATAAAAAGGAGAAA
GCTTGCCGACTGCAGGTGTTCATCTCCCCTTGTTTCCTGCCTATCAGTGCAATGTCACCAGCCACTCCAAGCCAATGTCACCACCAGCCACCCCAAG
CTCCTGCTGCCATGATTCCCCAACCAGGATGGACTCTACGCTGGAACTGTGAGCTGTATTAACGCTTCTCCCCTAAATTTTTCTTGTCAAGGTGTTT
TCATCACAACAGCAGAAAAACCAAAACCAACTGACACTTCCTCCTTGCAACTGGCAGCTCACATCGCTTGCTCCTTTCCCATTTGGACACCAAAGAT 
CTCACTGATCATCATCCAAACACAGTTCTTCATGCTAGCCAGGCAACATAAGAATGTACCTGTGTGGAAATTTAGCAAGCTTCTTAGCTCTTTTGGA
TCGCAAAAGTCTAGGTAAGGTCTGAGATGACTCTCTCTCAAATTCCAATATCCCTAAA^

GCTAAACTGACTGTCTTAGTTAAGGTTAGTAGTGCTACCACACCAAAAACAAAACAAACAAACAAAAAAAAAATGAACAAGCCAGGCGTGGTGGTGC
ATGCCTTTAATTCCAGCACTCGGGAGGCAGAGGCAAGCAGATTTCTGAGTTCAAGACCAGCCTGGTCTACAAAGTGAGTTCCAGGACAGCCAGGGCT
ACACAGAGAAACCCTGTCTCAAAAAAACAAACAAACAAACAAATAACAGCAACAACAACAAAAACATGAACAAAA
TATTGGCTCACACTTCCATATCATGGTCCATTATCAAAGG

GGCCATGGAAGAGTGTTGCTTACTGGCTTGCTGAGTCTGCTTTCTTTTATCATCCATTGTTACATGGAGATTAATCAAAGTTGCATGAAAAGAAGAG
AGAAGGACCTCCTCAGACGCAGATGGAAGC^TGTGGTCCTTTGTTAGGACTCTGAAGGGC^^

GGGGGTTTGAGGGTTTTATAGGACAGAAAAGGAAGCGTTTGTGGTTGGAAAATTCCATCTGCAGATCTGTTTCCTCAAGTCTCAGCACTGTAAATAC
CAACGGAGGTTCTTATCCATTAGTGAATTTCTTGTCAGGAGAATCCCGGCTCCTAACTGCCTGCAGATGTTAGCAGTGTATCCCTGGTGCCATCCGG 
AGTTTCTCAGCTCTCCTGGGGCTTTTAGGTCATTAACCCTTCACCCAAGACCAGCAGCAGCAGCACCGCCCAGTGAACTAGCCTCAGAGACTCACCC 
TACAGGCCAATCTGATGGACGCATTTTTCTCAACTGTGGTTCCCTCTTCTCAAATGACTCTAGAGTGTGTTAAATTGACATAAAACTAGCCAGCACA
CTGATGATTCTGGAGTGAAATCAAAAGGAGCTTGAAGATGAGGGTGGGGACAGAGGCAAGGGTATGTCTTAGACTTTACTCATTGAATTGCAAGCCG
AGAAGGATTAGAAGACATCAGCAGTATCAGTGGACAAGTGGCATTTGAATGCCATATGGAAGAAGGACAATAAGAAGTTCCTGCAGACCTGGTTCCA 
TCCAGAAGCCTCTGCTCTGACTCCTCTCCCGGGACTCTTTAGGTCAAGGACTTTCTCCAGGGGTCTCAAGACTTTCTGTGTTCCTTTATACTAAGTT 
TTCTTTCATCCTGTGAACAATCAAGTCATCAGTGAGGTGGTTGGGTGACATGGCCCATAAAGATGACTTCCAGGGACCTGGTTCCTGCTCACACTCT 
GAAACCTTTGCTCCTGGGCAAAACTTCCCCTAAAGAACAATGACACACATTCTCAGAGAACAGAGAACAATAAGCCGTGTGTAACAACATCTTCCAG 
AGAGAGAGAAGCGTGGATTTAGTCCTTTTGGGCCACGCTCAGCATCAGCAGGAGCTAGCTTGACATAAGGAGAAAAGCCCCTTAGCTTGGAGCTAAA 
GGTCACATCACTTTTAAGTCTCAGTTTCTACAGCCACAGAACAGAAAGTTACCAAACACAGCAGCCTGAGGGTTTCATGTGAGAAGGTCATGGAGTC 
CCAATCACTCACTCCTTAAACTTGACTCTGTCTGCAGCCCCCAGCTGGCCTTGAACTTGTGGCAGTGGAGAATGTGCTTGAAATACAGAAGCCAAAT 
AGAAACCTGGAAATGTTACTTGTCACTGGTTGGAAGCCCTGGACCATGGTGGAT^ 

AG ATGTTT ATAGTTG CTGGATGATGGTGGATGGACAGACAGACACGGACTGGCGCCTAAGAAAAGTCTAG CATGTCAT CACGT ACTCCAGC AGC CCG 
AGAGCTTCATGTC^AAGACACAAAGCAAAGCCATCTCATGTGCCAACCACAACCTAGCCAAT^ 

CCATAGCACTCTAAAATGAGGCCAAGTTAAACTATGTGGCGCTCTCTGCCCTCCAAACTGCCAACTTCAGCAGCCCTTTTGAAGCTGAGTCAAACTC 
CTGACCATAGCGTTTCTTTTCACCACTGCCTTCCTCCCTCACCTGGCCCACTCCCTCAGCTCTCTAGGTGCCACTCTTCCCATAGGAGTCTTGGTTG 
GACTCCTACTGTGCCTAAGCTATGCCAGGCTGTTCCCTAGCTAAGTCCTGCAGTGTTGTGATGCAATATTGTCCTCCATTATTTCCTAAATACCATG 
AAAT AT TTAATATAT AAAGC C AT AAAAAGCCGGG C ATGGTGGCTCACAC CTTT AAT CCC AGC ACTTGGGAGGC AGAGGC AGGCAAATTTCTGAGTT C 
AAGGCCAGCCTGGTCTACAGAGTGAGTTCCAGGACAGCCAGGGCTACACAGAGAAACCCTGTCTCGAAAAAACAACAACAACAATAACAACAACAAA 
AAGCCGTAAAAAATAATGCAATGAAGACTTTAGTTCATC^TAAATAAAACATCACCTTCCGAGCCCTTCCCAAAATTAAGGTGAGCCTGTTTGTGGG 
TGCCATTTCTCTTCATATTGTTTGAGAGTGATTGTGTCTTGTCATTTGTCTCACTTTGTGTTCCTTGGTTTTTTGGCCACAGAGTCTCACTCTGTAA 
AGGTGGCAGATCCAAGTGTGGCAATCTTCCTACCTCAGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTTCTTCTTCTTCTTCTTCTTCT 
TCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCT 

GT AT AC CTG CAGGC C AG AAAAGG ACACCAGATCTCATTAG AGATGGTT ATG AGCCAC CATGT GGT TGC TGGGAATTGAACTC AGGACCTCTTCAAG A 
GCAGCCAGTGCTTTGAATGTCTGTGCTGTCTCTTCAGCCTTCCTCCTCCTCCTCCTCTTCCTCCTCTTAATAATTTACTTTTATTTTATGTACATTG 
GTGTTCTGACTGTATATATGAGGGTGCCAGCTCCCTTGGATCAGGAGTCATAGACAGTTGTGAGCTGCCATGTGGGTGCTGGGAACTGAACCCAGGT 
CCTTTGGAAAAACAGCCAGTGCGCTTAACTGCAGAGCCATCTGTCCAGCCCCTGCCTCAGCTTCTTAGTGCTAAGATTACAGGACTTCATCAGAACC 
CTCTGTTGATAAGACAAGATCCATCTTTGTCTATCTTCTTCACTGTGGTGGACTGGATACTCCCTACCACCATCACCCACAAAGATCATATGCTAAA 






GTTATGACCCTACGATGGCTGGATTTGAGAATAAAGCTGTTGGGGGTTGGTTTTAATGCTTTGAATTAATCC^GCCCCCAAAATCGGGAAACTGCAT 
GTCCAAACACTAAAGATCCTTGTCCCCAATTGGTTTTTGATCAATCAAAAAAGAGCCA^ 

TAGATTTGCTCAGGCTGGGAACTGGGAGAGAGGAAGGAGAGAGC^TTGTCGTGGTTTGGAGGGAAACAGATCAGAATTAGAGACGCAAAAGGAAAAG 
CATCCGAAATGTAGGAGAGAAAGAATTCGGGCCCTGGAGGAACAGCTCAGAAGTGTCTTGGGCAGCAAAGACTGGGGTAGGGGCACCCAGAAGGAGC 
CAGGGCAACAAAGATAAAATATAGAATTAGAGGGCATTAAGCCAGGAGTACCAAAGGGA^ 

CCTTTGAGCTAGTCAAAGCATATTTAAAATTAACTGGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGCTTCATTCTTGAATCCAGAT 
AGCT CTTGGGC AGGTGCATGGGTGTGACGAA TCGGGAGCA CAAAATGAT TTAGCTAAC^ 

AATAGCACTATAGAGTAGGGTCCTGGTTCCATGGAATTGGTATGTGAGTGACATTACC^GAAATTGACCCTACTGGCCTTTGCCTTGGGACTTCCAG 

CCTCTAGAACCATGAGGAAACATATTTCTGTTGTGTAAGACATCCAGTCCATGGTATTTTAAGAGTCCAAACAGACTACTCACTCCATGTAAGTTAA 

GCAAGG CCAAGGAT CATGTGTGCTT GTCTTTGGAT CAATACTGTCC AT C ACTAT ACC TAACACAT AAGTGCATGCCAAATATCTGTTTGTTGAATGA 

ATGAATGAATGAATGAATGAATGAATGAATGAATTAGATAATTACTAAAACAAAACATATAGGGTTCTGGGGTATGCAAGATGGTTTAATGGGTAAA 

GGTGCTTGCTGACAAGCCTGA<^CCTGAGTTTGATCCCAGGAGTAGACCTGGTGGGAAGAGGAGACTTACTCCTGCAAGATGTCCTCTGACCTCAA 

CAAGCATGCTGTGCCACATGCCTGCACACAGCTCTTTCTCCCTCrCCCTTCCCCCACCCCCATTCTCTCCTTCCTTCCCTCATGCCTCAACATGAAA 

GCTCTCGGCrACTGCTCCCATACCTTGTTTGCCTGCCTGCCTCCTGCCATGCCATCATCCACCATGATGGATGTGAACTCATCCTCTGAAACTGTAA 

GCAAGCCTACAATTAAATGCTTTCTTATATAAGTTACCTTGGTCTTGCCATCTGTTCACAGGAATAAAAA 

CCCCCAAAAACCCTTGTAAATGAATATTTATAGCAACAATACTCATGATAACTAAATAGTAGAAACAATCCAAATAGCCAA 

CAAATGTGGTATATCCACACAATGGAATATTATTCAGCCACAGAAAGGAGTGGAGAGTTGAAACATACTCCATATAGATGAACCTAATTAAAAGAAG 
ATAGATATGAAGGTAACATTGTGTCTATTTATGTGCATTAATCAACAGAAGCAAATCTACAGAAAGAAAGTAGGTCTGCGGCTCAGAGCAGTTATGT 
GCACAGGTGGAAAGGACATGGGAATCATCGGAAATAATGGCTGAGGAGATCTGACTAAAAATGATTCAAAGTTTACCGTGGCGAATAAACGCCGTGA 
CTATATAAACACC&C^GAAATGTCTACTTTAAAAGAGGAAATGTG^ 

ATCCAGGCTCAGTTCCCAGCACCCACACATCAGCTCACAAGGCCTGTAACTCCAGTTCCAGAGGATCTGACGCCGTCTTCTGACCTCCACGGGCTGT 
AGGCACACAGGTGGTACACAGAAAGGCACTCAGGCAAATGCTCATACA CATAAAATA AAAAT C AATGAATCGTTTT CAAAGGTGAATT TAGAT ATG A 
ACTATAGCTAACTAGAGCTGTTGTCTTTTAAAAGAAAGGAAGGAAAC^

ATGGCCAATTCTCTCCAGAAGCTAGAGGGAAAGTCGTCCGGGAAATGTAGTCCCTGTATGCAGAGCAGAGCAAGGGTATCAAGTGAGTGCCTAAAGG
GTTATGTTGTCATGGCAACTGTTTTAACTATTTATTTATCTTATGTATATGAGTACACTGTCACTCTCTTCAGACACACCAGAAGAGGGCATCGGGT

CCCATTACAGATGGTTGTGAGCCAACGCGTGGTTGCTGGGGATTGAACTCAGGACCTCTGGAGGAGCAGTCAGTGCTCTTACCCACTGAGCCATCTC 
TCCAGCCCCCTGTTTTTAACTAGTTAATACAGAAGAAGGGCTAAGTCATTCTGAGCTCTGTGTCGGGTCCTGTCGCTCACTGAATTGTCTACAATCA
GTGGCTAGAGATAAAAAAATTTACAGGTGTCTCTCCTGGAAATGGAACATTTAGATATTTGCTCTAGAACTCAATGGCCACCCACAGTCCCCACATG
ATCTCTATGCTTGTTTTAAACCCACTCCTTGGACTGGGGAGGTGGCTTAGTAGCTAACAGAGAATTGCCCCACAAACACGAGGAACTGAGTCCAAAT
CCCAGAACCCTCATAAGCAGCTTGGCAGTAGCATGAGCATTCGTCATCCTAGCAGTCCCCCAAGGAGATGGGAAGCAGAGACAAGAGACCACGCACC 
CAG AAGGCCAGCTCACCTTAGTA TAG AATGGGGC A CTCAATAAAGACGTGAT AC CTG CA ACAGAATGG AAGTAGGCTGA C AC CTGACATTGTC T CTG 

ATCTTCACACGTGCAATGGCATAAAATCCCCACATTCACGCATACGATCGCACACATACACGTGTGTACACACACCATACACACCACACAGGCACAA
CAATTAAAAATGCCAATAAATATATCAATCAGTTTCCCTGTGGGGGAAAATGGGTGTGGTC^

GCATGGTGGCATATGTCTTTAATACCAGCACTCGGGAGGCAGATGCAGGTAGATCTCCATGGGTTCGAGGCTAGCTAGGTCCACAGAACAAGTTCTA

GGACAATCATGGCACTGACAAAAATAAAATAAAATAAAATAAAATAAAATAAAATAAATAAATAAATAAATAAATAAATAAATAAAATCTCTT

GGGAACCAGTTTGGATAATGACTTAGACTTGATGTCCTCCTCGGTTAGTAGAGAAAGCCGAGGGGCTCCAGGCACCCTCATATCTAGCATCTAGGTC 
CTGTGATGTTTAGCTTAAGTACTAAAAATAAACACTCAAACTTTCAC^
CACCCTGGCTGCGCAGGTGTATGTCGGGGCTGCCTGGCGGCTGCTGGGAC^

ACCAAGAGAAGGATGCAAGTACACCTCTGACTCCCGAGCCTCAGCCTATCTCACCCCACAGCAGCCACCCAGGCGTGCTTGGCACTCACAGATGTAA
GCAGCTCCTCCCTGAGTCTCTACAAGGCACTACTACCAGGGGCTTTCTGAAGGCAAACGACTCAGTGATTCAGGATCCTATGACAGCCGAATGTGGT
TATCACATACCTTGTGGCCCAGGAAGACTCTGAGGGGTGGGACAGGAGTGGGAGGGGCTCTGGAACAACTTAAAAGGGGCTCTTACCACTTCCTCCC
TGCCAGAGAGGGCTAGCTGGAGAGAGACAAGAACCAAAAGCACAGCCTTCCTGTGTGATTTCTACAGCCCCCAGAGCACCATGGACCCAGGTAAGTA 
AGCTTTCT CTT AAG AGAAGTC AAAACTT CTGAC AC AAAG CCAGA CTCTAGGG C AATGAACATGC ATGGGGGATAGGGG TCATGGGGGGGGGGGGGAG 
TTT C CC AGAGC AAT C AGGGC AGT TGGTGTGAACT GGTGGCTG CTTTCAGAG AATGCCATCCCGTTGGATGAGGCATTGG AGAGAGGATGT ACCTGCT 
CTT TCAG AC CC AGGGGA AGG AAC AG CTGT GGATGC CAAGACAGACACAGGC ATAAAG TTGGGTCCTCT CAAGCAG CAGGC AATGAACAG CCTGGTAG 
ACTGACCTTGATGCCTGGATAGAACACGTTCTCAGCTGGTAGGGATGfGGTGATGTTCTCTGGCACCTCTGGGGACTAAGAAAATTCTCTTTAGAGAG 
TTCAGGACACCCTGCTCCCACTCTGGGTCCCTCCTTTCTGTCCTGTTAAACCAGGATTTGAATCTCCTTGCTCACGAGAAGAGTGAAACTGATGGAA 
ACTGTGCTGTGCATAGTGCAATACAAATACTAGCTGCCACCTCTGCCTCTTGT^ 

CCCTCCCTTCCTCCCTCCTACACATCCCAGGTTTTGTGCCTCTTGCAGGTTAAAGGGTGAATGACTAACATAAAATTGAGTGGCCCAGGGGGCACTC 
CACGTAGAGAAACrGTAGCCAGCACAGCACAGCCCCTAAAACAAAGCCCACACACCACAGCCCTCTCTATAACAAGCTATTCTACCAGGCAAGCCTC 
AGTGTCTACTATAGCAAAGTGCCAAGGGGAGTCCAGCTCCAGACIACCCCAGTTACACATCTTAAATCTCGGCTGCAGAGTCAGACTCGGCTGGGGCT 
GGAGGGGCCAAGTTCCTAGAGTTCCGCTCACCAGCCTCAAAGCCACAAGGGGTTGACTTTAAAAAGTGAAGGTTCCCTGGTGCTGTCCTCACACCCC 
CTC C AAGCC AC ACC ACAGCC AGC AGCAT CTTT AT CTGTCTCTCCTGCCCAGCGTGCA TTGCA CC CTAGAT CTAGATCCAGCTGTGG AAGGTAT AT AG 
TGGGGTCCACCTCTGCTGGCTATGAACTCATGGGAAGCAGGATTTTGTTCTTACACCTCTGCCTCTCAGATGTTTTGGGTGTAAGTAAAAAGTGTTG 
ATGGGAGGGAGGGAGAGAGGGATGGAGGAATGGATGGATGGATGGATGGATGGATGGATGGATGGATGGATGGATGTGTGGATGGATGGATGGATAG 
AATCAGCCTCTCTAAGTATATTTTAGAAAGCAAATATCTATTCCAGCTTCTCCTCCCACCTGCCTGCTTTCTAAGCCTACACCTCAGGTATCTACCT 
CCACCAGCCATAGTCTCAGAGAACCCACACCATCCACCAAAGATCTTCTCCACCTTCCTAAAATGTGGCTCCCAGAATAGGGAAAGGTTCTAAATTT 
ACA GCATTTGCCAATTTC CATGGTGTAAATTTTACCAT CCCAATTGAC CTTAGT CGGTC CCATTAAAGGTCTAAATGTTTTG ACACAAAAAGGAGAC 
AGACACCTGCCTTCCTTGGCACTTGGGACACTATAGAGTCTCTCCTGAACTAGGTTTGGGGATGCTAAATGCATGGCTTTGATACAGGGGGTGGGGC 
AAACTCTGGTCCTGGAAGAAGAGATAGTCTTGCTTTTTTACTCAACTTAAGAAGCACCCTAATAGCTAACTCTGAAATGCTGGTGTGAGGGGAAACT 
GCTTAA<^GTTTTGCTCAAAACAGGCTCCGGCX3COT 

CAGGAGGTGGGTGAGAAGGCCTAGGTTTCTGTTATACAAAATGAGAAACTACAAACCAGGGCTGAGCTGGCCTCTTTCATGATTTGAATGAGCAGCC 

CTGATATACCCTTCCTAACTTCAAGACATGGGACCACAGATCTCAGCACAACATCTCAAAATCAGAGAAGGGGAGGGGGTGACAAATGAACCCATAA 

GAGACTTGTGTCTCACAGCTCAGC TGTAGATGGCAGTAG AAGAGAATTC AGGGTCATT CTCTTGTGGCC CTC C AG AAACCTGG CTGTGG AGAAC ACT 

GGAGCCCAGAGTAGCTAGGAAAAAACGCTCCTCTGTGTGGTATTGTCTTCCAGTCATGTGTCACTCATTGAGAACTTCTTGAATACAGAGTCTGGTG 

GGGAACGTAACATGTATGTCAACACGCAACACCAACAGTGAGAGACACAGCACrrCCTGAGGGGGCATAAGGACAAGATTCAAAA 

CTGTGAACACGTGTCCAGAAAGCATGGCAATTGTATTTTGAAGGGTGTGTAGAAGTTGTCAAACCTAATAAAGAAGGGCCAGGTTGTAGCCAGATCT 

TCAGACTCTACAGCCTTAAGCCAACAAAGTAAATGAGGCATCCGAGTGTCAGGAA03TGACTGGCACACAAGCCTGAGGAACCTGAGCTTG 

CAGAGTTTAAAAGCTGGCTGTAGCAAGGTACACACGTAATACCTAAGCCTGCTTAGGGAGGCAGGAATAATAAGACTTGCTGGCCAACCAGTCTTCA 

GAATTGGCAAACTCCAGGTTCCATGAGAGACACTGATGTCAGCCTCTGACCTCTGTATGTGTGCACACCCATACATATATGAGCACACACCTTGAGC 

TCATAGTCAGTCAGCTCTGGTGAAGGCAATAGAGCAACAAGGAAGGGGGAATGGTGTCAGCTGATGGGAGTCAGGGAAATAGATGTGCCAAGAAACA
AGCTAGACTGGGGTGGCCCTTACAGCGCTGTGTGACAGCACCAGGGAATCCATACACCACCCTAAAAAGAATGGGGGCCAGCAAAGGTCCCCCACCC
TTTTCCAAATAGATGGGCAGATAGATGAAATGTGTGTTTTGGAAGGAGTAGGGGACAACAGATCCTTGTATCAGGACAGG 

AAGATGTGGAAGATCGAGATGTGGAAGCAACAGACACGAGTGGCCAGAAGCAGTAGAGAGAGCAAGAGTGGCCTTGATCTGTGGAGGACCCTCCACA 
CCCAAGGCTCATCAGTAATGTCACATGACTCAGCCATTAGAACCC^ 

AATCCAGAGGAACGACATAACATCTCTGGGCCTGTTTCCTAGAATACAAAAGGAACTTACAAGATGTTATGAGGGGGGAGAGATGTAGGGAGAGGAG 
CTAGGAGTGAGGTGGCCATGGCCAGGGGCTGGGTGTCCTTAGGGAAGCCTTGTGCTGACAGTTCTTCCGTTGGGGCTGATTGTCACGTGGGGTCTCA 
CAGGGTATCCATTGAGGACGGGGATGGCAGCGCCTGTCTCACGTGGACAGCGCACTTCACTGAAAGCTCAGTGGGTGCCAATTGCATCACTGAGTGA 
CAGCGTTGGTGCCCTAATAACAAGATCCCACCAGCTCTCACATCCCTGCCTra 

GTCCCCTACAAGCGGCTGAGAAGGTCCAGAGAGAAAATCATCACCTGAGGTGTCTCTCGGGCCAGATGATAAGCAGCCATGGTTTCCTGGGCAGCCT 

GGAGGGACTGGGGG AATGTA CAGATGGG AGC TTT AGG ACCTG AGC AGGAGGCGACTGGCTTC CCAGGCTGGGAG CTGGGGGCGGGGGAGGGAATTGA 

GTGTCAGGCTGAGGCAGGGCAGGGCAGGGGAGTTGGGATGGTGTCGGGCTAGAGTGCTAGGGAGTTTTCCTTTCACCTTAATCATTCTGCAAACCCA 

AGAACTCAGGAAACACCGAACACCTCAATCAACAGCTCCCCCACACTGGCATTC^ 

GTCATCTGACTCACCAACACTGACCAGGTCCCTCTGT^ 

GGATCTACAGGACCCCGACTCACCCACCCAACCCCCACCCCCAACCTCM 

AGCCCCTCCGGGTTTGTGGCTACAAAAGAACCGGGACAGAACCCAGAGTAAAACAGAAACAGTTCTTGTTCCTTAGGGGTTTCTCCACGTCCCTGAC 
AGT C AGATTGG GCT TTCTCT ACAGT AACCGAAAGCCGTTGATGGGTTG AAAGAAACCTCGAA CGCTGAACC CCGAAATGAGTCAG GGAAGGGCGGGC 
TGTGCATG^GACCCTGAAAATCCTATAGTTGAGAATGGGCATGTGACCTCAAGCTCGTGGGGTGGGGGTGGGGTGCCAGGGGTAATTCAGTTGGGGG 
GGGGAATGTGTG AG AGAGAT GAG CTGTGAAGCCACGTGGTGATGTTCCCCT TCCTGT CT CCA AACGTT CTAGT ACAAGCTGATGATAAGGGGACAG T 
GGCTGTCTGTCCCCAGAAAAAGATGGAGCTGGTGAGGCATGGGCCCCACGCTAAGCCTAAGCGTCTCTCTATCTCTACAGCTAGGATTTCGACTTAT 
GGTTAGGGATTTAGGAGGGAGGCAAGATAGGAATAGAGGGCAGAGTTAGCTTCAGTAAGACGTGCTTACATAGCATGGAGGAGAATGTAGAGGGGAA 
AAACGTTTGTCCGGGTCACCCTGATGCCCTTCCTGCCTGTACTGAGTGACCAAGCGTAGCCTCCTCAGCCTGGAGTACATGTCAGGTACCCTGGGTG 
TCATGTAGCCTAGTCTCCAGCAGGTGGTGCTATGGCCAAGGACAAACAACATCTGCGTTTGAACTGTTTCCCTGGCCTCAAGAGGTCCCCCAGACTC 
AGATTG GGGAG CAAAGTCCT ACT CTGGACTG CAAGAC CCATAGGGT CTGACTCTT CCTG T AG ATCTGAAGAACTGGGATTCAAGGGCT GGAGAGGGG 
TGCCAATAGCAAAGAAGGCGGATCAAGGGTCCCCAGGGGTGAGACGGGAGTATGTGGGGTAGAGGTCTGGGGTGGGGGGCATAAGATGTCTGCAAAA 
ATTACTTTTCCAGTTGTCCTCTTGCTTAGCTGTCAGGAGAAAATACAGCCCAGGCTGGCTCAAGGCTCAGCTCTGAGCAGAATGAGGAGTGGTCGCT

0 GTCATATNNNNNNNNNNNlWNNNiraNNNNJ^NNN^ 
f*t NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NtTON^NNNNNNNNNNNNNNN^NNNNN^ 
4- NNNNNNNNNNNNNNNNNNNNNNNNNNIWNNNNNNNl^ 

l^s N1^NNI?NNNNNNNN]^NNNNNNNNNNNNNNNNNNNNNNNCTGGG C AG C AGGCC AGGTG TTC AAA C ACACAAT CTGGTTCTG ATGTT CTCTTT C AGC A 

AGCCCGAGGTGCCTGGAGCCTGGGATAAAAGTTCCCAATGGTGCCCCTGCCCCACCCCCGAATGTCTCCTTAATCACCGAGCAATTTTTACCGGGGC
GGTAACATCATCTCCTTTGTCTCCTCTAGGGAAACCCAGGAAAAACGTGCTGGTGGTGGCTCTCCTTGTCATTTTCCAGGTGAGGCTTCCTGCCAGG
GCAAGAGCTCTCTTCCCTCCTCCACACCCCCTCCCCAGCCAGGAGTGCTGGAACACAAACAGACCGCTGCAAGGAACGAAAGGTCTAGACACAAACC

CACCTGCCGTTCACGGCCCCTTGCCCCCCCCCCGCCTCCCCCCCACGAAGGGGGCAGGCATTCTCTGGACTTCCATTGTCCCCCTTTAAAATAAAGC 
CCAGGGAGACGTAACCTACCCCCACCCTCGTTCTTCCCAGCTCCCACCTCCTCACTCAGGAGACAGAAACAGTGTCTGAGATCTCTTAGACTTGTGC 
AACACAGGGCTTAGAGTGGGCCTGGTTTTGCACATAGGTGCACACATCATCTGTGAACTGAGAAAAGGACCCTCGGTCGGGCGCTCTGGTTACTCCC
CAGTCACCCAGAGTTGCTTCCTCTTGCCTTTCAGTCCAAGGGTCTCCTTGTAATCTCATTCTGTTTTCCTTCATAAAATAGAAAGACCTTGGCTTGG 
CTTGCCTGTGGGGAGCAGGAGGGAGAATGAGTGGAGCCCGGGACCCGGGTGGAGAGGACGGTGAAGGGAACACATAGTGGGTCTGAATTCTGCAGAG
CCAAGAGCCCACTCCGTTCTCTATCCCAGCAATGGAGGCAGAACGTGTGTCTCCCCAACCACCAGAGTCAGGAGAAACTCTGAGAGTCAAGTTTCCT
CAATTGGGGTAGGACTCTCTCAAGGTGGGGAATGGCCACGGCCAGCCTGGAGAAGCAATGCTTCAGGATTTGACTTTCTCCCAGGTCCGGAGGCCAG
AGGGGGAGCTTTCCAGGAGCTCAGGCCAGGCACAGGCCCCAGGCTGGGCACTGTTCCCCAAACCCCTCTCAGAGGCCCCCTGGAGCTTGGGTTGCAG
CCCCTTAGCCTGGCCCTTTCCCCATCACCACTCCCGCCCCTCCTGTACTGCTGGCTGGAAGGCCAGAGAAGGGTAAAGACATCTGTGGCTGGTTTTT
GTCTTTCACTTGGGAGGAAGGGraGGGGGGGGGTTCAGACAAGGCCGGATGTGGTGGGGCTGTGGGCTGCAGAGGCGGCAAGCTTGGGGGAAGGGGTG
TTG AAC AGG AGGGAG AAG AGAAGAC TGGG AGGGAGGTGGTGT GGG AGC ACACTT CAT AGAAGCTGGGAGTT CT GACCTC CAAAGGGTCTAGAAG CCC 
TTTAAGCCTCTGGGGCCGCAGACCTAAGGCCACAGACCCATTCGTGAAGAGCCAAGCGCAGCTAACCTCTCCCAGGGCCCCCAGGCTGCAGTCAAAC 
CTGGACCAGGGAAACAAGGTTGCTATGGACTCTTCTTCTCAAGTTAGCCATGCCCCAAGAAAGGGCTCCTGAACACCACTTTGTCCTTATCCTGTCG 
GTAGGCTCACAATGGTGGTGCGATCTGTGGATCACCACTGAATGACTGAAAATCTCCTACCCAGAGGAGGAACCCTAAACTTTCCAAAGTGGGCCTT 
CAAACCACCCCACCCCCCTCCCATCTGTGGTGGTCCATCCATCACACACAGGCAGACGGCAGGCACCCCCTCCATTGATAAAAATCACAGCAGCTGC 
TGAACCGTC CC CCA C ACGCAC CC TG C AGCCCTGCTAAAGGGCT AAAAAAAG CCC AGAACAGCTGAG CCCTG CT GATTGGGAT CTGCCGGAAGAGAGG 
CAAAGAGGTGTGGGAAGAGGGTGGGAACACAGAGGGAGAGTGGGGGAAGAGAGCCGCTTCOTTCCAGGGGCACAAGCCGCCATAGGAAGGCAGTCAC 
ATTTACCCAAGCACTACCCCACAGGCCCTACGGTACATTTGTCTACGCGCTGAGTGTAGTGGGTACCAGGGAGCTGTGAAAATCCCCATGTCTCTTA 
TCAGCAGTCCCTCGACTGAAGTTTATTCCTACAAAGGAGCTTTGATCTTCT 

TGTTTTTTTGTTTTTTCCCTCGAAAGCTGAGGCATTTTGTAGCCCCCACCCCAAAAAGAAAACTCTCCTGGGACCCAAAAGAGGCTCCAGAAGAAAG 

ACACACTTTCCCCCCACATCACAGGGTCCCAAACCCCACAACGGTCOVGCACAGGAGTTGCGATTTGCCTCTATCTCCCAGACACAGAGAAGATGTG 

C GAT TT C CAGCT CAGGGCTCAGC AG ACC CCAAAG CCCT AG ACT ATCCCC AATCT CAGCC AGTG AGGATG AG ACTGG ATGTCCCAGG AGTTGACAAAA 

C ATCAGGCCTTTGC TAC TGATGAGT AGAAATGTAAACTTT CT GGT CCTGCCTGAGTCTGC AGGGCCAGACGTTTAC ATCTTAACTAGATCCTC C CTG 

AGTAATTCATACATGTGCTAACATCGTTACTTAGATTTATTTTTTGTTTTGGTTTGGTTTTTTGGCTTTTTAAGATTTTTTTATTCATTGAACGT 

ATGCATACACTGTAGCTATCTTTAGACACATGAGAAGAAGGTATCAGATCCCATTACAGATGGTTGTGAGCCACCGTGGTGGTTGCTGGGAATTGA^ 

C CCAGG ACTTTTGG AAGAGCAGTCCGT ATT CTT AAC CCT ATTTAGGATTATTTT ATTT TATGGGTAT AAAC AT TTTGCCTGCGTGCATAGAAGTACA 

CCACATGCACAGAGGAGGTCAGAGTTGGGCATCAGAGCCCATGGAACTAGAGTTACAGAGAGCCACGACGCACCGTGTGGGTGCTGGGAACCGAACC 

GGCCTTCTCTACCAGAGCTACACGCACTCTGCTCTTAACCCCTGAGCCAGTTCTTCAGCCCCACATGCTGAGGCTTAAGAGGTGTGGATTTCTAGTC 

AAAGACTCACTTGGGGTTTGAGGGGGAAACTATAGTTTCCTGGGCTCCTCCGTCTAAGATGCTGACCCAAAGGGTCTAAGGTCTGAGAGTCTGCAAG 

AGAACAGAAGC CCC GGGC AATGT CCTGACTGTGAGAT CCGGACTG TGAGCT C AT CAGGTGCTTCCTTCCCCCTTAGGTGTGCTTCTGCCAAGATGAG 

GTCACCGATGACTACATCGGCGAGAATACCACGGTGGACTACACCCTGTACGAGTCGGTGTGCTTCAAGAAGGATGTGCGGAACTTTAAGGCCTGGT 

TCCTGCCTCTCATGTATTCTGTCATCTGCTTCGTGGGCCTGCTCGGCAACGGGCTGGTGATACTGACGTACATCTATTTCAAGAGGCTCAAGACCAT 

GACGGATACCTACCTGCTCAACCTGGCCGTGGC^GACATCCTTTTCCTCCTGATTCTTCCCTTCTGGGCCTACAGCGAAGCCAAGTCCTGGATCTTT 

GGCGTCTACCTGTGTAAGGGCATCTTTGGCATCTATAAGTTAAGCTTCTTCAGCGGGATGCTGCTGCTCCTATGCATC^GCATTGACCGCTACGTAG 

CCATCGTCCAGGCCGTGTCGGCTCATCGCCACCGCGCCCGCGTGCTTCTCATCAGCAAGCTGTCCTGTGTGGGCATCTGGATGCTGGCCCTCTTCCT 

CTCCATCCCGGAGCTGCTCTACAGCGGCCTCCAGAAGAACAGCGGCGAGGACACGCTGAGATGCTCACTGGTCAGTGCCCAAGTGGAGGCCTTGATC 

ACCATCCAAGTGGCCCAGATGGTTTTTGGGTTCCTAGTGCCTATGCTGGCTATGAGTTTCTGCTACCTCATTATCATCCGTACCTTGCTCCAGGCAC 

GCAACTTTGAGCGGAACAAGGCCATCAAGGTGATCATTGCCGTGGTGGTAGTCTTCATAGTCTTCCAGCTGCCCTACAATGGGGGTGGTCCTGGCTC
AGACGGTGGCCAACTTCAACATCACCAATAGCAGCTGCGAAACCAGCAAGCAGCTCAACATTGCCTATGACGTCACCTACAGCCTGGCCTCCGTCCG
CTGCTGCGTCAACCCTTTCTTGTATGCCTTCATCGGCGTCAAGTTCCGCAGCGACCTC^ 

ACGGCTCCGGCACTGGTCTTCCTGCCGGCATGTACGGAACGCGTCGGTGAGCATGGAGGCGGAGACCACCACAACCTTCTCCCCGTAGGGGGC 

CTGCCCGGACTACAAGGACCTCTCCCAGGAGCCTTAATGTGGTGCACACATGC^ 

CTGGCCAGTCAGGTTGACATGAGGACCTAAGAAACTGCTTAACCCCATCCCACTT^^ 

TTAACACTCAAG CCAAGA CAGCTAT CCC CAAAACGAC AGC C^VAAAGTG AAAGTGAGAGGCTG CCACACTTT CCGGAGTG AGGGATGTGGG GCC AGTG 
AAC A CC CTGGTTGAGTAGT CTTC GGAGGC CTCTG AATGAACCTGCTTCTAGCTT AGAGAGAT GTC CCGG AG ATTCAAGG ACAGAGCTTATCTCCACA 
CTTAGCAAGCAAGCAAGAGATGACAGTCTCTCTAAATGCTCfCCAC^ 

GGCCACACCATCAGCCTGAAGGCAGTCCCAGCTGGGGCTCTGGATGGCAATGACAAGTAGTTGGGTCTCTATGATGGGAATAAAAAGGTAGGGGAAA 
GGTGACAGAAAGGAGAGAAGGTGACCCTGCTGGCTGACAGAGGCCAGCAAGCTACTTCTTTGTTCTCTGTCAGCCACTGATACCTTTCCTCATGTTC 
TGCTTTTGATTCAT ATA TCTTTTATGAAGAAA CAAATAAAAAAAAATT TTC CCTCGAGG AAAC AA CTTGGAAAG AAGGG AGG AAATTCCTTGGTT AA 
ATGGCGAGTGAGTGAGTTGTCCCCCCATTCCACCTGAGAGTCCTGCGCCCCACACCCCTCCGCCCCAGCCTTCCTTTCTCAGCACTCAGAACCATGA 
GGT CAC AGAGCC TCT CCGGAGATGC AAA CCCAGG AGGCTG CAAGAGGTGGAAAAAGAGCGAAGACAGGATGTCTGTCTTGCACATACA CCTGC AGAG 
CCCGTGGGATGGGA C CTAGGC CC TT ACAGGCAGCAGT ATACAT AGGAG AGC A CTCTG AGT ATGGAAAG ACG AGGTTGTCCCAGGAACT CCTGTGTCT 
GGGAACTCAGAGGAACCCCGGGTTGGGGGGAAATGGACACCTGC^TCCTGTTGTGTTTGCTGTAACAGGATAGCCCGGAAACAGCCCTAACTCTTGT 
CAGGCCCCTTCAGACTTGCGTACATTCCTCTAAGAACTCCAGTTTGGTGGGAAGCTCAGACAGAGGTCATGACATTCTAGAGTAAGGGAACTAGGGA 
CTAGTCAGATGCCACAGGCTGGATGGGACTCTGCAGGGTGTTTGAGAGCTAGGACCACTCGGTCCCCTCCTTTCTACCCACTAGTAACAGAAGTTAC 
TGTTACCAGCCTGACCCCTAAGTGCCTCTGTAGTGCGAGTCTCAGTCCATCCCCGTTTCCTGTAACAAAACACTGCGGTCTGCGGAACTTATACACA 
ACAGGAATCTATCTCTCACCATTCTAGAAGTGAGACAACTTCCACTCTAGTATCTCCCAAGTCC^TTTATAAGGCCTTAAATCCATTCATCAGGGTG 
GAGCCTTCACGGCCAGCCAGTTATCTCTCAACCACCCCACCCTCCAATACCATCATCCCCTAGGGTTCAAATTCAAACACAGGCCTATTGGAGGGAC 
ACAC AC ATT CA AAAGAC AGCGGTGTGGAAAGC AG AGGG AG AGGGCCAG ACTCTGTCGCC CCTCACTCCATC CC CACAGG CCAAGTGTC CTTGT CATT 
GTGTCCCAGTGTCTAACCAAAGCCCTAATTTCCCCAAGCCCGAGAGCTTGGTCCAAACTATGATACCTGGGCAAATGTCCAATGGCTTAGCACCCGA 
ATAGCT CTG AG CCTGGC GGT AGG ATG AGGGTGGGGGGTGGGAGAA AAAAGGTGAG AC AT T CAC AC AAAAAC AT AGTTCCAATCCCCCCCCCCCCAAG 
AAAAAAGAAAAATGGAGCCTGTTCCATAACAGTGCTCATTCTCTAAGCTTGTAACCACAAAACACCCATTACCCTCCTCCACAGGATGCATGGGGCT 
AGAAAGACTTTAACACAATTTGTCAGGCGGCCTCCCGGTCTGGAATCAGCATCGTGGGAGGAACGAGGCTGGCTCAAGTGCAGGCAGTGGCTGGTGG 
CGGGTGGCGGGTGGCGGGCAGCTTTCGGGATGCTGGTACTGAATATTAAAAAGGTTCAGTTGCGACTGTGTGACCACAAGCACGCTGCAAGAAGGCG 
ATGCCACTGGCCACGGTCATCCCCAAGGGGAACCTGCCAGAGTGCACGTGGCTTTGTAGATCCTAATGAAGTTACAGATGGGAAAACAGCCCTCGAT 
TCCT AC C AGGG AGG AAATAATGGAGGGATCACGGAAG GGGAAGCT GGG CCAACCACACAAGC ATG CCAC CCTGG CCTCAAATACATTGCG CTAGTAT 
AACTGGACATC TCC AAAT ATTTACAG AAACA ACAGAACCCCCAGCAATGCTAATTCCCCAAACTG CTGTGT CT C AGAAGTTC TATCCT CAG CT TCAC 
GCTGGCCTGCTGCCTAAGACCTGGCCTGATGCACTCTTGTGCCTACCAAGGGTGATCCCCAGGATACAAGAAAGCACCCTGCCTGAAGCCCACCCAG 
GTCTTCTTTCTCATTCTGCCCTTCTCGTCTGCTCCATACAACAGCCCCATGCCTCTAAATCTGTGTCCCTTGCCACGCCATCATTCTGCAATTGCCC 
CAAGCTCCTTCCATTCCACTAACAGTCCCTTGTCCACGCCCCCC^ 

CTCAGATCCCCCCTGGCTGTGACCCTAGGCCAAATGTAAGACATTCCCAGGTCATGAAACTGCCCTTTCTGCTAGAAGTGCCGAACCTTTAATAGGT 
TCCTGGAGACCTGAGGAAGCAGGGGCTGACCCCTGTCGTTCTCTAGACAGAGCAAGGAGCTGGCTTGTCCAAAGAGACACAGCAATGAGGGGAGGTT 
GCTTAAGCTAGCTGTGGAGCCTGAGGGTCTGGCAGAGTTGCCAAAAGGGGGGATGGAGCCTGGCCTACTCTGAATCCCAACCCTGCCGCTCCCCAGC 
TGTGTGACCTTCAGCTGTTGCTCAACTGCTCTGAATCTCAGATTTCTCAATTGCAAAACAAGGGAGATTTGGGTAAACACTTAGCACAGAGCCAGAA 
TGTTGAGCGAGGCAGCAGCAGGGGAGCTCCCACTCCACAGCCAGATGGCCTGGTTCATGTCTCCGCTCCCTTACGGTGCGGTGCCTCGGTTTCCAAA 
CCTGCATAATGGAGTTATTAGTCGTGCCTGCCTCCCAGGATTGAGGTAAGGAAGCAATTCATCTACAAAAGCAAAATATTTAGAAATACCCGGGCAA 
GTGATATCATCTTTTGGGAGATAGTTGAGAAATTATAGATTTAAAACATTGCAGGGATGGAGAGGTGGGTCAATGGTTAGAGCACTGGCrGCTCTTC 
CAGAGGACCCAGGTTCAGTTCCCAGAATCCACATGGCAGCTATTAACGGTTTGCAAATCTAGTGTCAGATAATGAGGTACAGTGCCATGTATGTGTG 
GGGTACATGCCTACATGCAGGCAAGACACTCATACATATAAAATGAAAATAAATCTTTTGTTTTGTTCTGTTTTCATTTTTTGGAAACAGTGTTTCT 
CTGTAGAGTCCTAGCAGTCCTGGAATTGACTCTCTAGAAGAGAATGGCCTCTAACTCAGAGATCCATCTGCCTCTGTCTCCCAAACCCTGGGGTTAA 
AGGTGT AGGACACCACC CACCGG CATAAATAAAT ATT AAAAA ATAAAAT AAAAATGTTT CTTT AAAGAG ACAGAG AGAGAAAAGGCAC AAG AGATGG 
CTCAGTGGGTAGTGTTTGTTGTACATGAGGCCCTGAGTCCAGATTCATAACCCACACTCACAGGCCAGGGTTGGTGTTGCGGACGATGAAGAAACAG 
AGATGC T AG AG CCTCTGGCC AAC CAAGTCTAACAGAAAGT AG AAG CTT CAGGCTTCA CAAAGAGA CCCC CATCT CAATAG AT AAG C AATAAGT AAAT 
GATGGAGAACAGAGGCAGACACCCCAGTGTCTCTCTGCCCTCCACCTGCACACACACGAAGGAGAGA^ 

CAGTTCCTCAGCCCTATGTCTGTTCACACGCAATCCACAACAAACTGGAGGACACCTTAGATGAGAGTCTGGTCTCCCCACAGGGTGGCCAGGAGAG 
ACATTTGAGGCACAAGGAGAAGGAGATCACCTGAGGTCTCTAGGGGCCTCTCTGATACTCAGAATCCTAACGCCCTTTAGGTCCACCCTCAATCCGG 
TCTGCCAGCATCCATCCTTAATACGAACCTCTCTTGCCTGCAGACAGCTCTTGCCTTGATAATGAGTCTTCTTTGAGAACGAGGTACTCACTTGCGG 
GTTTTTTGAGTCAAGGACAGAGTAGGTAAGGAAGAAACAAGCCTATGAGATGGTGGGAATAAAGGACAGAGTCGGAGAGGAGATTCTGAATTCGGAG 
GAAGGAGCAGGGATGGGGGAGCTGTATATTGGAGACGAGACATGGAGAGTATATCCAGTGGGCAGATGAATCCGTGTGTGTGAGTGAGTGAGTGTGT 
GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCCTCCCTCAAGTCTCCTAGTCT CACAGG AAGAAATGATTCC!ATGGGTGTTTGTTCTT 
TTCAAGGGAAAGGTTAGC03GGGTGTGGTGGCGCACGCCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGGCGGATTTCTGAGTTCGAGGCCAGCCT 
GGTCTACAGAGTGAGTTCCAGGACAGCCAGGGCTACAC^GAGAAACCATGTTTCAGGGAGGGGGAAGGAGGAAGAAGGTCA^ 

GTCCCTGAGGACTCTCACTCTCACCCCAACAGGAACCCAGCTTTTCCACACACCAGGCATCCCAGCCAGGGACACAGATGAAGGCACTGTCAACAGA 
AGCACCAGCTAGAACTTCTGTGATCCTGAGGTCACATGGCCCACAGGTTCCTCCCTATTCTGATTTGTATGTGACCCGATCACATTTCCTTGCAGAT 
GTTAGGTTTACCCCTTCCCTTGTGCACTTTGAAAATGGCTACTGCCAGCGGGGGGCGTGGTGCCGCATGCCTTTAATCCCAGCACTCAGAAGGCAGA 
GGCCAGCTGATCTCTGTGATTTTGAGKSCCAGCCTGGCGTAC^ 

CTCTATTTTGTCTACTGCCCACTGATCCCACAAAGTTCCCTGACCAGGCTGTTGTGGCACTGTGTCACACCACAGTGACCCCACCGAGTTCT CATCT 

CCCAGTGTGACCTACGGGTAGTCTGCCCTGAAGTGGATACCCAAGTCCTCTATCACGAGGGCAGGAGAGGGGACTTCTCAATCCATGTCACAGCTTG 

CCTAGGACAGTGTACGTTATGTTTGCTCTAAACAAATATCATTGTGAAGCTGAACAGATGTTGCAGATAAAATGCCCTAAACCCTTTCCCTACGTTT 

C AAAGAAGC TGGGAACT C AGG CGTGCTAGTCT TGG ATGGTTTC TGC AT CTGTGG TCC AGTAAC TGGCACGCG CAATCTCTCTTGC AAAGGCAGAGG T 

GAAT TC TTCTGGCCCCGGGGGTGTTTTC TGC AGCCTGGCT AG CTCCAG ACCTGTAAT TAAGTC rAGTGGTATGCTGGCAAGGATGTGGAGAAAGAGG 

AACACTCCTCCATTGCTGATGGGATTGCAAGCTTGTACAAGCACTCTGGAAGTCAGTCTGGCGGTTCCTCAGAAAATTGGACATCGTATTACCGGAA 

GATCCAGCAATACCTCTGCTGGGCATATACCCAGAAGATGTTCCAACAGATAATAAGGACACATGCTCCACTATGTTCATAGCAGCCTTATTT 

TAGCCAGAAGCTAGAAAGAACCCAGATGTCCCTTAACAGAGGAATGGATACAGAAAATGTGGTACATTTACACAATGGAGTACTACTCAGCTATTAA 

AAACAATTT ATG AAATT CCT AGG CAAAT AGATGG AT CTAGAGGATATCAT CCTG AGTG AGGTAACCC AAT CAC AAAAG AACTCA CTTGAT ATGCACT 

CACTGGTAAGTGGATATTAGCCCAGAAGCTTAGAATACCCGAGATACAATTTGCAAAACACAAGAAAATCAAGAAGAAGGAAGACCAACGTGTGGAT 

ACTTCATTCCTCCTTAGAATAGGGAACAAAATACCCATGGAAGGAGTTACAGAGACAAAGTTTGGAGCTAAGACGAAAGGATGTACCATCCAGAGAG 

TACCCCACCCGGGGATCCATCCCATAATCAGCCACCAAACCCAGACACTATTGCATATTGCCAGAAAGGACCCTGATATAGCTGTCTCTTGTGAGGC 

TAGGTCAGGGCCTGGCAAACACAGAAGTGGATGCTCACAGTCATCTATTGAATGGAACACAGGGCCCCCAATGGAGGAGCTAGAGAAAGTATCCAAG
GAGCTGAAGGGGTCTGCAACCCTATAGGTGGAACAACAATATGAACTAACCAGTACCCCCCAGAGCTCGTGTCTCTAGCTGCATATGTAGCAGAAGA
TGGCCTAGTCGGCCATCATTGGGAAGAGAGGCCCCTTGGTCTTTGCAAACCTTGGTACAGGGGAATGCCAGGGCCAGGAAGCGGGAGTGGGTGGGTT 
GGGGAGCAGGGCGGGAGGAGGGTATAGGGAACTTTCGGGATAGCATTTGAAATGTAAATAAAGAAAATATCTAATAAAAAAGTCTAGTGGTGGAGAA 
TCACCCAAGTGGGAACCATCTTTGCTCCTGCACTACAGCTCTCACAGGACTGGTAGACATGCCATCTTGTTCGAGCTACCACCTCTGTCTACCACCT 
ACCCTGC CCGTCATGCTCTGGCT CT GCTGGGTTGGCCGTTGA CAAT CT CTGGACT AG AAAGC CTGGTC AGGGAATG AACGCG ATG AGAACTCG TGCC 
TGCTCTAACCACAGTCTCTCTCT CC ATCACCCCAGCAG AGCCAACCCTAAG CAGA CT ACGAA CTAGAAGAGCT TCCGAGCTTTAGGTTAAACAC AGC 
AGAGCAGGGGAAACACAGGAATCACGAGGGATCGGGAGGGAGGGGAGGTTTTGAGGAAGGGAGGAAGGAAGGATGTGCTGACATTTGCTTAGTGTCC 
ATTCTAGGGATCCACCTGGCACTGAATCCACAAGTGAGGAAGCCAAGGGTGAGAGCAGGCTGGT^ 

TGCTAAGATTATATTCCCTACTTCATTTATACTGTGTATTGAAACCTTACCTTATTGAAGTCTGTCCTGTCTGTCTGTCTGTCTGTCTGTCTCTCTC 

TCTCTCTGTTTTTTTTTTTTTTTTTTTTTTTCGAGACAGGGTTTCTCTGTGTAG^ 

GAACTCAAAAATCTGTCTGCCTCTGCCTCCCTCCCAAGTGCTGGGATTAAAGGCGTGC^ 

TATTGAAGGCCCATTTTACAGATGGGGAAGTAAAGCCCAGAGAGGGGAAAGTGCCAAAGCCATAAGGCAATAAAACTACTAAAGGATGTCAGCCAGT 
TCTAGCTGGCCTCCAAAGGCCAATTTGCATCTGCCCTCTTGCTGCACCATT^ 

AGG AGCTAG GC ACTGTGCTGGGT GTACAT AGCAT CTCTCAAAGCACGCTGG ATGC AGAGTTC TGT CCTGGG AGG AGGCACGGT CG AATGAGTACCGC 
CTTCAGGGAGCAGGGAAACAGCCCGGCTGTGATAGGCTGTCCACTCCTTCTTC^ 

CTCTGCCATGG AGCCACTGGGCAACATGAACAAGT CACCTGT CTGCTGC AG CTCCTC TAG AGC AAAAG AAAGT TCC TAG ATGG AGTGAGTTTT ACAC 
CCATCCCCTGCGGTTCTGAGGTGCCTTGGGCATATTGACTGGGTGGGGTCAGGGAGCCCACATC^ 

GTGCCTGGGTGAGGGGCTGAGGAAGCTACTCCACCCCCAGCTGTGGTTTCTGGTGGGGCCACTGGTGGCAGCAGGTGAGGGGAAGTATCTGTCACTC 
TTCCAACAAAGGCAATCAAACAAAAGCCTCCTGGGGCATTGTGGGAACTGCAGTAAAGGCGCCAAAAATAAAACACAACTTTCCTTTGATAGGCACT 
GGTAGCTGCCTGGGGGTTGTGACAAAAGTGTTCAGATTTCACACCTCAGTTCATACCCCACACACACACACACACACACACACACACACACACACAC 
ACCAGTCTCACAAACAAGTCATATAATAGGCAGTCAATGATCCCAGAAGAAACATCTCCGTGGGCTGTGACACATGTCACAAGAACCACATATGTGT 
CAAATGGCTTGTCCTTTCCATTTTGAAATGTTCAATATTTTAAAAGAACTTTTAAATATCCTATTT 

TATGTCTGTGTACCCTACCATTCACAAAGGCCAGATTAGATTCCCTGGAACTGGGATTATAGACAGTTGTAAACTGCCAGGTGTGTGCCTGGAATCA 
AACTTGGGTCCTCTGAAAGAGCAATCAGTGCTTTTAAAATATATATACATATGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 
TGTGTGTAAAATTTATTTTATGTATATGAGTACACTGTCGCTATCTTCAGGCATACCAGAAGAGGGCTTTGGATCCTATAACAGAGGGTTGTGAGCC 
A<^GTATAATTGCTAGGAATTGAACTCAAGGCCTCTGGAAGAACAGCCAGTGCTCTTGACCACTGAGCCATCTCTCCACCCCTAAAGTTTTCAGTAT 
GTTTTAAGCAAGGAGCTGAACGGGGTATTTTGTTCTTTGTCCTTCAAGCTACAGAACCAAACTTGCTAGCATTCATTCATAAATATTCATATGGGCA 
GTATGCCTAAAAATTATAAAATACTGATAGGCAAAAGTGCATACATTGCCCAC^ 

CACACACACTGGTACTACAGGGGCTCCCCTGTGTACACACACTGCAGTACAACTGCTTCCCCCATAAGGGACATTGACACCCACTCGAAAGGCCCAG 

AGAGAGCTCAACCACAACCCAGTGACATGAATGGATTGACTTAGCCAGATCAAGGCACCTCGATGGCACTGAAAGGAAGGGCTTAATCTCCTTTACT 

GTATCTTCAACTGTCACAGCTGGCCTCTCTTTCGGGCTATGGGCTCATCCACCAACCCTTGCAGCGGAGACTAAGGAAAAGGAGACCTTGGGGAGTA 

GAGGAATCCTTGGGAGGAGAGGTTGGGAGGAGGAGAAGAACATTAGCCAAAAACAGCAGGCTGGGCGACAACGGAGGCGCCTAGTGAACTGGCCTGA 

GTGGTGGTTCTGAAAGAGCCTGGGGGAGGCGGGGGCAGGTCTTGAACCCAGACAAGTCCGAGCTCCTGCTCTCACATGAACAGCCAATCCCTTGCCC 

CACTCCTCTTTCTAGCTAGGTGGCCTTGGGCACATGGCTGAACTTGCTTGGCCTCAGTCTCCATCTCCACGGAATGAGAAAGTATTGTCTGCATGGC 

AGGGTTGTGTGGTGCATGTGGAAGTTGCGGCCTCATCTTCTGGGCTCTCCCAACATTCAACAGTTTGCCTTCTTTCCCGGCCTTGCAATCCGACTCT 

TGCTTCAGAAACTCCAGACCCCCTCATTAACGGCAGACCTTCAGAATGTGTTTATTTCTTGTCTCCCCACCCCCACCCCACTTCCCAGCCAGATGCA 

AGCATGGGAGGTGGCCGTAGTTCAAGGCCCACCCTCCCTTCGGGGACTTTCAACGTTCCTCAGTTCCTTTTGATCCCATAGCTCCAGCCACAAGCCA 

AACAGCCCCC^LAGGCCTCCAAAGTTTTCTGCCTGTGCTCTCTACCCCTAACCCCAGTGCTGCCTGTAAGAAACCGTTCCAGGCTCTGCCCAGAAGCC 

CCTCCATCCTGTTAGGTGATACCAGATGCTCAAAGAGTGACCCAGAAAAACAATTCTTTTCAGCTCATTGTGATGGTGCACATCTGTAACCCCACCA 

CGCTGAGGAGG CTAAGGG AG AAAGAGAG AATTCTCGG CCAGCCTGGGCTACAGATCA AGACCCTGT CT CAGAAAAA TTC CAAC AAAAT TG CTGGGG A 

AATATTGCTCAACCATCGGAATGCTTACATAGCATGCATGAGTGCAGNNmW 

NITONNNNNNNNNNNNITONIWNNNI^NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNIOTNNNNNNNNNNNNNNNNN^ 

MOUSE SEQUENCE - mRNA 

GGACATGGACTGCTATCTGCGTCGCCTCAAACAGGAGCTGATGTCCATGAAGGAGGTGGGGGATGGCTTGCAGGATCAGATGAACTGCATGATGGGC
GCAGACTGGGCTAGCTGGAGAGAGACAAGAACCAAAAGCACAGCCTTCCTGTGTGATTTCTACAGCCCCCAGAGCACCATGGACCCAGGGAAACCCA 
GGAAAAACGTGCTGGTGGTGGCTCTCCTTGTCATTTTCCAGGTGTGCTTCTGCCAAGATGAGGTCACCGATGACTACATCGGCGAGAATACCACGGT 
GGACTACACCCTGTACGAGTCGGTGTGCTTCAAGAAGGATGTGCGGAACTTTAAGGCCTGGTTCCTGCCTCTCATGTATTCTGTCATCTGCTTCGTG
GGCCTGCTCGGCAACGGGCTGGTGATACTGACGTACATCTATTTCAAGAGGCTCAAGACCATGACGGATACCTACCTGCTCAACCTGGCCGTGGCAG
ACATCCTTTTCCTCCTAATTCTTCCCTTCTGGGCCTACAGCGAAGCCAAGTCCTGGATCTTTGGCGTCTACCTGTGTAAGGGCATCTTTGGCATCTA 
TAAGTTAAGCTTCTTCAGCGGGATGCTGCTGCTCCTATGCATCAGCATTGACCGCTACGTAGCCATCGTCCAGGCCGTGTCGCGTCATCGCCACCGC
GCCCGCGTGCTTCTCATCAGCAAGCTGTCCTGTGTGGGCATCTGGATGCTGGCCCTCTTCCTCTCCATCCCGGAGCTGCTCTACAGCGGCCTCCAGA 

agaacagcggcgaggacacgctgagatgctcactggtcagtgcccaagtggaggccttgatcaccatccaagtggcccagatggtttttgggttcct 
agtgcctatgctggctatgagtttctgctacctcattatcatccgtaccttgctccaggcacgcaactttgagcggaacaaggccatcaaggtgatc
attgccgtggtggtagtcttcatagtcttccagctgccctacaatggggtggtcctggctcagacggtggccaacttcaacatcaccaatagcagct 
gcgaaaccagcaagcagctcaacattgcctatgacgtcacctacagcctggcctccgtccgctgctgcgtcaaccctttcttgtatgcct
cgtcaagttccgcagcgacctcttcaagctcttcaaggacttgggctgcctcagcc^^

aacgcgtcggtgagcatggaggcgg agaccaccacaacct tc tccc cgtaggggg ct cc cctgcc cggactac aaggac ctct cc c aggagccttaa 

tgtggtgc^cacatgcacagactctccatccaccgaattgctgctgagggaagagcaattctggccagtcaggttgacatgaggacctaagaaact 

cttaaccccatcccacttataactacctcaaccaaagctgtaaaagatatggctgagaagttaaca 

agc c aaaagtg aaagtgagaggctccacactttccgg agtgaggg atgtggggc cagtgaacacc ctggttgagtagtcttcggaggc ct ctg aatg 

aac ctgctt ct agc ttagag agatgtcc cgg agattc aag ac agagcttatctccac acttag caagcaagcaagagatgacagtctctctaaatg c 

tcccacagagcacccctgcccctcccttctgcctctccaccgcctttcctgaggtccaggccacaccatgacgctgaggcagtcccagctggggctc 

tggatggcaatgacaagtagttgggtctctatgatgggaataaaaaggtaggggaaaggtgacaggaaggagagaaggtgaccctgctggctga 

aggccagcaagctacttctttgttctctgtcagccagccactgatacctttcctcatgttctgcttttgattcatatatcttttatgaagaaacaaa 

taaaaaaaaaattttccctcgaggaaacaacttgg 

MOUSE SEQUENCE - CODING

atggacccagggaaacccaggaaaaacgtgctggtggtggctctccttgtcattttccaggtgtgcttctgccaagatgaggtcaccgatgactaca 
tcggcgagaataccacggtggactacaccctgtacgagtcggtgtgcttcaagaaggatgtgcggaactttaaggcctggttcctgcctctcatgta 
TTCTGTCATCTGCTTCGTGGGCCTGCTCGGCAACGGGCTGGTGATACTGACGTACATCTATTTCAAGAGGCTCAAGACCATGACGGATACCTACCTG

CTCAACCTGGCCGTGGCAGACATCCTTTTCCTCCTAATTCTTCCCTTCTGGGCCTACAGCGAAGCCAAGTCCTGGATCTTTGGCGTCTACCTGTGTA 

AGGGCATCTTTGGCATCTATAAGTTAAGCTTCTTCAGCGGGATGCTGCTGCTCCTATGCATCAGCATTGACCGCTACGTAGCCATCGTCCAGGCCGT 

GTCGCGTCATCGCCACCGCGCCCGCGTGCTTCTCATCAGCAAGCTGTCCTGTGTGGGCATCTGGATGCTGGCCCTCTTCCTCTCCATCCCGGAGCTG 

CTCTACAGCGGCCTCCAGAAGAACAGCGGCGAGGACACGCTGAGATGCTCACTGGTCAGTGCCCAAGTGGAGGCCTTGATCACC^

AGATGGTTTTTGGGTTCCTAGTGCCTATGCTGGCTATGAGTTTCTGCTACCTCATTATCATCCGTACCTTGCTCCAGGCACGCAACTTTGAGCGGAA 

CAAGGCCATCAAGGTGATCATTGCCGTGGTGGTAGTCTTCATAGTCTTCCAGCTGCCCTAC^

AACATCACCAATAGCAGCTGCGAAACCAGCAAGCAGCTCAACATTGCCTATGACGTCACCTACAGCCTGGCCTCCGTCCGCTGCTGCGTCAACCCTT
TCTTGTATGCCTTCATCGGCGTCAAGTTCCGCAGCGACCTCTTCAAGCTCTTCAAGGACTTGGGCTGCCTCAGCCAGGAACGGCTCCGGCACTGGTC
TTCCTGCCGGCATGTACGGAACGCGTCGGTGAGCATGGAGGCGGAGACCACCACAACCTTCTCCCCGTAG

HUMAN SEQUENCE - GENOMIC 

AATATATGAAACAAATATTGACAGAATTGTCAAAGAAATAAGAGAAACACATAAAGCAAG 

TTAAATATCCTCCTTTCAATAATAGATACAACAACTAAGCAGTTGATCAACAAGAAAACAGAAGATT^ 

CGTCTATCTATAAAACACCACCCAACAACAGCAGAATACATATTCTTCTCAGATATACATAGAACATTC^ 

AAACAAGTCTCAAAAAATGTAAAAGAATTGAGATCAGACAAAGTCTGTTCTCTGACCACAACCAGTAACAGAAGGAAATTTGAAGAATCCATAAGTA 
TGTGGAAATGAATCAAGGAACTCAAGGGAAATTAGAAAATACTTTGAAATGAATGAAAATGAAAACACAGCATACCAAAACTTATGAGATGCAGCTA 
AAATAGTGCTTACAGAGAAATTAATAGCTATTAATGCCTGTATTTTTTAAAAGAAGAAAGATACCAAATTAAAAAAAAACTTTTCACTTTAAGAAAA 
AGAATAGTGA&CCAAGCCCAAATCAAGCAGAAGGAAGGAAATAATAAAGATTAGAATGGAAAAAA^ 

ATT AAC AAACCCAAAAGTTGT TATATCAAAAAGA TTGATAAG TTTGATAAAC ATTTAACT AG ACTT AC C CT AAT ATC AA AAC CAC ATACAGAT ATC A 
CAAGAAAAGTACAGACCAAT ATCTCTCATAAGAC ACAT ATAAG ATAGATGC ACAATTT CT TAACAAAAT AT T AAC ATGC C AAATC C AG AAACATTAC 
AAAAAAAATTGTATACCATGACCAAGTGATTTATTCCAGGAATTCAAGGTTGGTTTAATATAAAAAAATCAATTCATATAATACACCATATTT 
GAATAAAGGGCAAAAACACTCAATTGTCTCAGTGGACACAGGAAAGGCCTCCAACTCCCTCC^ 

ATGGCTGCATAATATTCCATGATGTATATGTAGCACATTTTCTTTAACCAATCTATCATTGATGGGCATTTAGGTTGATACCATGTCTTTACTATTG 
TGAATGGTGCTGCAATAAACATACACGTGCATGTGTCTTTATAATAGAACAATTTATATTCCTTTGGGTATATACCCAGGAATGGGATTGCTGGGTC 
AAATGGTATTTCTGTCTCTAGGTCTTTGAGGAATTCTATACTATCTTCCACAATGGTTGAACTAATTTACACTCCCACCAACAGTGTAAAAGCATTC 
CTTTTTCTCCACATCACCAGCATCTGTTATTTTTTGAATTTTTGATAATAGCCATTCTGACTGGTATAAGAGGGTCTCTCACTATGGTTTTGATTTG 
CATTTCTCTAATGATCAGTGTGTTGAGCTTTTCAAAATATGATTCTGGGTCACATGTATGTCTTCTTTTGTGAAGTGTCTTTTCATGTCCTTGGCCC 
ACTTTTTAATTTTTTTTTCTTGTAAATTTGTTTAAGTTTTTATAGATGCTGGATGTTAGACTTTCGTCAGATGCATGGTTTGCAAAACTTTTTTTCC 
CACTGCGTAGGTTGTCTGTTCACTCTGTTGATAGTTTCTTTTGCTGTACAGAAGCTCTTTAGTTTAAGTAGATCCCTTAATTTACTTATTTGTCAAT 
TTTTGTTTTTGTTACAACTGCTTTTGGCATCTTTGTCATGAAATCTTGGCCCGTGCCTTGGTCCTGAATGGTATTGCCTAGGTTTTCTTCTAGGGTT 
TTTATAGTTTTTAGTTTTACATTTAAGTCTTTAATCCATCTTGAGTTGATTTTTGTATATGCTGTAAGGAAGGGGTCCAGTTCCAATTTTCTACATA 
TGGCTAGCCAGTTATCCCAGCACCATTTATTAAATAAGGAATCCTTTCGCTATTGCTTGTTTTTGTCACGTTTGTTGAAGATCACGTAGTTGTAGGA 
GTGCAGTCTTATTTCTGGGTTCTCTATTCTGTTCCATAAGTCTGTCTGTTTTTGTATCAGTACCATGCTGTTTTGGTTACTGTAGCCATGCAGTATA 
GGTTGAAGTCAGGTAGCATGAGGCCTCCAGCTTTGTTCTTTTTGCTTAGGGTTGGCTTGGCTATTTGGGCTCTTTTTTGGTTCCATGTGAATTTTTA 
AAT AGTTTTTTCTAGTTCTGTGAAGAATGTCAAT GGT AGT TTAATGGG AAT AGC ATTGAATCTAT AAATTG CTTTGGGCAGT ATTGTC ATTTC CAC A 
ATACTGATTCTTCCTATCCATGAACACGGAATGTTTTTCCATTTGTTTCCATTATTTCTAATTTCTTTGAGCAGTGGTTTGTAGTTCTCCTTGTAGA 
TGTCCTTCACTTCCCTTGTTAGCTGTATTCCTAGGTATTTTATTCTTTTTGTGGCAATTGTGAATTGGAGTTCATGCGTGCTTTGGCT CTCTGCTTG 
CCTGTTTTTGGTGTATAGGAATGCTAGCAATTTTTGCACACTGATTTTGTATCCTGAGAATTTGCTGAACTTGCTTATCAGCTGAAGAAACTTTTCG 
GCTGAGACAGTGGGGTTTTCTAGATATAGGATCATGTCATCTGCAAACAGGAATAGTTTCACTTCCTCTCTTCCTATTTGAGTACACTTTATTCCTT 
TCT CTTCCCTG AAT ATG ACTG CC CTGGCC AG AACT TC C AATA CTGTGT TGA ATAGGAGTG ATG AG AGAGAG CAACCTTGTCT TGT CCTGATTTTCAA 
GGGGCTTACTTCCAGCTTTTGCCCATTCAGTATAATATTGTCTGTGAGTCTGTCATATATGGCTCTTATTATTTTGAGGTATATTCCTTCAACACCT 
ACT TTAT TAAG AGTTTTTAAC ATGAAGGGGT ATTGAATTTTATCAAAGGCCT TTTCTGCATCTATGGA GATAAT CATGTGGTTTTTGT CTTTAGTTC 
TGTTTATGTGATGAGTCACATTTATTGATTTGCATATGTTGAACCAACCTTGCATCCCAGGGGATGATGAAGCCAAATTTGATCATGGTGGATAAAC 
TTTTTGATGTGCTGCTGGATTCGGTTTGCCAGTATTTTGTTGAGGATTTTTGCTCAATGTTCATCAAGGATAATGGCCTGAAGTTTTCTTTTTTTGT 
TATATTTCTGCCAGGTTTTGGTATCAGGATGATGCTGGCCTCACAGAATGAGTTAGGA^ 

AGGAATGGTACTCTTTTCTGTACATCTGGTGGAATTCAGCTGTGAATCCATCTGGTCCTGGACTTTTTTTGGTTGGTAGGCTGTTTATTACTGCCTC 
AGTTTCAGAACTCGTTTTCAGTCTGTTCTGGGATTCAATTTCTTCTTGGTTTGGTCTTGGGAGGGTGTATGTGTCCAGAAATGTATCCATTTCTTCT 
AGATTTTCTATGCACATAGAGGTGTTTATGATATTCTCTGGTGGTTGTTTGTATTTCTGTGAGGTCAGTGGTAATATCTCCCTTATCATTTCTGATT 
GTATTTATTTGAGTTGTCTCTTCTTCTTCTTTATTAGTCTAACTAACAGTCTATCTATTTTATTAACTTTTTTGAAAAACCAGCCrGGATTCATTGA 
TATTTTGAATGGTTTTTTTTTTATGTCTCTATCTCCn'TTGATTCAGCTCrGATTTTGGTCATTTCTTGTCrTCTGCTAGCTTTGGGACTTCTTTGCC 
CTTGGTTCTCTAGTTCTTTTAGTTGTGATGTTAGGTTGTTAACTTGAGATCTTTCTAGCTTTTTGATGTGGGCATTTAGTGCTATAAATTTACTCCT 
TAATACTGCCTTAGCTGTGTCCCAGAGATTCTGGTATGTTGTATCTTTGTTCTCATTAGTTTCAAAGAACTGCTTGATTTCTCCCTTAATTTCATGA 
TTTACCCAAAAGTCCTTCAGGAGCAGGTTGTTC^TTTCCATGTAGTTGTGTTGTTTTGAGTGAATTTCTTAGTCT 

TGTTGCCCAAGAGACTGTTTGTTATGATTTCAGTTCCTTTGTATTTGCTGAGGAGTATTTTACTTCCGATTATGTGATCAATTTTAGAGTAAGTGCC 
GTG TGGTGATTAGAAGAATGTATATTCTGTT GTT TGGGAATGGAG AGTTCTATAG AG ATCTATCAGGT CCATATATT CC AGAGCT GAGTT CAGGTCC 
TGAATATCTTTGTT AATTTTCTC TCT CAATG ATCTGT CTAATGGGGTGTTAAAACCT CCC ACTATTATTGTGTGGGAGTC TAAGT CTATTTGAAGGT 
CTTTTTGAGCCTATGTCTGTCTTTGCATGTGAGATGGATCTCTTGCAGATAGCATACCAATGGGTCTTGGTC 

CTTTTAATTGGGGCATTTAGCCCATTTACACTTAAGATTAGTATTGTTATGTGTGGATTCAAT CCTGT CAT CATGATGC TAG CTGGTT ATTTT GCAG 
ACTTGTTTATGCGGTTGCTTCATAGTGTCACTGGTCTGTGTACTTCAGTGTATTCTTGTAGTGGCTGATAACAGTTTTTCTTTTCCATATTTAGTGC 
TTCCTTCAGGAGTTCTTGTAAGGCAGGTCTGGTGGTAATGAATTCCCTCAACATTTC 

CTTAGTTTGGCTGGATATGAAATTCTGTATTGGAATTTCTTTCCTTTAAGAATGTTGAATATTGGCCCCCMTCTCCTCTGGCTTGTAGGGTTTCCA 
CTGAGAGGTCTGCTGTTAGTCTGATGGGCTTCCCTTTGTAGGTGACCT^ 

GGAGAATCTGATGATC^TGTGTCCTGGGAATGATCTTCTCATGGAGTATCTCACTGGGGTTCTCTGCATTTCCTGAATTTGAATGTTTGCTTGTCTA 
GCT AGGTTGGGG AAGTTCTC CTGGAAAATATCCT GAAATATGTTT TCCAAAT TGT TTCC ACTCTC CCC ATCTCTTT CAGATACCC CAATCAGT CAT A 
GATTCCGTCTCTTTACATAATCCCATATTTCTTGGAGGTTTTATTCATTCCT 

AGACAGTCTTCAAGCTCTGAGATTCTTTCAGAGGAAGAATCTTTCAGAGGAAGACTAAGCAGAACTTGGCCTATTCTTCTATGAATACTTCTGATTG 
CATTGTGAAGTTCTTGTAGTGTGTTTTTCAGCTCGATCAGGTCAGTTATGTTCTTCTCTATAATATCTATTTTGGCTGTCAGCTCCTGCATTGTTTT 
ATCATGATTTTTAGCTTCCTTGCATTGGGTTACAACTTGCTTCTTTAGCTCAGCGACGTTCATTTTTATCCACATTCTGAAGTCTACTTCTTTCATT 
GCAGTGATCTTAGCCTTACCCCAGTTCCAAACCCTTGCTGGAGAGATGCTGCAGTCATTTGGAAGGAAGAAGGCACTTTGGCTTTTTGAGTTTTTAG 
TGTTTTTATGCTGATTCTTTCTCATCTTTGTGGGCTTATCTACCTTTGATTTTTGAGGTTGCTGACCTTTGGATGGGGTTTTGGGGTTTCTTTTGTT 
TTTCTTTTAACAATCTGGCCACTTTTCCATAAAGCTGCTGTGGCTTGCTGAGGGTCCCCTCCAGACTCTAGTCACTTTGGATTTTCCAGTACCTGGA
GGTATCACCAGTGAAGGCTGTGAAACAGCAAAAATGGCAACCTGCCCCTTCCCCTGGGAGCTC(^TCCCAGGGAGGTATGAACCTATTGCTGGCCTA 
AACACACCTGTAGGAGATGGCCGGAGACCCCATTTGGGAGGTCTCACCCAGTCAGGAGGAACAGGATTGGGGAGCCACTGAAAGAAGCAATCTAACC 
ACACTTCCGAAGAGAAACTATGCTATGCTGGGCACCACTTGCATCCCCAGTTGGCTTGGGATCTCCAAAGCCTGGAGAC 

AAAACAGCAAAGATGGCAGCCTGCCCCTCCCCCGAGGAGCTCTGTCCCAGGAAGTCTTCAAATCTCTGTCAGCCAGAGAACATCGACAGGAGTGGCT 
GGAGGCC^CAGTTGGGAGGTCCTGCCCAGTGAGAAGGAATGGATTAGGGACC 
CACGTTGGGAGATCCTTTC!AGCCCCCAGTTGGTTTGAGCTCCCCAAAGCTGAAAGG 
TCTCCACCATCCAGGCACCCTGTCCCAGGGAGAGATCAAAATTTTATGGGCTGTAGAAAAT^ 

CACCCAGTGAGGAGGAATGGATAGGAGTCCTGCTTAAAAAAACAGTCTGGCCACGTTTTGGCAGAGCAGCTGTGCTGTGCTGGGGGATCCTTTCCAC 
ACCTGGTGGGTTTGGACTCTCCAAAGTCTACAGGCTGGAACAGCTGAGTCGTCCAAAC^ 

CAGAACTCCACCTTGTCTCAGGTAGGTGCTACACTGTTGCCCGTGGCTGGCTGGAATTCCAAGCCAGTTGGTCTTATCTGGTGAGGTGCTGTGGAAG 
TGAGGCCCACAGACCAATGCTGCTCAGCTCCCTGGATTCAGCCTCCTTCCTAGGGGTATGTACAGACCTCCTGCCTTGCCTGAATGGCGGTCACCTT 
TGCTGGGGTTCCGGAGCCTGAGTATGTAAAGCTCCTGGGTGTCTGTGCATGCCTGAGC^ 

GACCCAAAGGCCTGGTGGAGTGGGTTCACAGGGGATCTCCCGATCCAAAGGTTGCAAAGATCGGTGGGAGAAGCATCATTCCCCAGGTTTGCACATT 

CACTCACCACTTCCCTGGTGGGGGGAGGGGAGGGGGTTTCCTTTGGGTCTGTGTTGCTCCTATGTAGGCCATTGCCCTGC 

TCTGTGGGTCGAGCTGATTCCTTAATCAGTCCC^TGCAAGTATCTGGATGTTTCAG 

GAG AGC C ATGCCCC ATACCTGCT TCT AGTTGGCCATC TTTGAAAATAAAATTAAGAAAACAATTTCATTTA CAAGC A CATCAAAAAGAGT AAAATAC 
TTAAGA AAAAAT TTAAC AAAGAAGTG AAAGACTTGCAGCC TAAAAAGT ATAAAGAAATTAAGGAAGACCTAAAT AAATG AAAAAAAT C CC ATGT TC A 
TGG ACC AAAAG ACTTTGTTAGAATGGCAATA CTC CCTAAATTG ATT TAC AG ATTC AATG AAATTCCTATT AGAATC C CAGCTGGTT TC AT TGC AGAA 
ATTGATAACCTAAAATTCATATGGAAAAGAGAGAGATGCAGATTAGTCAAACAATGTTTAAAAAGACAGATAAAGTTGGAGCAGTCACAACTCCCAA 
TTT CAAAATTTG CT ATAAAG CTACAGTA ATCG AG ACAGTGTGGTACTGT CAT AGGAC AG ACACAG AGATTG AT AAAATAGAATTGATAGT C TAGAAA 
T AA ACC ATT AC ATTTATGATGAATTG ATT TCC AAC AG AGGTG CTAAGACAG TTCAATGGTGGAAAAAATAG TCT TTT CAACAAATGATGCTGGGAAA 
ACT AGATAT CC ATATAAAAAATG AAGCTGGATCCCTACCT CA CACCAT ATGCAAC AATT AGCT CAAAATGG AT C AG ATCT AAATGT TGCAGCT AAAA 

CTATAAAAGTCTTAAAAGAACATATACGAATAAATCTTTGTATACTTGAATTATGCAATGGTTTCTTAGATATGGCACCAAAGACACAATTGACAAA

AGAAAAATAAATCAATTGAACCTCCACATTTAAAACTTTX3CGCTTCTTTTGCC^

GACGCTATCAAGAAAGTGAAAAGATGATCCATAGTAAGGGAGAAAATATTTACAGATATGTATCAGATAAGGGATTGTAACCAGAATATATAAAGAA
GTCTTACAACTCAATAATAAAAAATAAATAACCCAATTTTTTAAATGGGCAAAGGATTTGAATAGAAATTTCTCAAAAAATAATACAAATGGTCAAT
AAGAGCATGAAAAGATGCTCAGCATAGGTAGTCATTAGGAAAACGCATATCAAAACTACAAAGAGGCCGGGC^TGGTGGCT

GCACTTTGGGAGGCCGAGGCTGGCAGATCACCTTAGGTTAGGAGTTTGAGACCAGCTTGGCCAACATGGTGAAACCCCGTTTCTACTAAAAATACAA
AAATTAGCCAGGCATGGTGGCACGTGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAGGTGGAGACTGCAGTA
AGCCAAGATCGTGCCACTGCACTCCAGCCTGTGCAACAGAGTAAGACTCCATCTCCAAAAAAAAAAAAGAAAAAAGAAAAAAAACACA
CACTTTACAGCCACTAGGATGGCTAGAATCAAAAGAATGGACAATAGTAAGTTTTGTTGACAATGTGGAACCCTCACGCATTGTTGATGGGAACAAA
ATGGTGTAGTCACTTTGGAAAACAGTTTGATAGTTCGTCAAAATGTTAAATATATAATTACCATATGACTCAACAATTTCACTTCTAGGTATATACC
CAAGAGAAATAAAGCATATGTCCACACAAACACTTGCACATAAATATTTATAGCAGCATCATTCATAATAGCTAAAAGGGAAAACAGCCCAAACATC

CAACTGATGACAGACAAACAAAATGTGGCATATCCATACAATGGAATGTTATTCAGCCATAAAAAGGAATGGGGAATGGATCCATACCACCACATAG
GTAACCTTGAAAACACTAACTGAAAGAAGCCAGATGTGAAGGTCACATATTGATACTTCTATTTATGTGAAATGTTCAGAATAGGAAAATCTATAGA
GCCAGAAAGTAGATTTGTGGTTGCCAGGGGCTTTCGATGACAGGAGGGGAGGGGAGCAGTCATTGCTAGTGATTACAGGGTTCTTGAGAGGGAGTGA
GAAAAAATGTTCTAAAATGTATTGTGGTGATGGTTGCATAACTCTGTGAACATACAAAAACCATTGAATTCTATACTTTAAAAAGGTAAATTTTATG
GGATGTGGATTATAGCTCAATAAAGCTGTCATTAAGAAAGGAAAAGGAAGGGAGGGGAGAAGAGGGAAAGGGGAGGGAGGGGAGGGGAGGGGAAGGG
GGGAGAAAAAAGATACATCGTGGCATATCTCTTCCCTTTTACCTGTCATTATGTGTCAGTGCCTCCCATTGGCCTAAACTACCCAGAAGCCAGAGGG

AAAGCCATTTGGGACTATATTCCACTGTGACACTGAGCAGAGCAAGGGAAGGATGGGGACACTTCTCTCACACATCTCTGACTCCCCAACATCTAAC
CCATGGCCrGGCATGTGATATTCATGCTATAAATATTTGTTGAATCAGTAAATGGCCCCTCATCTAGATTCTGCCAGGCAGCGCAGGGGGCTTTTTG

AATGTAAATGACCTAATGATTCAGGATCCTATGACCAGCGACTGTCACCCTCTGTCACCTTTCTTCTGCCCCTTTATCTCTGAAGGCATTGAAGGGG 

CCCTGCATGAGTC!AGGGAGGGCTGGGGGGAGGAGCAAGGGTGGGAGGGGCAGGGAA 

TTCCTCCCCAGACAGGGGTAGTGCGAGGCCGGGCACAGCCTTCCTGTGTGGTTTTACCGCCCAGAGAGCGTCATGGACCTGGGTGAGTGAGCCTCTT 

CATGTGAGAAGGAACAGTACCAGGGTCTTGGACACCCAGACTGACCCTGTGGAATGGGGGTGGAGGATGCGGGTGGAGCGCATAGGGGTGCTTCCTG 

GAGAAATCAGTTCTGATTTGGTGGGCATTGGAAGCTACTTCCAGAGAATGCCCCCTGCTGTCCCAGGAGGGGTGCACCTGCCCTTTCAGACCTGGAG 

AAGGGGTCAGCCTTTGGTGCCAGTGGGGACAAGGGAGGGGCCGAGATCATGTCTTTCTAAAACAGCAGGGACATAAGAGAGTGGCGGGGCTGGGCTT 

GGTGGTCTGGCAGAACCTGTTCTCATCATGAGGGTGATAGGTTAGGCCCTCTGGGACCTCTGGGGACAGGTTTCAACAGAGAAATTCTCCCTAGAGA 

GCTTAGGACCTTTTACCCACTCTCTGATTCCCACCTTTCTCCCCTGCCAAATATTCATATACCTGTCCACTTGGGGTCATGAGGAGATGCAAATGAA 

ACGTGACTGGGTTGTGAGAATGGTGCGGTGCATTGCAAATGCAAGCTCTGGTTGCTTTCTCTCTGCTGCACATCCCACCTGCTTTGATCTGGCTTTG 

CTGTTGCCTCTTGCAAGTTAAGGAACGGGCAGGCAACATGGAATGGGATGGCAAGG^ 

GCCCCTGCCCCTGAGCAAACCTCACGCACTCTGCCCCGGCAGATAACTCITACT 

GGAAGCACTGAGGGTGTTCTGAGTGGTTCATCCAGCTCCAGACGCCCCCTTTATAAATCTCAGCTGTGGAGTCAGACCAGGCTGAGGCTAAGAAGAC 
CAGGTTCCTAGATGGCTTGTCTCACCAGCAAACAATGCACCACGGCGGCCTCAAAGCCGCAAAGGGTTGGCTTTAA 

TGTCCTCACACCCCTCCTCGCCACACCACAGCCTCCTTGATTTCATCTCTTACTTTCCCGCCTGCCCTGAACGCTGGATCTGGACCCAGCAAACAGC 
TGTGAAAGGGTTAAAGTGGTCACCCCACCCACTGACTGTGAACTCCTGAAAGGCAGGTTCAGGTGCTTTGAGGATGTGAGGAGAAAATGTCTGCGGA 
ATGAATGAATACACCGTATCAAGAACTCCGTAGAGTTTAGTTTTTGAAAATCTCTTCCAAATGCCT 

CTCCCCTGCACCCCCTCCACCACCCCCCATCACAATCCCACAGTACAGACGTCACCCACAGAAGACCCTGCTCCAGCCTTTCATACTGGCCTCCAGG 
AAGGAAACCAAGCAGTGCCCAGTGGTGTGCTACTGGGCATTTGAAAAGCAGCTCCCCGGGGTGGAGCACTGGTAGCTGACTCCAGGGTGGAGGGAAA 
GTCCTGAATTTGTAGTGCTTGCCAATTTCCATGGTGTAAATTCTCCCACCCTGGCTGATTGATTTCAAGCCATCTACATGACTGAACAGGGGGTTGG 
GAAGATGTGCAGAGTGTCTTACCCCAGACTAGGTTTAGGGGTGCTAGACTCCTAGGCTTTGATGCAGGGAGAGAGGGTAATCTCTGGTCCAGGAAGA 
AGGGAT ATTCTTGCCACTTC ACTCAACTTGAGAGAGCCCCAACGGT CAAACC CGAAATA CTGGTAGGAGGGGAAGCCGCTGAC AAGC TTTACT CAAA 
ACGGGCTCTCTGGGCCGTTGATGGGGTGGGTGGGCCTTGGAGGATCTGTTGTTCTGCTCTTCATCTGCCCAGCAAGGAGCAAGAGGTGGGTGAGGAG 
GGAGGGTTTCTGTTACACAAAATGAAAACTCCCAAGCAGGGCTGAGCTGGCCTTTTTC^^ 

TCG AGC AATGAG ACCGAAGATCTTTCAT AATTCT C AAAAG CAAAG CAAGG AAAAAATG AACCCATGAGAGATGTGTG CCTCACAGCT CGGA CAT AAA 
TGGTGGCAAAGGGGGATGCTGAAGCTAATTCAGGGCCAGTTTCTTGCCTGCCAGAAACTTGGCTGGGGAGGAGAGGAACGCTGCAGCCTGGGGTCGC 
TGTTGGAAAAAATCTCTCCTCTTTGCTTCGGACATTCAGGCAACTCTCATTGTTTGAGGGATTCAGGGATGACCAGTCTGGTGGGGAGAGTGACACA 
TATGTCACAAAGCCACTGCAACAGAATAAGATACAGCATTGACCAGGGCAAGGGCAAGACAAGAAGGGAATAGTGAACCTGGGTCAGAGAGGTATTT 
AAGGTGGGTTTTCAAGGGTGTGTAGGAGTTGTCAAGCCCAACAAGGAGCTGCAATGGGCCCAGGATGTGGCAGGAATGGCAAACTCCATTGCCATCA 
GAGGCCGACAAGGTAATAGAGGGAATGGGCATAAGGTGCAGGAAGTGGAGGGGAGTAGAGTGACCCAGAGGGTGCATGCCCCACCTAAAGGCATTCA 
AATCTATTATTTCAAAAACTCCATGCTGACAAGCAGAGTGTCCTGGGGACTAAGGCTGCAGCCGTGTAGATGGTCAGCTGAGACAACAGTAACAGAG
A CAAGA AAG AAG CAAGCGGAGGGGAGGGTGGAGT AGCTGG CCATG CCAAGG AGG AAGATGAGGAGG AAGGACG CAGGCACAGGAG ACCGAATCCTTA 
AAACGTGGTACGGCATCCCTAAGGAATTTGAACTTTATCCTGAAAACAATGGGGACCCATCAAAGATTTTTATTTTTCAAGTAGGTGAATTAAAGGA 
TGAGATTTGTG TTTTGG AAGG AGTT CTGTGGCAGCAAAGAGGCAGCAAAGCC CGTGT CAGAACAG AGGTGGGG CCGGCTCTTTTGGGGGCCGGGATG 
TTG GGGGPiTGGAAG CAGGAAGCA CTGGTGGCCAGTGGGGCATGGG GAG C AAAGATTC CC CTCATG CTCCGTGGAGCGTCCTAGATGCAGCAGGCTC A 
TCAGAAACACAGCCTGATGCAGTAGCCAGAGCCCAAACCTTGCTATCCTGTTCAAATCTCAGGTCCCTCTCTTAGCAGCTGTGTGGCCCCAGGCATT 
TGGCTTAATCTCTCTGGGCCTCTATTTTCTGGAATCTAAGATGAGGATCACCAGGGTA^ 

AGTGGCCGCACAGAAGGATGGTTAGGAGTGTGGTTGCTGGAGTTCGAGGGTTTCAAAGCCTGCTCCCCTGCTGGCCATTGTGTGACTCTGAGAAAGT 
TACTTG CTGTGTTTCAG TTTCTCATTGGTAAATGGGG ATAAT AGT AGCGCCTAC CTCATGGGGCTTTTG AGAGGATGAAATGAGAC CATAC ATGCAG 
A CAGTGTGG CACGC AGTGAG AGCTC AGTGAAT ATGAATTATCT CATTGGAT ACAGCAGTTGAT AAC AT AGAAAATG CAAGGTCTACCAGGCTCTCAC 
CCATCCCTGCCTACTGTCTGAAAGTGGATTCCCCAGATCTCTAGACCTCTCACCACCTCCCTGCTTCrCCTCTGCCTCCCTTATCCAAAAAAAAAGA 
GAGAGAGAGAAAAGTTGTGTTGGCAGGTGAGGAAGTTCAAAGAGGAAAAAGCTTGGGGTCGCTCTTGGGGCGAGCTGATTGGGCAACTGTGATTTCC 
TGGGCAGTCTGGAGGACCTGGAAAAATGTGGGGATGGAAGTTCATGGTTTGGAGATGGGGGCAGCCAGCCAATAATTTTCTTTCCTGGATGGGCAGT 
CTCTCATGTTGGGGGCTCCGGTTGGAGTTCTAGGGAATTTTCATGATGATCATTACTGAAAACTCAAGA^ 
ACCACTCCTTAGCCCCAGAGATTCCCAACTTCATTGGTCTC^CTGAAGCTGCT^ 

TCTGACTCAAGGCCACTGACCAGGCTCCCTTTTGCCACATACAAGAGTTCCCCTAAGTTTCCCTGTGATGTTTCTTGAAAGCCACCTGTTTTGGACC 
TACAAGACCCTTGCTCATCCACCTCCTCCAAAACACCTGGAATGAGAAATCACAGGCACACACGCCAGCACTCCCCCAACTCCCCTCCCCTGACGGG 
AACAGCAGGAGTGGGACAGCGTCCGGGAGAAGGGGTTATCAACAGAGTTCAACAGAAACACAGTTCTTGCTCCTCAGATCTCACCCGTTGTGACTTA 
T CAAAACTT TGAGGTTT CTC ATTTCTCC CTGACAGT C AG AGCGGG CTTTCTCTAGAG TAACCGAAAG ACA CTTACGTGTTTTAAAGAAACCTCGAG A 
CTCGACCTCTAGATGAGTCAGTGGAGGGCGGGTGGAGCGTTGAACCGTGAAGAGTGTGGTTGGGCGTAAACGTGGACTTAAACTCAGGAGCTAAGGG 
GTAATTCAGTGAAAAAGGGGAATGAGCGGTGGGGAGCTCTGTTGCAACAGGGTCCAATCGCAGCAGGACTACAAATGCCCGAGCGCAGGCTGGGAAC 
GAG GGG ACAGCGGCTGC CTGTCCCC AGAATAG AAAAT GCAGCT AGGAAGCC CTCTTTGAGTG GAC AGCGGAGG ACTGGACTGCCAGGCCAAGCATCA 
GGGGCTTCATCCTCAGGGCCGGTTAGAGCCCCTGAGGATTTAGGAGGAAGGTGAGGTGGGGACTGCCATCAGAGCTAAGCTTTAGGCAGATGTGTCT 
AGCAGTAGTGTGCAGGACTATTTGCTCAAGGCGGGAGAAATACCTATTTATTTATGTAGCCCGATGCAGTCTCTCCTTCCCGGAGGAACTCCCAAGC 
CCAGAATCCCTGGCTTTACACAATTTCCGGGGAACACATTAACCAACTGCAACCTGGGTGCCATGCAGGGAGGCTTCCAGCAGGTGGTGCTGTGGCC 
AAGGACACGGAAGCGCTGGGCTCTCACTGCTTCCCC^GCCTCCAGGAGAAGGTGCCTTAAACAGGTTCCCACGCATTTCCTGGCGCTATTGAGCTTG 
GAGCTGCCAAGGGCCTGCCTTCACTTGTGGCATCGCAGTTACTGACTCTCCAGTGGGCCAGGCCCTACCTAGCTGGGACCTGAGGGTCAGGATACGG 
GAAGAGGGCTACTGCCGCCCTGACTTGTAGGTAAGCTAAAACATCTAGGCAAATT^ 

AGTTCCTGCTGGGCCCCAGGCTCAGCCACAAAACCAGACTGAGCACATCTGGGCCTTGTGCTACACCGGGGTCCCCATCAGCTGCTCAGACCCCAGG 
AAGCCTCCTGACTTCCAGAACACGTCGCTTCCCAAAGCTGACCCTGTTCCCCCTTTCACCTGCCCCCCAAGCCTAAGTTAGAAAACAAGTTAAAGGA 
GAGAAAGTGTGTGCAGGCACTTCAGTTAAGCAAAAAAATAAAGAAGTCGTGACATCAGGAAAGAGCGTCCTTCCCCCACACTGCAACAGGACAGCAT 
CCCTTCTGTGTCCTCTCGGGCCATCTTGATGCCCTCTCCCTTGGAAAAAGGACAGGCATTCGTTTACTTGTCATTCATGTATTTATTCATCATTCAT 
GTATTTATCATTCATATATTCATCAGCACACAATTGTGGGGCACCTCCTC^TTCTGGGCCTGCACAAGATAAAAGGGAAGGGATCTCTGTCCCAAA 
GGCAGGCCAAGGGTGTGCGCCCAGTGGACCAACAGTCAGTGCTGAATGAATGTCATTTCAGTGAGGGGGTGTTGTGGAGCTGGTGGAGCCCTGAGGC 
AGTCTCCATCGCTGTGGACCAGTAAGGAACCTCATCCCTCTACCCAGCCTGTGAGGTCATCCTCCTTCCTCAGCCCACCTCCCTCAGGAGACACTCA 
GGCTTGGCTTCATGCACGTGTGGCTGCGCACTCACCTGGATGCCAGGAAAATGCTCCTGGGCCAGTGGCTACCAGCACCAAAGCAGGGCTGGAAACG 
TGTGCACCTACCCCACGACCTCATAGCTTTGCCGCCTCATCCCACATCAGGTGGAGTGGGAGGGGGCACTCAGACCCACCAACGAAGCTCAGTCTTG 
AGGGGTGGTCTGGGCTAGAGAAGACTTCCCAGCTCCAAGAAGCAGCCCAGATGGGCCAAGACAGGGGGTGGGAGGGGCTGCCCTGCATTCTGATGCr 
GCTTCTTTCCAGGACCTGGGAGTGGAGGAGGACAAGAAGGAGGTGAGGACAGTGATGCCGGGCAGCAGGCCAGGTGTTCAAAGGCACAATCTGGTTC 
TGATGTTCTCTTTCAGCAAACACAGTGCCTGGAGCTTGGGAGGAAAGTTCCCAACAGCGTCTCCCCCTCCACTGCTTTCTTTAATAACAAAGACTTG 
TCCCTGCCAAGCAATAACTTTCTCGCCTTGTCTCCTACAGGGAAACCAATGAAAAGCGTGCTGGTGGTGGCTCTCCTTGTCATTTTCCAGGTGAGGT 
TCTCTGCCAAGGAAAGCTCTGTTCCCTTCTCCACACGGCCTCCCAAGTCAGGGGTTTAGTGGAGGAGCTGGGCATTTGCTAGAAGATACAAGACCCA 
GACATAAGCCCGCCTGCCACTCATTAAGCTAATGGCTGCCCCCTCACCATCTGCTGAGTGACCCTGGAGAGCCACAGATGTTCTCTGGACTTCCTTT 
ACTCCATTTAGGTTAAAACATGGGGGAAAATAACTTACCACCTTCCCTATCCCACCTCCAATCTCCCTGCCCAGGAGACAGAAATATGGACTCTGAG 
TTCTTAGAGAGAGAAGAGCTGCAAAAAGCAAAAAATGTTACTGGGTTTTGTGCACGTGTGCTGAGAAAATCACCCTGGATCTGGCGATCTTGTTACT 
ATCCTATCGCCCCAAAGTTGTCTCTTTCCAACCTTCACCCAAAGGCTTCCCTCGCAGCCCCCGTTTTCTTTGGTGAAATAGAGACCCATCCGGCCTC 
CCTGGCAGGGGAGGGAAGAGAGGATGGGGCCCTAGGGATGGGGAGAGGGGTGGATGAGCACACACTTTATAGCAGGCCTGAACTCGGCAGGAACAAA 
GACCCTCACTTTACAAAGTTCTGTTTCTCCATCCAGAAACAGACAGAAAGTG 

TCAA CTTTCTC C CA TTTGTT AGAGGTGGGAGCTC AGAGTAGGGGT AGGGGAGTC CAACTGAGGCCAGAGCT AGAGAGGC AGTGCT TCAGAATT CGG C 
TCATCCCCAGGTCTGGAGGCCAGAGGGGCAGCTTTCCAGAAGCTCAGGCCAGAGGCAGGCCCCAAGCTGGGCACTGTTCCCAAAACCCCATCCCAGA 
GGCCCCCCGGAGCCTGGGCTGCAGCCCATAGCCTGGCCCCCTTCCCCATCACCACTCCCTCCTCTCTCTCCCCTGTCTCCAGGACTCTGGGGGGCTG 
TATTGCTGGCAGTGGAGGCTGGAAGGGTGGGAAGGGGAGGAGGACAGCTGAGGCTCATTCTTCCCTTTCACTTGGGGAGGGGGTGCTCAGATAGGGC 
AGGATGTGGTGGGGCTGTGGGCTGCAGAGGCAGTGAGGCTAGGGGAGGGAGAATTATAAAACAGGAACAAGGTTCAAGGGGAGAAGGGAAGGCAGGA 
CTGGATGATGCAGAGTGCACTTCATAGAGGCCGGGAGTACTGCCCTCCAGGAAGGCTGAGACCCCCTCATGTCCAGGGACCCCGGGTCTAAGGCCTT 
GGGCCGTAACCATGGCTGGGCCAAAGGCAGCAGCCTCCTTCAGGGCTCCCAGGCTGCAGCCATGCCCAG 

TTCGTCTCAGGCTAGCAATGGCCAAAGAAAAGGCTCCTGGACATCATTTTGTCCTCACCATGTTCCCAAGCAGGACAGTGATGAATCCA 
GTGTGAAAAAGAGATAGAGACAAGATGCGGTGAATCCCTCC^GCATAAATGGATAGAAACCCACCGTCGCTGAGCTGCTGAATCrGGTTCCTGGTGA 
TAGAACCATGAGCCTCGTGAGGCAGGTCCTCCAGCCCAGCTCCCTGCCCTGCACAAACCCGTCCATCGCTGTGCGCAGAGGCAGGGGCAGGCATGGA 
CCCCCTCCTTCATTGATAAAATCACAGCAGGCTGCTGAACCTTCACCCGACCCT^ 

GGGATCTGCCGGAAGAGATGCGAGG AAG AGAGAGGCAGAGAGGTGGGGGGAGAGGGCGGGAGCAC AGAGGGAG AGTGGG AGATGAG AG CT GCG CTT C 

CTTCCAGGGACACAGCTGCTGAGGAGTCCTCGCCCCCACCCCCATGGCCCATCATCCATTCACGGAGTTTATGGCACTTGGGCTCACGTTGCCTGAG 

CTCCTGCTGGGTGCTAAGAGGGGACCCAGGATCTAGAGTCTCTCCAGGTCACCCACCACCCATGATCACAGCAGTTTGCCCCAAACCCAAGGCTTCA 

TGGCCTCAAAATGGGAGCATTTGCCTTCCTCTCTTCTTACTCCAGGCCCTCCTCAACCTTGACCCCCAGGTGGATTTTCCCCCTAAAGCTGAGGTCT 

TTTGTAAGCCCCCTGGAAAACTCCACCAAGTTCTAAAGGAGGGTCAAGAAGAAAGACACATGTTCCCTCTCTCACGGAGCTTCCAGGGGGCAAAA 

AAAGCTCCATGGAGAGGAAATGGTGCTGGAGCCTGGGCTAGGTACCTCACAGCAACCATGGGAAGCAAATAGGATTCATCTCCAT 

AAA ACTG AGGCTCT GAG AGAG AG AG AGAG AG ACAGAA AGAGTTTA ACC AGCTCAAGG CT CAT CTGGCCCCAGAGCCCAGCTGATTTCC AGTCCCCTG 

TCAGCACATGTAGAGTCTCTCAGACTTGGCAAAATCAGCCACTTGCTACTTACTAGCAGAACTGTAGAATCTCAGGCCCCACCTTAGCCTACTGATT 

CAGGATAGACATGTTCACAGGATTAATCAGGTGATGCGTATTGGAGTTAAAATTTGAGAAGCACTGATTTAAAACCAAAGAATCACCTGGAGACTTT 

AGGAAAACACAGATTCCTAGACCCATCACTTGAGATTCTGGTTCAGAATTCTCAGGTGGGGCCTGAGAATTTGCATTCCTAACAAGTTCCCAGGTGA 

TGCTATGATACTGATGATGCGGACCTCACGATGAAACCACTATTCCTCC<^CCTGGGCAACATGGCAGAATCCCATCTCTACTAAAAATACAAAAAT 

TCGCTGGGTGTGGTGGCATAAGGCTGTGGTCCCAGCTACTCAGGAGGCTGAAGTGGAAGGATCACCTGAGCCTGGAGAGGCCGAGGCTGCAGGGAGC 
CATGATTGCACCACTGCACTCCAGCCTGGGCAACAGAGTGAGACCATGTCTCAAGAAAAAAAAAAAAGAAA

AGCC AGAGT TGGAGC CACCC AGCTAAACTGGCCTGTTTTCCCT CATTT CCTTCC CCGAAGGT ATG CCTGTGTC AAGATGAGGTCACGG ACGATTACA 

TCGGAGACAACACCACAGTGGACTACACTTTGTTCGAGTCTTTGTGCTCCAAGAAGGACGTGCGGAACTTTAAAGCCTGGTTCCTCCCTATCA 

CTCCATCATTTGTTTCGTGGGCCTA CTG GGCAATGGGCTGGT CGTGTTG ACCTATAT CT ATTT CAAGAGGCTC AAG ACC ATG ACCGAT AC CTACCTG 

CTCAACCTGGCGGTGGCAGACATCCTCTTCCTCCTGACCCTTCCCTTCTGGGCCTACAGCGCGGCCAAGTCCTGGGTCTTCGGTGTCCACTTTTGCA 

AGCTCATCTTTGCCATCTACAAGATGAGCTTCTTCAGTGGCATGCTCCTACTTCTTTGCATCAGCATrGACCGCTACGTGGCCATCGTCCAGGCTGT 

CTCAGCTCACCGCCACCGTGCCCGCGTCCTTCTCATCAGCAAGCTGTCCTGTGTGGGCATCTGGATACTAGCCACAGTGCTCTCCATCCCAGAGCTC 

CTGTACAGTGACCTCCAGAGGAGCAGCAGTGAGCAAGCGATGCGATGCTCTCTCATCACAGAGCATGTGGAG^ 

AGATGGTGATCGGCTTTCTGGTCCCCCTGCTGGCCATGAGCTTCTGTTACCTTGTC^ 

CAAGGCCATCAAGGTGATCATCGCTGTGGTCGTGGTCTTCATAGTCTTCC^^ 

AACATCACCAGTAGCACCTGTGAGCTCAGTAAGCAACTCAACATCGCCTACGACGTCACCT^ 

TCTTGTACGCCTTCATCGGCGTCAAGTTCCGCAACGA 

TTCCTGTCGGCACATCCGGCGCTCCTCCATGAGTGTGGAGGCCGAGACCACCACCACCTTCTCCCCATAGGCGACTCTTCTGCCTGGACTAGAGGGA 
CCTCTCCCAGGGTCCCTGGGGTGGGGATAGGGAGCAGATGCAATGACTCAGGACATCCCCCCGCCAAAAGCTGCTCAGGGAAAAGCAGCTCT 
CAGAGTGCAAG CC CCTG CTC C AGAAGAT AGCTTC ACC CCAATCCC AGCT ACCTC AACC^^TGCCAAAAAAAGACAGGGCTGArAAGCTAACAC CAGA 
C AG ACAACACTGGG AAACAGAGG CTATTGTCCCCTAAACCAAAAACTGAAAGTGAAAGT CCAG AAACTGTT CCC AC CTGCTGGAGTGAAGGGGC CAA 
GGAGGGTGAGTGCAAGGGGCGTGGGAGTGGCCTGAAGAGTCCTCTGAATGAACCTTCTGGCCTCCCACAGACTCAAATGCTCAGACCAGCTCTTCCG 
AAAACCAGGCCTTATCTCCAAGACCAGAGATAGTGGGGAGACTTCTTGGCTTGGTGAGGAAAAGCGG^ 

CCCTCCCTCCATCGTTTTCTTCACTGTCCTCCAAGCCAGCGGGAATGGCAGCTGCCACGCCGCCCTAAAAGCACACTCATCCCCTCACTTGCCGCGT 
CGCCCTCCCAGGCTCTCAACAGGGGAGAGTGTGGTGTTTCCTGCAG 

GGGGATGACATGCACTCAGCTCTTGGCTCCACTGGGATGGGAGGAGAGGACAAGGGAAATGTCAGGGGCGGGGAGGGTGACAGTGGCCGCCCAAGGC 
CCACGAGCTTGTTCTTTGTTCTTTGTCACAGGGACTGAAAACCTCTCCTCATGTTCTGCTTTCGATTCGTTAAGAGAGCAACATTTTACCCACACAC 
AGATAAAGTTTTCCCTTGAGGAAACAACAGCTTTAAAAGAAAAAGAAAAAAAAAGTCTTTGGTAAATGGCAAATGAGCGTGTCTTTGTTTTGCAATC 
CCCCACACCCCACCTGACCGTCCCCTCTCCCCTCCTCCCTGCCCTCTCCCAGGCCTCGTTTCTTAGAGACGCTCAGAACCACAAAGTTGCTCAGGAA 
TGCTCTGAGCCCGTCTGGTGACACAGGACCAAGAGGCTGAAAGGTAGAAGGAAGATGAGGACAGGAGGTCCATCTTGCAAATACCCTTCCAGAGTTC 
AAGGGATAGGCCCCAAGCCTGCACAGTCAGCACTGAACCGGACATGCTGATCCTACTGCAGCTTTGGGAAGGCCAACTTTTCCGGGTAACCTCGATA 
CCTGGGGACTCTGCATTTAGCAGCTCTTGCACAGTAGCCTGCTCACCTGTCCCAGCTCAGACCAGAACTAGAGCCCTTAACCTTTTGTCAGCCCCTr 
AGACTCTGAAAGCTGGGGTTTGGAGGGAGGGCCTTTCTGTGGGCTTGGAGACCACAGCATAC^ 

ATGAGAGCTGGAACCACCCATGATCCTCTCTGCTAGCCACTGTAGCAGCAGTGGCCTCCGTGGTGTATGTCATAGTCCCTTCCTGCTGGTACAACAA 
AATGCC AC CG ATGGAGTAATTTAC AAACCATAGAAATGTAT TTCT CACAGTT TTAG AGGATT GGAAGTC CAAG ATCAAGACGC CAACAGATTCAGTG 
TCTGGTGTGGACTCTCTCTGCTTCCGAGATGGGGCCTTGCCATGCCCTCTGGAGGGGACAAACACTGTGTCCTCATGTGGTAGAAGAGTGGAAGATC 
ACAATGAGCCTAACTAGTTCCCTCCTGCTCTTTTATTGCTATAAGGGCATTAATCCCGTTCACAAGGGTGGAGCCTTCATGACCTAATCAACTCCCA 
AAGGCCCCCCCCAACTTCTCATACCATTGCCTTGGGGTTTATGTTCCAACCATATGAATGTCGGAAAGATACATACATTCAAACCATAGTGTGGATA 
CTGACTGTGGAGGAGCAGGGAGAGAGGGACAGACAAAGGCCCCCACACAGGCCAGGTGTCCGTGTCATTTCGTGCCTTGGTGCCACCCAGAGCCGTG 
ACTTCTGCAAGCCAGAGAGCAGGTCAGTCCAACCTACAATGCGTCAAGAGGGGT 

AGAGGG AGGGCTGAGGC ACTCAC GCAAT AGGTGATTC CATCC ATT T CC CCAAGTGGAAAGAGCCT CGTTGAAAATGGAGC TG CCTGGT AATGACAC C 
CATTCGCTCATCTCGCAAGCACGTGACATTACCCTCCTCCGAGAGATGCACTGGGCTGGAAGGTTTTTAACAAGATTTGTCAGCTGGTCTCCAAGCC 
GGGAATTGGCGCCCTGAGAGGAATGAGCCTGGCACGGATGCAGTCAGCGTGGAGCGGCGTTTGCA 

CGACTGTGTGCGTGAGCGTGTGGCCAGGGGGCAATGCCAGTGTGCGTGTGTGTCCCCGAGGGGAACCTGCCGGAGCGCATGTGCGTGTGGAGCCCGC 
TGGTGT GGACAT CCTAATGAGGT TAC AGATGGGAAAACAACCCTCGGTTCC TGAAGGGG AAGAATGTAACAG AGCAGTCATGCAAAAGG CAG CCAGG 
CCAGCCGCACAAATGCGCCCCCTCAGCTGCAGATACATTTTTCT^ 

GTCAATTCCCTCGAACTGCTGTGTCTCAGAAGTTCTGACCTCGGTTCAGTGCTGGCCTGATCCCTGCTCCCTTGCCCGAGGCTCTCATGTGCCCATC 
TTGAGCTTGGGGTGACCCCTGGGAAGAAGGAAAAGGCCCCCTGGGAGGCCCACCCAGGGCTTCTCTCTTCTTTTCTTACCTGCTTCATACATCTGAA 
CCCTCTTCCCAGGTTCACAGCCAATAAGCCTTCTCC^GTCACCTGAAATTTCCTTTTCATTTCATCCCCAGGCCCTC^TCTCCCTCATCCTTGCTGG 
CTGCAACCCCAAGCCAAGCCCAAGGTATTCCCAGGCCACAGGTCTTCCCTTTCAGCTAGAAGCACCACACCTTTCTCATCACAGACTCACAGAGCTC 
CAAGGGACCTCAAAGATCAGCTGGGCCAATCCCCCAGTCATTTTCTAGATGGTGAAACTGAGGCCCAGCGGGGAGAAGCTGACTTGCCTGAAGAGGC 
GTGGCAATG CAGGG ATATTGGTTAAA CTTGCTTTGGAAGCCTGAGCGT C TGG CAGGGTTGCAGGAGGGTGTGGGGG AGAGCAG AG ACTCTGGAG CT C 
TGCCGACTTTGAATCTCATCTCTACTGCTTCCCAGCTATGTGACCTTCAGCTGTTGCCCAACTGGTCTGAGTCTTGGGTTTCTCAACTGCAAAATGG 
AGAGGATTTGTGTAAACAGTTAGCACAGTGCCTGGAATGTTGAGTGTGGTGGTGAGGAAGAGGGACTTTGCAGCCAGACTGCCTGGTTTGAACCCCG 
GCTCCACTGCCTGGGCTGGTTACTTAGCCTCCCTGTACCTCAGTTTCTTAATCTGTATAATGGGGATACTAATGGTACCTATGTCCTAGGATTGTTG 
TG^AAGGCAACGAATCCATATATGTAAAATGCTTAGAAAAACTTACATATGTGTTGTTGGTAATCTTTAGGAGATGTTTAAGAAACGGTAACTTAAA 
AAAAAAAGTTGCTAACCACAAGGAAACCAAATTCAATTCCTTAGCCCATTTCCATCCCCACAGAATTTATAA 

TGAGTG AATCTGGC CTTTCC ATCCATTGCATGGTGAGTCAACTCG TGC TCTG AGAGGGAAGCAACTAGCCTGAGGT CTCTTGGG AGGTCTT TGC AAT 
GCTGGAA CTTTAACCTAGTTCCCTGGGCTCCTAACC^GAGCTCTTCCCACCAGACCCTGCTGGGCTCACGGGAACATCCCTGCACATCCACaSTGGC 
ACCCGCCCTCATCCCACGCTGTCTCGTTCTCTCCGGCAGGCACCGTTCATCTTGTCCTTTGATTAGGTCCTTAAGGAGAACCTCCCTTCTCCCCTTT 
AAATAGTTCCTGTCCTTATCACTGAGTCTGGAGAATGAATGAAATATAAAAGAAATATGGCAGCGGTCCCAAAGGAAATCAGGCAACTGAT 
AAGAAA GGAAGATGGGC CGTCTT CTCTGGGAATGGTT CTCAGTTAC ATGTCT TTGTTGAGGGTGAACAGTTGG AGG AAAAGAAAC AGACCT AAAGGG 
ATTGTGGGGGGAATTAAGGCAAAGGAGATGGGGGAAAGGCAGGGGGAAGATGGGGCAGGAATGGGTTTTAGGGAATATACACTGGGGGCGGGACT^ 
AAGCAAAT CTT CAGTGGGT AGTTAG ATGTC CCATCCTGCATGCACACACATGCATGTGCACACACACAC ACACACA CA CACAC ACACAC ACAC ACAC 
ACATAGAGCCTCCCCAGGTCTCCCAATCTCAGGTGAGGAGAAGGGGGCTCTAT^ 
ATGGTTAGAATCGGTCATGTGTCTCTGGTCTGAGACCCTAGAGAAACTCCAAGGA 

G AGCAA CCC TAT CAGGCTTC AGA AAAGC AAGCCC TCCT AGCACTCCAACCCACGGGAGCAGCTGTCTGAGAAGTCCCAACTC CTC CAGCAGCCACC C 
CCTCTG TTCCTTCACTTCTCTTGGGCCCAGGCAGAGACACAGACAGGAGTCTGTGTGCAGAAAAAGCTCCCCAGGCCCTTTTCTGTGGAGAG^ 
CCAGACTAAGCGAATCTCAGCTGATCTCGTGCGAAATCAGAAACGCCAGGATGTGTATTCCCCACGGGCTCCTCTCTTTGTGTCTGTGCATGACCCA 
ATTCAGACTCCTCTGCCGCTGTCAGATTCTCTCCTGCCCATATTCACTTTC 

GACTTCGOTAGCTGGGCTGTCCTACGTGGTACAGTAACCACCACATAGTGTGGCCTC<^GGTTCTCATCTCTGGGTAAATAAAO^TCTCCGCACAGG 
CTTGGCCAGCTCTGGACTGGCTCAGTTCCTGCATGATGCAATAGGTCTCCAGGTCACATATCCGTGCCCACTAGCAAGAGGGAAGGTAAGGAGATGG 
TTCTTGCCCTGAAGTCCTCTTGTTCCTGTGTCAGAGTTTATTTAGGACAGAATAGT^^ 

ACAGCAGATGCAGCAGATGAGATGCCCCCAAACCATTTCCCAATGTTTAAAAAGAGGCCTGGAAAGGACAAGCTCATTGATCTCATTTGGTTTATGA 
AACTGCACTCAAGTGACTGAAGCAGAAACCTGACCACGTGTAAAGACTGAATGGCTCAATCTGCTTCCCAGGCCCAGAGGGTGTTTTCTGTGGCCTG 
GCCCAGGCCTATGGTGAAGCCGGTGCTAGGGGGTCACCTATGTGGGAGCCAGCTCTGCCTCCTGCCCCGAAGCTCCAGAGACTTCTCCTACAACTGA 
CTCTCTGGACACGCCCTCTTGCTTGCACTAGTATCTCTTACCTGCCACCTGCCCTCCTCCACACACCCCGATTTCAGGGAAGCTCAGCTGCTGAGAG
TGCTATTGACAACCTCTGGACTTGGAAATCAGGAGGAAGGGAATGGGAA 

CCTTTGTTCTC CAGGGC AAG ACACAGGG CTAGACACG CCGGAATCCTTAAAT CC TTTGT CAAAAGC AG AACTGGGATTTTCGTCC AAG GACT AC ACC 
TGTCCTGTCCTCAGCCTCCTTCTCAGTTGCCCCCCTCCCACCCATGCTCATCAGAGCCGAACCCrGAGGAAACCATGGACCGAAAGACCTTGTGAGC 
TTTTGTTGCTGGGAGTGTCAGAGGCTGGGAGACAGAAGGTATGAGAGATGAGATGCCATGGAAGGGATGGCAGCGATCACAGAGGCTGGGGGGCAGA 
GGAAAGAGACGTTGAAGAGGTGGAGGGGTAATGAACCTAGGCGGGAGACAGGCTAACATTTACTAAGTGCCCGTGATGCACCAAGGACTAGGCTAGC 
TATACAACCTTTCATTGAATTCTTATGACAACAGAGAAGGAAACAGGTTTAGAGAGCTCATGTA^ 

TAAGATTTTGTTACGTACTATACACTGTATTAAGCATCAGTATCTTATCTCATTTAATCCCCACAAC^CTCTACTAGAGTCTCATTTAATAAATGA 

GGAAACTTAGGCTCAGAGAGGTTAAGTAACTTTATCAATGTCACACAGCTACTAAGTGATTTCAACCAGTTCTAGCTGCCTCCAAATTTCTTGCTTT 

AGTCACCCTACTTCACCGTTCAACTAAACAGCTGTTTAATACCAAATATTTCTGGAGCCCCTACTGAGTGCCAGGCACTGTG 

ATCTTTCAAACCCTGCCAGACACAGAGCTCTGACCTGGGAGAAGGGAGAAT 

ACAGCTGGGCAATAACCAGGTCTTTGCTGCTCCACAGTACCTTGTCC^ 

TAGCCCTGCAG ATGACTGAGAGGGCTGAGATGGCTTTGCC AC AGACTAGTCACTGGGCAATATGGGCAGGT CACCT C CCTTTT CTGGGCC TCCAGCT 
CCCCAGCTGTACAATGAGGTTGCTGGACCAAATCATTTCTACCCATCGCCTCCAGCTCTGACTTGCTATGGGTGCCTTGTOTGGGCAGAGTCTGGGA 
GGC C AGTGC TC AGCGGGGTAGGGCTCTGGGCATGTTG ATGCTT GC ACG AGGTGC AGGGCAAGTGTGTG AGGGGCTGAGGAAGCTG CTC CA CCCCCAG 
CTGTGGTTTCTGGTGGGGCCACCATGGCAGCAGGTGGAGGGAAGTGTTTGTCACCCTTCCAACAAAGGCAGTCAAACAAAAGCCTCCCAGGGCACCA 
TGGGAATAGGCGGCAAGGGCGCOU^AATAAAACACACTTCTCCTTTGATAGGCATCGGTGGCAGGCATAGGCAGCTGCTAAGGGAGTCTGTCCTTG 
GGGTGGTGAGGAAAGGGTTCAGATTTCGTGCCCAAGAACACTCACACTCACAAGAGCTGTCCCACCTC 

TATTCACAC AAGAAACACCACCGCACACATCACACAC ACACA CAC CAACAC AGC AGT C ACAC AAT C AG C AT ACACATAT ACACATACAAATAG AAAA 
CTTGCCCCCATACAAGACCAGCTTCATGGGCCGCGACCTGCACAGTC^CTCTGGGCCCTGCACTAAGGCCCTGCACTAAGGCCCATGCTGGGTATAA 
TGCTTTGCTGCCTGCACTTTGAAATTCTTAATTATTTTTAAACAAGGGGCGCAGCATTTACATTTTGCAATGGGCCCTACAGGTTCTGCAGTTGCTC 
CTGTGGGTGTGCACTCATGAGAGCGTTCACTACCTGCTTAAGCATACCTTTTAACACTTTTACACTGTTGATGGGAGTGCAAATTAGTTCAACCATT 
GTGGAAGACAGTGTGGTGATTCCTCAAGGATCTAGAACCAGAAATACCATTTAACCCAGCAATCTCATTACTGGGTATATACCCAAAGGATTACAAG 
TCATTCTACTATAAAGACATATCCACACGTATATTTATTGCAGCACTATTTACAATAGTAAAGACTTGGAA 

ACTGGATAAAG AAAATGTGG CACAT ATACACC^TGGAAT ACT ATGCAGCC ACAAAAAGGAATGAGAT CGT GTCCTT TAC AAGGAC AT GGATGAAGCT 
GGAAAC CAT CA TTCTCAGCA AAATAACAC AG G AAC AG AAAACC AAACA CCAC ATGTT CTC ACT CAT AAG TGGG AGT TGAACAATG AGAAC ACATGG A 
C ACAGGG AGGG GAAC AT C AC ACACTGGGG C CTGT C AG CGGGT AGT AGGGGGC AAGGG AAGGG AGAGC ATT AGG ACAAATACCTAATG CACGAGGGAC 
TTAAAACTTAGATGATAGGTTGATAGGTAC^GCAAACCACCATGACACATGTATACCTATGTTACAAACCTGCACATTCCGCACATGTATCCTGGAC 
CTTAAAGTAAAATTAAAAAAAAAAAAATCCTACCTATATAGACAAGAGTGCATCCGCTACACACATACAC^CAGATGCACAAACACATGCATGCOIC 
AAATGCAACACCCACACAAACAGCAATAGCCCCCCATGTTCACGCGCCAAAGCACACACATTGATACCCAGAAGGGGTCCAGACGCTCACAAGCAAC 
GCTACAACATGCAGCCTATGCCCTGCAGCCTGCTGACACAATTCAATTGACTTA^ 

AGAAAACAGGTTCATGTCCTCCACCTGCAAACATCTCCCACCTCTCTCTTCGGCCCTGGGCTCACCCACCCCACTCAGCGGCTGGGACCAGAAGAAA 
GGGAGAGCCAGGGAGGTGGGCGCATGGACGGAAAGGGGACAGGGATGGAGGTGGAGAGGGAGAGAGAGAGGGCAAGAAAGAACGGGTGGGGAGTGGG 
GGCAGGAACTGAGTCCCCAGGTGAGCTGGCCCCAGCAGTGGTCTCATAAGAGCTCAGAAGGTCTTGGACCCAAAGCAGTAAGAACAATGCCCCTGTG 
CTCCCCGCAAGCACAGGGCGGTCCCGCCCTCAGAGTCAGTCAGAGGATGCCCACCTCCCTTCTTACTAGCTACACAGCCTTGGGCACGCCGTGTAAC 
TCCGGGCCTCAGCTTCCTCCTTTGTGGAATGGGATAGCCAATTCTACATTGCAGGGTGGCAAGGCACACACAGACACAACACCTCCTGGATTCCTGC 
AGCAATCAACAGCTCGTCTCGTGATCAGGCCTGCAATCTGATTCTTGGCACAGGAACTCCAGACCCCCTCATTACAGCAGACATTTGGAACGCGTTT 
ATTTCCCATCTTCTCCTGGTCTCACTTCACACCTCACAGCCAGGTACAAGCACAGAGGTGGATGTAATTCATGGCCTCATGGCCACCATCACTTCAG 
GGACCTTCAGGATCCTGCAGCCCCATCCTGCCCCACCCCCTACCCAAACCAAGCAGGACTCCAAGGCCTGGAAATTTTTCTTCCTCCTCTATTCTTT 
TTTTTTTTTTTTTTTTTTTTTTTGAGACGGAGTTTCACTCTTGTTGCCCAGGCTGGGCAATGGCGCGATCTCGGCTCACCACAACCTCCGCCTCCAG 
GGTTCATGCCATTATCCTGGCTCAGCCTCCTGAGTAGCTGGGACTACAGGCACCCACCACCATGCCCGGCTAATTTTTTGTATTTTAAGTAGAGACA 
GGGTTTCACCGCGTTAGCCAGGATGGTCTCGATCTCCTGACCTCATGATCTGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCAC 
CGCGCCTGGCCTCCTCCTCTATTCTTTACCCCCAACCTGTATTCTCCACCCCCAACCCTAACCTTCCTGATTTATTTATTTATTTATTTATTTATTT 
ATTTATTTATTTAGAGA CAGAGTCTTGC TCTGTCGCTAAGGCTGGAGTGCACTGG CT CA CTGC AAC CT C TGCCTCC CAGGTTCAAGTG AC TCTCCTG 

AGTCGCCCAGGCTGGAGTGCAGTGGCACGATCTCGGCTCACTGCAAGCTCCGCCACCCGGGTTCACGCCATTCTCCTGCCTCAGGCGCCCGAGTAGC 
TGGGACTACAGGCGCCCGCCAATATGCCCAGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACTGTGTTAGCCAGGCTGGTCTCGAACTCCT 
GACCTCTAGTGATCCGCTCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATCCCTATTTAAAATTGTCCCAGCCTCTGTCCCTTCCCTGAAAAC 
CCCATCATTCTGTTAGGTGATACCAGATTCTTAAAGAGTGATTCGGGAAAAACGAATGGGCCCTAAGGGGCATGCCAGTGTGCCCCATTCCAATTAC 
AGTCACCCACAGGAGAGAGGAAAGGTCGTCCAAGTGGCCACCAACAAGAAACTATTTCATCAGATTTCTTTTTTGTTTGTCTGTTTGAGACGGAGTC 
TCACCCCGTCACCCAGGCTGGAGTGCAGTGGCGCGATCTCGGCTCACTGCAACCTCTGCCTCCCGGATTCAAGTGATTCTCTTGCCTCAGCCTCCCA 
AGTAGCTGGGACTACAGGCACGCACCACCACATCCAGCTAATTTTTGTATTTTTAGTAGATACAGGGTTTCACCATGTTGCCCAGGCTGCTGTCAAA 
CTCCTGACCTCATGATCCGCCCACCTCAGCCTCCCAAGGTGCTGAGATTACAGGCGTGAGCCACTGCGCCCGGCCTATTTCTTCAGATTTCTTATAG 
GACATAAGTCACTC AGATC CCCTCAC AGGCTTT CTGT CAACTTAC TGAGAGGG AAACTG AGTC AAGG AAG AGTG AAGTG AC TTGGGTG AGCAGCAG A 
GTTGAGGGTTGTGGTCTTTCTGATGTGCAGTCTCCATGAGACCAGATC 

HUMAN SEQUENCE - mRNA 

GTGAGACAGGGGTAGTGCGAGGCCGGGCACAGCCTTCCTGTGTGGTTTTACCGCCCAGAGAGCGTCATGGACCTGGGGAAACCAATGAAAAGCGTGC 

TGGTGGTGGCTCTCCTTGTCATTTTCCAGGTATGCCTGTGTCAAGATGAGGTCACGGACGATTACATCGGAGACAACACCACAGTGGACTACACT^ 

GTTCGAGTCTTTGTGCTCCAAGAAGGACGTGCGGAACTTTAAAGCCTGGTTCCTCCCTATCATGTACTCC^TCATTTGTTTCGTGGGCCTACT 

AATGGGCTGGTCGTGTTGACCTATATCTATTTCAAGAGGCTCAAGACCATGACCGATACCTACCTGCTCAACCTGGCGGTGGCAGACATCCTCTTCC 

TCCTGACCCTTCCCTTCTGGGCCTACAGCGCGGCCAAGTCCTGGGTCTTCGGTGTCCACTTTTGCAAGCTCATCTTTGCCATCTACAAGATGAGCTT 

CTTCAGTGGCATGCTCCTACTTCTTTGCATCAGCATTGACCGCTACGTGGCCATCGTCCAGGCTGTCTCAGCTCACCGCCACCGTGCCCGCGTCCTT 

CTCATCAGCAAGCTGTCCTGTGTGGGC^TCTGGATACTAGCCACAGTGCrCTCC^TCCCAGAGCT 

AGC AAGCGATG CGATGCTCT CTC ATCACAGAGCATGTGGAGGCCTTTATCACCA TCCAGGTGGCCCAG ATGGTGAT CGG CTTTCTGGT CC CCCTGCT 
GGC CATGAGCTTCTGTTACCTTGTCATCATCCGCACCCTGCTCCAGGCACGC^CTTTGAGCGCAACAAGGCCATCAAGGTGATCATCGCTGTGGTC 
GTGGTCTTCATAGTCTTCC&GCTGCCCTACAATGGGGTGGTCCTGGCCCAGACGGTGGCCAACT^ 

AGCAACTCAACATCGCCTACGACGTCACCTACAGCCTGGCCTGCGTCCGCTGCTGCGTCAACCCTTTCTTGTACGCCTTCATCGGCGTCAAGTTCCG 
CAACGATCTCTTC^GCTCTTCAAGGACCTGGGCTGCCTCAGCCAGGAGCAGCTCCGGCAGTGGTCTTCCTGTCGGCACATCCGGCGCTCCTCCATG 
AGTGTGGAGGCCGAGACCACCACCACCTTCTCCCCATAGGCGACTCTTCTGCCTGGACTAGAGGGACCTCTCCCAGGGTCCCTGGGGTGGGGATAGG 
GAGCAGATGCAATGACTCAGGACATCCCCCCGCCAAAAGCTGCTCAGGGAAAAGCAGCTCTCCCCTCaGAGTGCAAGCCCTGCTCCAGAAGTTAGCT 
TCACCCCAATCCCAGCTACCTCAACCAATGCCGAAAAAGACAGGGCTGATAAGCTAACACCAGACAGACAACACTGGGAAACAGAGGCTATTGTCCC
CTAAACCAAAAACTGAAAGTGAAAGTCCAGAAACTGTTCCCACCTGCTGGAGTGAAGGGGCCAAGGAGGGTGAGTGCAAGGGGCGTGGGAGTGGCCT 
GAAGAGTCCTCTGAATGAACCTTCTGGCCTCCCACAGACTCAAATGCTCAGACCAGCTCTTCCGAAAACCAGGCCTTATCTCCAAGACCAGAGATA 
TGGGGAGACTTCTTGGCTTGGTGAGGAAAAGCGGACATCAGCTGGTCAAACAAACT^ 

AGC C AGCGGGAATGGCAG CTG CC ACGCCGCC CTAAAAG CACACTCATCCCCTCACTTGCCGCG TCGCCCTCCCAGGCTC TCAACAGGGGAGAGTGTG 
GTGTTTCCTGCAGGCCAGGCCAGCTGCCTCCGCGTGATCAAAGCCACACTCTGGGCTCCAGAGTGGGGATGACATGCACTCAGCTCTTGGCTCCACT 
GGGATGGGAGGAGAGGA C AAGGGAAATGTCAGGGGCGGGG AG GGTGACAGTGGCCGCCCAAGGCCACGAGCTTGTTCTTTGTTCT TTGTCACAGGGA 
CTGAAAACCTCTCCTCATGTTCTGCTTTCGATTCGTTAAGAGAGCAACATTTTACCCACACACAGATAAAGTTTTCCCTTGAGGAAACAACAGCTTT 
AAAAG 

HUMAN SEQUENCE - CODING 

ATGGACCTGGGGAAACCAATGAAAAGCGTGCTGGTGGTGGCTCTCCTTGTCATTTTCCAGGTATGCCTGTGTCAAGATGAGGTCACGGACGATTACA 
TCGGAGACAACACCACAGTGGACTACACTTTGTTCGAGTCTTTGTGCn'CCAAGAAGGACGTGCGGAACTTTAAAGCCTGGTTCCTCCCTATCATGTA 
CTCCATCATTTGTTTCGTGGGCCTACTGGGCAATGGGCTGGTCGTGTTGACCTATATCTATTTCAAGAGGCTCAAGACCATGACCGATACCTACCTG 
CTCAACCTGGCGGTGGCAGACATCCTCTTCCTCCTGACCCTTCCCTTCTGGGCCTACAGCGCGGCCAAGTCCTGGGTCTTCGGTGTCCACTTTTGCA 
AGCTCATCTTTGCCATCTACAAGATGAGCTTCTTCAGTGGCATGCTCCTACTTCTTTGCATCAGCATTGACCGCTACGTGGCCATCGTCCAGGCTGT 
CTCAGCTCACCGCCACCGTGCCCGCGTCCTTCTCATCAGCAAGCTGTCCTGTGTGGGCATCTGGATACTAGCCACAGTGCTCTCCATCCCAGAGCTC 
CTGTACAGTGACCTCCAGAGGAGCAGCAGTGAGCAAGCGATGCGATGCTCTCTCATCACAGAGCATGTGGAGGCCTTTATCACCATCCAGGTGGCCC 
AGATGGTGATCGGCTTTCTGGTCCCCCTGCTGGCCATGAGCTTCTGTTACCTTGTCATCATCCGCACCCTGCTCCAGGCACGCAACTTTGAGCGCAA 
CAAGGCCATCAAGGTGATCATCGCTGTGGTCGTGGTCTTCATAGTCTTCCAGCTGCCCTACAATGGGGTGGTCCTGGCCCAGACGGTGGCCAACTTC 
AACATCACCAGTAGCACCTGTGAGCTCAGTAAGCAACTCAACATCGCCTACGACGTCACCTACAGCCTGGCCTGCGTCCGCTGCTGCGTCAACCCTT 
TCTTGTACGCCTTCATCGGCGTCAAGTTCCGCAACGATCTCTTCAAGCTCTTCAAGGACCTGGGCTGCCTCAGCCAGGAGCAGCTCCGGCAGTGGTC 
TTCCTGTCGGCACATCCGGCGCTCCTCCATGAGTGTGGAGGCCGAGACCACCACCACCTTCTCCCCATAG 
TABLE V



MOUSE NOMENCLATURE 
ICSGNM Ccnd1
Celera mCG21251 

HUMAN NOMENCLATURE 
HGNC CCND1 
Celera hCG27985 

MOUSE SEQUENCE - GENOMIC 

CACGGGATCCAAAACCCTCTCCTGACTTCTGCAAGCGCTGCACACACATGGTACATAGACATGCACACGCACGTGCGTAAACATGCTGGCTTTCTGA 
TCCTACGGCTTCTAATTTAGCACTAACACTTAGACTTTTTATTCCCTGTTGCTGCCATGACTGACCCCAGACCCTGCCTTTCAGGCTTCTCCTTGCT 
TTTCAGAC^TGGGACAGTTCTGGGGTCTCTTTTGCAAATTGAGACCTGTGCTCACCTAGC^ 

GAATGAGGCATGGTGGCTCACCCTGTGATTCCAATATTCAGAATGTGGAGACAGTAGAATGAAGAGTTCCAGTTTCAGGTAC^CCAAACCTTGTACC 
AGAAAGAAGAGGGGGAAGAGAAGGAAGAGGAGGGAGAAATCTAGCAAAAGATGGAGGAAGCATACAGTGACTTTGAAGATGTGTTAACTTTCCCTTG 
CTGTAAGGAAATACCTGCGGTAATTAACTTATAAAGAACAGAGGTTATGTGAACTCACAGTTT^ 

TTCG AGT CT CT AATGCAG ATGATGAGAGCGAAATGCTG AG CAAA CAGAA CTGAACTG CCC TGT ACT AC ATGGCAAAT C AG AGG AAG AGTAGTCC CAT 
CGTCCTCTCTGAGGCCACCTTCAGTAACCTAAGGACTGCCAGGCTGTGCTGTTTGTTTGTTCTCTGTCTCTAGGGTGTTTGTTTTCGGGTTTGTTTT 
GGTTTTGTTGGTGGTAGGTTTTGTTCATTTGCTTGTTTGTTTTTGATATAGAATCATGCTATGCAGCCCAGGCTGTTCTCAAACTTGTAATCCTCCT 
GTCTTCGCCCCCTGACTACTAGGATCACAGTATGCGCC^CCACACCTGGCTCTAGAATCCATCTCTTGGAAGTTCTACCACCTCCCAATAAAGCCAC 
CCTGAGAATGAAGCCTTGGACAACCAGAGAATATTGCTGAGCCAAACTACAGCAAGAAGGCTGACCAGAAGGCTCTGCCAAATTCACAAGCCACTGG 
CCCATCTGCAATAGATGCCATCTGTAAAGGAGTCACTTCTCCCCAGGGTGACAGATTC^TGCAACCTCAATCAAAACCCTAGCTGTGGTGGGTTTG 
TGTGTGTGTGTGTGTCTGTGTGTGTTTATGTGTGTGTCTCTCTATGCATATCTGTGTGTGTGTTGGGGGTGCTAACCTGTGTCTGTGCATGCATGCA 
CAGTGTCTGAGAAAGAGGGTCCMGATGCCGGCCACTGTAGGCCATGTTACTGTTCGCATGGGACAGGCCTTAAAGAAGAGCTG<:AAAAGAGCCTTG 
TACCCACCTCCTCCAATGAGACCCACCCCAGTCTGCTTTCTCAGTTAATCTCTTCCTCTCTCAACTTGAAGCCTGTCTGGGAATTTGCTGTTGAGTA 
GATTATAGGCTTGGCCTCAGCTGGTCCACTAAAAGCAGGAGGTGAGGGCCCAGGCCTGAGGAAGCATTGGGGAGGCAGATCCCAGGCCTAAGTTTGT 
TGGAGACACATTCTCCAGGACCACAAATAGCTGACATGCTGTTACCAGATTGTCATCCCAGTACCTGCAAAGTGGGAGAACCAGAATCAGGTGTTCA 
AGGCCAGCCTCTC^U^AAAACAAAACTGAAGTAAGTACATCAACTCTACTCTGCCTTTTAAGAGTTAAAAAGGGGCCTGGAGAGATGACTCAGCAGTT 
AAGAGCACTGGCTGTTCATCCAGAAGAACCTAGGCTCAATTCCCAGCACCCATATGGCAGCTCACAACTGTCCATGACTTCAGTTCCATAATTCCAG 
GGG ATTTGT CA TGT TCACACAAA CA TAC ATG CAGGCA AAACAGCAATGC AC ATAAAAAAAAATAAATCATAAAACAAAC AAC AAAGAGTT AAAAGG A 
CAGGAGAAGAGGAACAAGAGAAGGTTCATGTGGCCAGGTTTTCTTTTCCCCAATTTAAAAGTTTCTTCTGAGGTTTACATCTCACACCCATAAAGAG 
CTCAGAAGTC^CAAAGCGAGGAGGCTTGATTCATCATGAAGTCATTTCAAAATTTCTCTTTCTGAATATAAACCAAAATATTTAATTAATTCCACA 
TACCACAGGAAGTACATTAC AGCCT TCAAATTC A CT CT GCTT AACAGT AAACT CGTGAACG AAT GCACAG AAC AGACC CCAGTC AC CCAC CAGCACT 
ATGCTAAATGTCCTATGCACATTACCTGAATTACTTCTGGCCATGGCCTGGCAGAGGAAATCATCCCACTGCCCAGCAGAGATGAGGGCATCTTTGC 
C CACAGCCT CT C CAATGC CAG AG AG C AC AGC TCTGTACGTTG CCTC CTGTAGTTACAGA AATGTTTTCCAA CTTCCTGGCAT CCTGAAAAGAAGTGG 
CTTGCCTTAAAGTGACTTTTCTTTTTCCTTTTCACAAACAGTGCAGGATGGCCTCCTGTGCCAATGACAGGTATGTGCTACCTCATCTAGTTCTGGT 
AGTACGTGGATTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAAG 
AGAGAGAGTGAGAGAGAGAGTGTTTGTGTACCTGTTCTTTCTTGTGTATGTGTGTACCTGCTCTTTCTTGTGTGCACATATGTGCACCTGTTCTTTC 
CTCTGTGTGTATGTATCAGCTCTTTCTTGGGGGGGGGTGTATGTCCCCCTGCTTCTGCCTTACAAGGGCTAGTATCTCAAGCAATGAGTTGGTGGAG 
TTACAGATGCCTGTGTCCAGCTCTCTATGGATTGTTAAGCTGCTAACCTCAGCACACTGGTTTTCTGCCCTGTCAAATGCCTGCGATGACAGCATCC 
ACCTGGTAAGCTGGAAGGTGGGGTCTGCTCCTCCATCCCACCTGAGAGGGAATGATTTCCTTAGATAGGCTGCGGCTGGAACTCACCTGATCTAGAA 
GTACACTGGGGGCACACAGGAGCCATGGAGCCCCTAAGGCCCACCAGCAGCTTCTGTCCTATCCCTTCTGGCAAAGGGGCCATCTGTCAAAGGAGGG 
GTTGAAAGATGGCCCAGAAGAGGTCTCCTCAAATCTCTGACCCTTACAGGACTCTGATGATAAATGATACAGTCTCAGTAGGCCTATCCTGGCTTCA 
TTTCTTCTGCTGTGATAAAATGTCATTGGGGGAAAAAAAAAAAAGCAGCTTAGGGAGAAAGGGCT 

ATTACTACATGAAAGGCAGTGTGGCAGGAACTTGAAGCACATGGTCACATCATATCCTCCCAGAGCAGTGAGATATGAATCCAGGGGTGGGGACTGG 
AATGATGGCTCAGGGGTTAAGAGCATCGGCTACTCTTCCAGAGGATGCCGATTAAAACTGTCTGTAACTCCAGTTCCAGGGGATCTGACACCCTCAC 
GCAGACATACTTCAAATAGACATACATGTGGGCAAACACCAATGTACTTAAAATAAAAATAAATAAATTATAATAAATAAATAGGTGCTTAGTTACT 
TTTCCCACCTTTAATCAGTCCAAGACCCAAGCCTAGGGAATGTGTCACCCACTCCCACACTGGGTCCTCTCACATCAGCTAAGATAGCCAAGACACT 
ACCCACAGCCGTACCCATAGGGCTTCTTGAGACTCTCTAGCAACGTGAATCGAGGTTGTGTCAAGTTGACAATTGAAACTATTATGGTGCTTAACTG 
CTC CCTGAT GAACAGGATGTGGC AG AGAC AGGCAT CC CAT TGGTT AATGTC AGAAGTGT CAT CCCTCCGTGGA CTG AATTTTTGCCTTTAACT TATA 
T TT ATTTGCTTGTTTGTT TATTAGTGTTTGTATCTATGTG CA TGTG TG AGTGCGGCT TC C CT CAGAGT CCAGAAGAATG TGT AG AATC CCT CT AGAG 
CTGGAGTAGGGGGCTGCCGGGTATGGGTGCTGGAAATCAAACCCGGTCCTCTGGAAAAATCAGCAAGCACTCTGACCTGTTAAATCATCTACCTAAC 
CTCATCCTTGCTCATGTTTTAAAAATAGTTATAGTGGCCAGGTGGTGGTGGCGCTCGCCTTTAATCCCAGCACTTGGGAGGCATTGGCAGGAGGATT 
T CTGAGTTC GAGGT CAGC CAAAGTG AGTTCCAAG ACAGCCAG CAGGGCT ATACAGAGAAAC CCTGT CTCAAAGAA CCAAAAAT AAATGAAATAATTA 
TAGTGGGTTTTTGGACATTCTCTGTATGCTAAGCACCTTCCAGATGGGCTTCCTGACAAGAGATCCGTGGCCTGACCCACACTGATTTGAAATAATG 
ATGAAAATTCAACTTTCAGAAGCAAAGGCTGTAGTTCCTCTCTGGCTTCTACCACAGTCAGTAACCAGTCCAGTTCCTTAGAGAGTCAGTCACCTTA 
GTACAC ATT CCAGCGGAGCC AAG ACAGT ATAGTG CTCCTTATAAACATGGAAGGGCAGTTCC TACCCAGCGAGTCAC TG CGGGCTTGGTGCCT CTAC 
TGAAGCGCAGGCTCAACCACCGTTCACCGCGGGGAGCGTCCTCAGGCTCTCGCGAGCCACTGCTGCGCGTCGCGTCCAAGGTTTACGGTAAGCGCGG 
CGCTCAGGACGACCACGTGGCGGCGAAACACCGGCCGGCGGGAATGCGCGGAATGCGCGGTGCGGCCTCGCGCGCTCCCCAAGCCGTGACCCCGGAG 
TTC CCGGCG CG CGTTTGTGGG AC CG CGG AGG CTAGTGCCATGGGTCCG CCCGCGCGTCCAAGCCACCAAGCAC CACAGGGAG C AGGGG CC CCATGG A 
GGGCTCAGCACGTGAGGCTAGACAACCTTTTCCAAAACCTGGGGCTAGGTTTG^ 

CCCCATTCTAAAGCCCCCACTGATCTGGATTAAGAATGGAGATGCCTGGTTTTGCAGGGAAGCAGGTCAAAGGGCTCTGCGCCCTGACCGCCCTTCG 
GAGTGCTGAGGGACCAGTGCCAGGGGCTGAAAGACCCCAGTTCCAGCAGCTGCGCCCTGGGATCGCATGAGGGTTCGCTCACCACGATTGAGCCACG 
TGGTCCTGCCTGGC CACTGGTGGCTCACCTTGGCCCGAGTGCCCGCTCCCCACCATTTAGTCTGGGAAGGGCCGAGGGCCAAAGCCCAAAAGGACCC 
TAGGGTTCTTCAGAGCACCTGTGATTCGTTCAGAGAGCTGGAGACCGAGCAGACATCCCTCCTTCAAATTGCATCTCTACCCTTGGGGGTCCCTGGA 
GCCAGGCTTGCCGCTCCAGAGCTCATTGCACGGCGCGCGTCTATTCGCTCTGGGTGGTGGG 

GGGAAGATAGGATGCGACGGTCCCAGGTGTGCACAGGGGATGAGCTGAGCATACACACCTTAGCCGAAGGGTGCCTGAAATCCGCTCAGGGTAACCT 
AGGCGGAGCAGCCGTGTAGCACGTGGGCTGCCACGCGCGCCCCAAAACGCCTTCTGGGTGAGGAGAGGGAAGCCGTAATGCCTCCGGAACCTTGGGG 
TCCATATTCGGAAGTGTTTTCTCTGGGCGACTTGAAGGCTAAATTAACAATGGCTAGCTGGAGCAGAAACAGCCAAGTCCTTTCAAGTTGCCGCCAG 
GTATGCGGCTGCAGGTGACCCCACCTAGGTGCGTCCGCTCTTCTCCAGGAGGGGCTACAGCCAGAGGCTCAGTATTGCCGCCCAGCCCCGCACCCCT 
AACCCGAGCCCCGCGCTTAGTCGCGAGGGTTCCTCAATGAACGCGCTCCCTCCCACTTCTCAATGAAGTTCCCACAGCCAGGGATAGTGGCAAACGC 
AAGACTAAATCTCCGCTCTCTTTGGAACCTTGGCTCCAGTCAGGTGGGGGTAGGGGTAGGGTTCGAGGAGGCAGGAAATCCGAAAGGGGATTAAAGT 






TACTTTGAGATTTTCTTTCCAAATAACGTGGTTCCCTCTCGGCTGGGCAAGTGGCCTTTTGTCTCCAAAAGGTCACTGACA^TAAGCGCAGAACAC 
TTTAAGCCCAGAGCGGACAGTTCCTGCCACGAGATTTTTTTTAAAGCCAATGTAAAGTTAAAATTGC^ 

CTATTTCGCTGGAAAATAAAAATCCAAGTTCTTTCTCTGAGATTCCTAAAGCAGTTGTCCCGGAAAGAACTCTGAATTGCAGCCCACCTCCTGGGCT 
CGGGATTCGGACAAGCAGGGTTTCCAGGATGGAGGATGGGCGCTCCCCGCCGGGTCCCAAATTCCTGCGCCTCCCCTACGGGTTCCTGGACGGCTCT 
ACACGCCCGCCAACCGGACTTGCACTTTTGCTCCTCTCCCTGAGCGCAGAGCTCAACGAAGTTCCTCGTGGAGATCTGCCCGGTCCGCCTAGTAACA 
GCACTGAGTCCGGATTGGCTCATGCAAATTTCAGTTTCCTGGTCTTTCCCGCGGTGGCGGTGGTGGGGGGTGGCTGCGGGTCGCTAGCTCGTGCTGC 
AGGGTGGGCGAGCCCCTTTATGCCCCGCTGCCCTTGGTGGCGCACCCT 

GCGCAGCCCTATAAATCATGCTTACTGCCGCCCGGTAGGGATTTTATGAATGAAAAGGCAGCCTGGCCGCCCTCGTGCTTAGGCTAAGGCTCCCAGG 
CTTGGCGGGGCCTGTGGCCAGAGCAAGCGTGGGCACTCTGCCCGGGCCACCCA^ 

ACTTTAGGGACCCCAATGTCGGGGGAGGGGGAGCGGAAGGCATGAGCTCGGAAGGTGAGGGGTTTGCGTCTTGTGTGCCGCCTGGACGCCGTGCGCT 
CCTTTACCAGTTTCTTCTGGGACACGAAAGTATTCCGTGGGAAATGTGTGTGAATAGTTCGCCTAGCTTGAAGTCGGGTATTGTGAAAACGTTTCCA 
AAACTAGGGAACTAAATAATGGACGAAGGTGGTGGAACCGCTTTATTCTAAAAATATTTTATTTGAATCTATAAATTATAAACTTAGGAGTCCATTC 
AGTTAACCACG ACC ATCTAT AGATT C TCTTT AAATAT C ACCTTAT CGGCTCACAAGTTTATCTTG ATTCTCAC CAGCCG C CCCCCCCC CCCAACTC A 
AGACTGCAATTCTAAAGGTGGAGAAACACCACCACCCTCA 

TGCAGCGATTTACTTTTCTTAATTAAAAAATAAATTAGGAAGGAGCCTATCGTGTCTC^ 

C CTGCAGTGGAGGAGGGGGATAC TGGGGGACCCTGGC CAGGATAAACC GGTCACTGT ATGAAGAC AAATCT C AG ATC CC ACCCCACCCCCC AGCGAG 
GAGGAATAGATGAAATAATGG CC ACC AT CTTGAG CTGTTGCTGGAATTTTCGGGGTT TTATTTTATTTTTG AG CGAGCG C ATGCT AGGCTGGGG ATC 
CTTTAAAGTTCAGATACCCCTCTGGCCCTTTGCAACCACCCCAGTGCGCCAGGATGGAGCCTGCACGAGAGCTTAGGGCTCGTCTGGCATCTTCGGG 
TGTTACACAGTTCCTGAATTTTACACGTGTTGATGAAATTGAAAGAAGACAGGGACGCTGGGATTTCTAAGCAATGGGTCCGCCTGTGGGTGCCTCG 
TGGCGTCCTCGGAAACGCACCCATTCTCCCGGTTTAAGAACAGGGTGTCCTTGCACCCCCAGGCTCCCCTTCCATACATTCTTCCTTGGCTTGCGTG 
TGGCCTGGCCTCCCTCCTAGCTGTCCTCCTGTCCAGAGCCGCCACTACCCCACCTCCACAGGTCTCGGAGGACCCTCTTAGGGAAAGAAGCCCCCCT 
CCCCCTTGCCCCGCTCTTTCCCAGTTTGGAGAGAAGCAGTCCGAGCGATTTGCATATCTACGAAGGCTGAGGGGGAAGGGNNN^NNNNNN^NNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNMW
NNNNNNNNNI^NNNNI^NIWNNNNNNNNN^N 

NNNl^NNNNNNNNNNNNNNNNNNNNNNNNNNOTTN

NNNNNNNNNITONltfNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNIWNNNNNNNNNNNNNNN^NNNN^
NNNNl^NNNNNNNNNNNNNNNNNNNNNNNNHNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 



NIWNNNNNNNNNWNITONNNNNNNNNNNNN^

TAGGGGCGCTGGGACAGGCACGAGGGGCCTCGTGGCGGCCGGGAATCACTAAATAAGACAGTGTGAGGAATATTTGGGGTGGGGGATTCTTTGTTCA 
AACAGCGGACCTGGGTACCCCGCACACGCTGCCTGGCCCAGAGAAAACGCCAGGCTGCAAGGTCGCGTGGCCCCCTAGTCAATGAGACTGACTGTTG 
TGCGTGCGGTTCTGGATTTGGAGGAACCTGTCTAGGTCAAACTGGGGGCGGAGGGGGACAAAAGCACATTTTCAGATGTTCAGCAGGAGACTCAGGG
GTCCCAAAATATTTTAAAATAATTTTTAAAGTGAGGCATAATGCCCTTGTAGAGGCAAACAGCGCCCGCACCCTGCAAAAGGGGGGCCCCCCGGAGT
TTGAGTTGCTGGAGCTCACCCCCCAATGTCCACTGAGCTCCTGACCCTCGGCTTGGATCAAGGGTCGTCCGATGGTTATTTTCAGGTCGTTTTTAGG
CACCTAGTTATTTTTTAAATATTTTTAAATATTTTTTGAAAAGATGACGTCTGGGAAATGCAGCGCGGCGGCCTGGGACGCCACCATTGTGTCTCGC
GTGCGCTCGAGCGGCGCCCAGCCCCCGGCGCCCCGCCAGCCCCGCGGTGGGGTTCTCTGGCTTGTGGCTTCCTAATTTCAACAGGGGACCGGGGTCT
CCGTGGCCACCATTGCTTCTACCGGCCCCGCCAGCTGCCCAAAGACTTTCCCTTTCAGTTTCAGGGTGGGTGGGCACTGGGAACCCTCGGAGCAGGA
GCGCATCCCAGGGAGTTGGCCACCGGAATCCCTCTCTGTGCAATCTAGGAAAAGGGAATGGGGTCGTATCGAGCGACGCCCCCCTCCCACGTTGACG

GTCAGGGTCCGGATCCCCGCAGGTCTGTGAGGAGCAGAAGTGCGAAGAGGAGGTCTTCCCGCTGGCCATGAACTACCTGGACCGCTTCCTGTCCCTG
GAGCCCCTGAAGAAGAGCCGCCTGCAGCTGCTGGGGGCCACCTGCATGTTCGTGGCCTCTAAGATGAAGGAGACCATTCCCTTGACTGCCGAGAAGT 
TGTGCATCTACACTGACAACTCTATCCGGCCCGAGGAGCTGCTGGTAACCACGTAACCCACCACCGACAATCCCTCTTTATGCTGGGTGGTGGGAGC 
TGCAAATGGCAGGAGATCCAGCCATCCATGGACAGATCTCCTGCCCCACGAAAGGCACCCGCACCGCACCGCACGACACCGAAATGGCAGTGAGCAG 
AATGCCAACCCATTCTACCTCTGTGCCCGCCCTCTTTCCGTACTGGACTCCCCACCATGTCTGTGGTGAAACCACTCTGAAATGTGGGCGGTGTGCG 
CCCCCTAGCGACGTGCAGGCCACGGCGCTGGCGGTGGGAGATTCTTTTTGTCCGCAGTCTGTCCCCTTAGTATTTCTACAGGCACATTTTGTTGTTT 
ACCTGTATGGGGTCCCGGGTTGACCCAACTCTTTAGACCTTCTCGAATCAGCCTGCCTTCCTTTTGACCGTGTTCCTCATTCTCTGCAGCAAATGGA 
ACTGCTTCTGGTGAACAAGCTCAAGTGGAACCTGGCCGCCATGACTCCCCACGATITCATCGAACACTTCCTCTCCAAAATGCCAGAGGCGGATGAG 
AACAAGCAGACCATCCGCAAGCATGCACAGACCTTTGTGGCCCTCTGTGCCACAGGTAAGGCTCCACCCCCATTCTTCCCAGGAAGAGCGGGCTCCT 
TGCCACTCCCCCTGGGGTTAAGGCCTCTTTCCTTGCTATCTATTCTGAGCCCAGGGTGCAGATAGTTTGTTGGAGAGATAGATGATACTGGGTGGCA 
CATTCT CTTGG CTAGTATCTT GG AG ACC T AAC AGAATG AC ACAGCTTCCCTGTGCCAACCTTTCC ACTG CC AGGAAGTG GTACGG CCACAGCATGCT 
TTCTGAAGAGGTTTTTGCCTCTTTTCCTAGGCTTCCTGCCTTGCCTCT 

C CC AGG CTGAGCCACCTG CT AGGGGTTC C TAC ATGGATGGGAC AAATGGTCC AG C CT CAGCCCCACGAGCT ACATACAGTACAGC C CC AGAGCAGGA 
ACTTAAGAGGCCCCACCAGGGGTACTGCAGGCCAGTACTTTTTGGCTGAGTCCTGCCTGGGAGGGTTGGCTGGAAACAGAGAAACTAGAACCTCCCT 
ATG ACT CCTCCAAGGGACCCTCTCTGGC CCC AGACCC CAGGT AGATTAGGCTG ACATAAGTC ATGGTAGAC AG CCATGCAGG AAC CCACA CACATAG 
ACAACCCTGCTACTTCTGAGTGGTTTCAGTACTACTTCTGACAGCTGCTCCTCACCTCTAACTGAGAACCCTACCCAGGAGGCTTAGCAATATTGAG 
GGGGTGTGGTGTTCTCCCACCTGTTGAGGACCCCCTTCCTCTGTGGCCCTGATCAGACCTCAGTTGCTATAGCTGTTTGGCCTGTGATGTCAGAAGG 
GCCGGTAGAGTATCTCCCCCCACCCCCAACCTCCACCTCACCCCCGCTGTCTCAGTGACATCTTCTTATGTCTGACAGCCCCATAGCCAGTGTTTCT 
AAG CTGCTGGTGTGAGAATTTGG AAATT CTAGCTTGCCAACTACCTACTGAGAGTGCTG AGC CCG CCCCAAGGGTACCAAAC AAGGGGGT GGC AGAT 
GGGAGTAGCAAGGCCATCGGTAGGCTGGAAGGATCTAAGGGGTGGGGGTGGCCCTGCTGGCAGTCAGGGAGGCAGCTGGCTGGGGTTCCTCTGAGGA 
GAG CGG AGCCTTTGGAAG CT CCAGGCAGC CT TTT ATGGAG CTGAAAGTGCCTTC AGAGAGGC AGC AGAGTTTCCCAGAGATAACTTGGGGATGGAG A 
GGCACAACCTCCCCCAGGATAGTTCTGGCAGTAGTGTGCCCCTCCCGGGACCCCTAGATGCCAGCAGAGTAATCAGTAAGCGCTATATCTGGGTACA 
GCTGGACTCAGCTCTTCCAGCTCTCCTTACTGCCTTTGTCTGTGCCCCTGGAATACAGGTGGGGGTAAAGGACAGCTTGGTGACCTGCGGTAGGCCA 
GTGTTCTGTGGTCTGGAGCATTTGTTTCTCTGTAGGAGCAACCAGCTCCTTTGTCACACGGGAACCACTTTGGGAAGAGGCCAGTGTGACATCACCG 
TGAATGGAAAG ATG CTAACAT CAAGGCC AGG AGATCTGTCGTCTG TGTGACAGGGGTTCCCC CTAT AGGAG ATGGG AAGGCATAACTT C CCTTCTGC 
AAACCGCCCCCCCCCCAACCTGGATTCCTGACTGCAGTTGAACAATGTGCTAGCTAGGGTTGGGGTTGTTAACACAGGTGGCCAGAGCGCTGTGTGG 
ATGGATACAGACAGTGGGAAACTGAACTTAATGAAGTATGTAGGTGTCAGACATGCCACTAAGCCAGGGTACACCTATGTCATCAGCAACCAGTGTT 
AGTAGCTAGGGATCCTTATAGGACATAACAGATCTTGCTACTGCCCGTGGGAGGGAGAAAAACAGGCTCTGGAAGGGGCTGGGGACTCCTGGTCCTG 
CTGGCTGGAGTCTCACTCCCTGTAAAAGGGCTGTTTGTGATTTGAACTTGACTCCATAGCCAGGTTTTGGTTCACTGGACTCACTAGCATGAACTGT 
CCTTGGTGACACCTGGAGGTATTGCTTGTCTGCTGGATCCCTTTAACAGCCAAAGCTGTATGTTCTCTCAGGGTGGTCCCCAGAGGGCCACCTCGCT 
ACAGCAACTCTCTGCCTTCCTGATCCCATTCCTTCCAGCCTCCTTGATCCCAGGCTCTGGGTGACACACAAGATGATTGATGGGAGCTGACATCTGT
TC&TTCTTGTGTCCTCCTCTGGCCTGGAAGCC 

CAGTTTTCAGACTCATCTTTGGTGGGTTTATGCGGAGAGGCATTCCATCGGTGTGACATACAACTGCAGGATGCTGAGTCCAGTGGACCTGAGAGGT 

GTGGAGTTGGTGACCCCAGATGAAGTCCTAAAATGACACCCCACTACTTCCCAGTTGGGACATGTACATATGAGAGTCCCTAAGACTCCAGGGCCCC 

TCATGAGGGACCTTGGCCTGGAGATCTGAGGTGGCTATTCCCAGCTCCCTCTGAGTCTATTCCTACTGGGTTTTCCTGGACTGGGGTCACATGTCTT 

CCCCAACCTGGGTCAGTAAGCCGGGAAGGCTAACTCCCTTCTGTGGAGTCTAGCCTGGCCTCTGAGGAACCTGGGCCTGTCATC 

CCTGTGTCTGAGATGTGTCTCTAGTCCTTGGGGTGCTGCCAGGCTGACTCrGGAAGAGACAGAGATCCCTGCCTCCCACCTCAGCCGCGCAAATACT 

GGGATGGAGATGTGAAAGCACCACACCTTCATAGATTCCATCCTC^ 

TGAACCCGGTGTGTATGCCAGGCAGC^TTTTAAAGTTTGAGCCAAGGCCTTATTGA^ 

GATGTG CTGGGGTCTAG AGATGC AGCACTGTG TGCACCTAGTGCC A CT TAGGTGTCT CC AAGGGAGCT AGAGTTGATGTGGGCATCCATTGGAAGGG 
CTTGAAGACCAGCCCCGAGGCTGTGGGTGGGAGCTGAATCATAGGCATGTCTCTGTCCCTCTCGCTGTAGATGTGAAGTTCATTTCCAACCCACCCT 
CCATGGTAGCTGCTGGGAGCGTGGTGGCTGCGATGCAAGGCCT^ 

TTCCAG AGTCAT CAAGTGTG ACC CGGTGAGT AAGTCTCAT CAAAGGAGGCTACAGCTGGCAAGGG CCTGGG AAC CTTGGTGGGCAGTGCC AAGGCAC 
TGTG TG ACTGGGCTGGAGCTGAGAGGTG C CAAATCTGGGT CCCCAC GAG CCTCTTCCCAGAGTCATGGGTC CCAAAGGC ATAGAGGGCTGTGGTGTG 
GTGACAGTGCCCCTGCAATGTGTATGGGTAGTGGCATGTGAAGCCCGGTCTGGCCCAGCCAGCATAGGACCATATCTCCCTCAGTGCAGACGTTGGG 
G ATTGAATCCATAAAGG ACTCTGGTATAGAC CTACAC ATT CT AGAACT TTC TTTGTGTG AGAGAG GGGTTTGCATCAGG CAGACTGTGCTACGTCTA 
ACTGTACCTTTCTGCCAAGGAGTACAGCTGATCATGCTTGGGGCTCTGGCCGTGAATGTTTTCTCAAGGGCATCACAAGTTGGCATAGACAGGCAAA 
AGTCCAGTTCTGGCTTGAACTTCAGTTGGGGGTTCAATGGAGGTCAGAGCCACTGGCATGTGGAAGGCTTCCAGGTGTGACAGCTGTTCTGCCCTGC 
CTGGCATCAGCCACAGGGACACCCCGTGCCCCATTCCTACATATCTCAATTGTAGAACATGCAGCCCCCTTCCCCTAAGTGTCTGGGGCTCCTGAGA 
ACTGAGAATAGGGTATGTGGGCAGGTTGGAGCCCACAGGCTTCAGAGAGGGAAGGGTGTCTGTGGGTCCATACTAGGCTACTTGGGCCTCACTTTCT 
CCATCTGTGAAATGGGGCCATTGCCCTTCCATCCTCTAGCCATGGTGTTCA^

CCAGGTGTTCCCCAGGAATCCTCCTGGCTTCAGGAATATAGACAGTGCCCTAGCTTCAGCCTGCCTAGGCAGGGCTTCAGTTCATGAGTCTTATTCC
TGGGCAAATGCTTTCCAGACCCTACCTGTGCTGGAGCTGTCCTCTCTCAGAGGCCACCCAAGGCAAAAAGTTATGAG^

GAAGTTCAGGGTACAGCCTGGGAGGAGGAACCTAGAGAGTAGCAGTACCTTTCTAGTCTCTGGCAGCTCTCACCGGATTTAGGGGTGGCTTGGGGAG
TTTCCTCAAGCCTTGCTCTTAGCAAAGCCTGCCTCGCAGGTCAAGGCCCTGGCCACCGGCCTCTGGCTAAACAAGGACCCCCTCCATCTCCGCCCGC 
CTCTCCAGGACTGCCTCCGTGCCTGCCAGGAACAGATTGAAGCCCTTCTGGAGTCAAGCCTGCGCCAGGCCCAGCAGAACGTCGACCCCAAGGCCAC
TGAGGAGGAGGGGGAAGTGGAGGAAGAGGCTGGTCTGGCCTGCACGCCCACCGACGTGCGAGATGTGGACATCTGAGGGCCACCGGGCAGGCGGGAG

CCACCAAGTAGTGGCACCCGCAAAGAGGAAGGAGCC!AGCCOGGGTGCTCCTGACGACGTCCC^ 
TGTTCTCTTTGTTGGTTGTTTTTCCTTAATCTTTCTCCTTTCTATCTGATTTAAGCAAAAGAGAAAAAAATATCTGAAAGCTGTCTTAAAGAGAGAG
AGAGAGAGAGATAGAATCTGCATCACCCTGAGAGTAGGGAGCCAGGGGGTGCTACAAAAATAGAATTCTGTACCCCAGTAATCAACTAGTTTTCTAT

TAATGTGCTTGTCTGTTCTAAGAGTAGGATTAACACAGGGGAAGTCTTGAGAAGGAGTTTTGATTCTTTTATATGTTTTAAAAAAAAAAGCTTAAGA
AACATTGCTTTAAAAAGGAAGGAAAAAAAATACAGCAAACCATTGTTAAAGT^

CCACAGCTTAGGCCCTCAGCCTCACTCCCTGGCTTGCTCAGTGCCTACAGCCCTGTTACCTGATACCTGTGCTTTATCCCAGGGGTGGGCAGACCTC
TTAACCTTATAGATGGTCAGTGCGACCTCTAGTCGTCTC^TGGCGTGTGGCACAACCCCCCTCCCCAGGGCTCAGCTTAATGTGCCCTCTCCCCCCA

ACAACCTGCAGGTTCACAGCGCCAGCCACACAGCGGTAGGGATGAAATAGTGACATAATATATTCTATTTTTGTAACCTTCCTATTTTGTAGCTCTG 
TTTAGAGAGATGCTGGTTTTTGCCTGAAGGCCCTGCAGCCTGCCCACATCAGGTTAAACCCACAGCTTTTGTGTGTGGTTTGTTTTGTTGTGTTTTC
TTTCTCTATGTTCCAAAACCATTCCATTTCAAAGCACTTTTGGTCAGCTAGCTGGAGGCAGTGTTGCTGGTGTGTGTTGGGGGGAGGGGTTCTAATG
GAATGGATGGGGATGTCCACACACGCATTCAGATGGCTGTACAACAGGTTGTAGGGCTGGTAGTATGAGGTGCTTGGGAAGTTTTGTTGGGTCAAGA
AGAGAGAACTCTGTTCTCGCACCACCGGGATCTGTCCTGCAAAGTTGAAGGGATCCTTTGGTGCCAGCTGGTGTTTGGAAGTAGGAACCATGATGGC

ATTACCTGGACAAGGAGATTGGGGACAACTCTTAAGTCTCACACAGGAGGCTTTTAAACACTAAAATGTCTAATTTATACTTAAGGCTACAGAAGAG 
TATTTATGGGAAAGGCTGCCCATGACCAGTGTGACTCAAAGCAATGTGATCTCCCTTGATTCAAACGCACACCTCTGCCCTGCTGGAGAAGGTTTAG

GGCCATGTCTGAGAGATTGGTCTTTTATTGGGC^U^CGGGGGGGGGGGGGGGGTCCTTAAAAAA^ 

CTTTCCCAACCCAATTGGCCCCATTGGAGAGCCATCCAAACTGAGGAAAATTAGGGGACT^ 

CCCCAAGTTAACAACAGTAGGTAATTTGCACACCTCTGGCTCTGTGCCTTTCTATTAGGACTTTTTGGCAAAAGGTGGAGAGCGGGAGGCTTAAGAG 
GGGATGTGAGGGAAGAGGTGAAGGTGGGACCACATGGGACAGGCCACGGCTCCTCTCATGGCGCTGCTACCGATGACTCCCAGGATCCCAGGCGTTC 

TGTTCAAGTTTTAATTTCCTCATAGAGTGTATTTACAGATGCCCTTTTTTGTACTTTTTTTTTTTAATTGTGATCTATTTTGGCTTAATGTGATTAC 
CGCTGTATTCCAAAAAAAAAAAAAAAAAAAAAAAAACAGGTTCCTGTTCACAATAC^ 

GGGCGGTCTGCCTCCAGGGAT<XTGGGACCCTGATGGCGATCGTCCTGTCATGCTGGGCCCTTCATTTGATCTGGGACATAGCATCACAGCAGTCAG 

GGCACCTGGATTGTTCTGTTATCGATATTGTTACTTGTAGCGGCCTGTTGTGCATGCCACCATGCTGCTGGCCCGGAGGGATTTGCTCTGAGTCTCC 

GGTGC^TCATTTAATCTGTTAGGTTCTAGTGTTCCGTCTTGTTTTGTGTTAATTACAGCATTGTGCTAATGTAAAGACTCTGCCTTTGCGAAGCCAG 

CTGCAGTGCTGTAGGCCCCCAAGTTCCCTAGCAAGCTGCCAAACCAAAACGGGCACCACCAGCTCAGCTGAGGCATCCCAGCCAGGCAGGACCCTTG 

AGGGCCGCTGTGTCCATGGTGATGGGGTGAGGTTTTGGCCAAAAGGCCAAAGACTGGTGGTGGGTCCACGGAATCTGCCCTGTGACATGAAAGGCTT 

TGAGGGACTCTGGCTGGTGGCCAGGTTGGCTTTTTGTATTTCTGGTTGACACACCATGGC^ 

AAAAAAAAAAGACAAAAAATCTAGAAAATAAAATTGGTAAAATCTCAGCTCACTGTC 

CCATGTGGGGTGGCATCTCTGGC CC TGGCAC C AA CCTGGG ATT CCC AAGGGGAAGGGTGT ATGCAAGTGACAG CCAAG AGCAAAGTGGGAGCCTCTG 
GCGTCAGGGAGGCCAGCGACAGCCACTGATCAGAAGCGGCTCTGGGTCCATCACTGCCAGCTCACCCTCCTGCTT^ 

GAACCCGCGTTCACTGCCCCTTGTCTGGCGCAAATGGGTGTTTGCTCTGGGTGAATCACCGTTTCTTAGGTGCTGCCAAGCGCCCAGAGAGCCATGC 

TTAGTGTTAGGGCCAAGACCCAAGGTGGCCTTTCACATAGTTCTGCAGACCTGAGTGGCCTGGCAAGACTCCTGTGATGTCTTCGGGGTGGCAGTTA 

CTCATGACrAGTATTGTTTCCAAACCCTGTGGGGACAGGTGGCTTTTAGACATCATCTTCTACAAGGTAGCTTTTGGGTCTCAGCTTGGCTGTTTTG 

GTGCCTGGT GGGCC C CTGCC CTT CTGGAAAGTGG AGGGTGTAGAGG ACACC TGGTCTGAGTGGTG CCACCC AATGATGAAAGC AGAAAG AG ATGGAC 

GGAGGGAAGCCCACAGTGACCTTGCCAGGCGACCTACTCAATGAGAGTATATGGGAACACCAGAGCCAGAGGTGGCTTATGAAGACACAGTACTCAT 

CTTGTACCAAATTGTCTGATGGCAGGATCCAGAAGGTATTTTGTCAGCCCAGCTCCATATAAGCArCCGAGCCGGGTCCACAGT 

GGAGAATGGGCATGGGGCCGTCAGCATTCCAGGAAATGGCCAAAGGGCATTGAGGTGGAAGAGTGCTTTCC^^ 

GCGAGCTTATCGTGTCCAGGTAGGCTGACATGAGGAAATCCGCTGGGGTGCAGCAGCCAGGCAGCCAGTGCAGCAGAGTGGTACTAGGAATGCAGAC 
ACAAATAGCAAGGGGCTAAGCATCCACTCAAAGSCTGCTGCTGCTGCTGCAGCTG 

GCCCAACTGGAATGGGCAAGCATCACCATCAACTGCACAGGGCTCACACATGGTGCCGTCTCTTAACGGTGCAATTGGCTTGTCTCGTCATCCAGTG 

GTCTGTGATTCCCAGCTGTGGCTGATAGCCAAGCCTCTTTTCTTGGTTTTAGCAGTGAAACCCCAGATACCACTGGCCACTCCAGGCTCC 

AAGTTTAACTGGTAACACTGGAGGCTAGATCCTCTGCTGGGTGTAGAGATACTCCTGTTCTAAGGGTGATGGTAAAATAGCTTTTTGAGGGCCCTCA 

GAGATAAGGACTC^GGGAGCCAGCACACCCAACTGTCTCCTCTCAGAAGCTCAATGGCTGGTTCTCTCTCTGCCTCTACCAGCTCCATGGTCCCCTG 

CCCTGTGTCCTGGCCAGGCTCTCCTTTTGCAGGCGGCCTCCATCCTGCTGAGCCATCCTACCTCTCTCAGCCAAGCCAAGCTCTTGAACCTGCCTGG 






CCTCTCTGAGCCTCAGCTCCCTGAACTGGAAAGATTGCCCCTGGCTCCCCATTCTGTGTTGCCCCACTGCCTGGACCTGCTCCAGCCTTCCTGTTTG 
T CC TAC AGG ACT CACAGTGAATGTGGCCCCTGTGGAG CATGTGGAGCATGACATTGC AC AGGTGACTC TGTGTAGCCTCTCCCCTCCTGACTC C AC A 
TCCTAGAACCAAGTGTGAGGAAGTTAGCCTGTTGCCCATGGGTCAGCTCAACTGCT 

AGAGGATCTGGGCTGCTGGGGCCCACAAAATGCCAGCAGTGGGAGCTGAGGACTCCAAGGGGTGGGACAC^ 

C CCTGC AGTGCCCCAAAG CCCTGGCCACAACATC C CAGGAGAC AGT CAATGCGGGTTGGGCTGTGTGTTTGGTGGC AAGGCT ATGCAGGCCACATAA 
GGCCAGTGCTGGGAGCTGGCACTGCCAAGTGTGCTTGCCAAGCAAACCTGTACTTGAGCTGATAAAGCAGTCTGTTGAGTGTGTGGGGTGGGGTGTA 
TAGGTAACATGAGGCGGGCATGTTGTAT ACAGGG CCC C ACGTGGG C AC AGCACAGTGCTGGTGGGATGCAGTACCCCTAGTTCTG AGC TGATCTGGC 
CTCTCTTACTCAGGGATGTCATCCCAGGAATGTATTTGATCAAACAGTTGTGCAAGAGGAC^AAGC^ 
NNNNIWNI^NNNNNNNNNNNNNNNTONNN^ 

NNNNI^NNNNNNNNNITONIWNNNNNNNN^ 



NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNIOTNNNHNNNNNNNNNNMN^ 
NNNNNNNNNNNNNNNNNNNNN^NNNIWNI^ 

TGCTAGATCCTTGGACTTCCATTCACAGCTGCTACTGAACCATTGTTGGGAATTGGACTGTAGACTGTAAGTCATCAATAAATTCCTTTACTATATA 
G AG CCTACC CAT AAGTTCTGTGACT CTAG AG AAC C CTG ACTAATA CAAC ATG CATAC AC ATAAAATTAAAATG TGCCAG GCAGTGGTAGC ACATGCC 
TTTAATCCCAACACTTGGGAGGAAGAGGCAGGTGGATTTCTGAGTTTGAGGCCAGCCTGGTCTACAGAGTGAGTTCCAGGACAGCCAGGGCTACACA 
GAGAAACCCTGTCTCGAAAAACCAAAAAAATAAAAATTAAAAAAATAAGTAGGTAAATT^ 

TGAATATTCTACAGTTTTTTAAGAGTTTGTCAAACATTTTGATGTTACAATGGCCAATGATGACAGTAGCCCCTCACATAAATACATAGTACAGAGG 
GCAAAAGTGTGCACAGGGCCAGTTCAGAAATTGCTGGAAAATTCTGTCCTGACCCTAAGACAAGTTTCACTGAACAATTTTTTTATGAAAATGGTCG
GGCTGGAGAGATGGTTAAGAGCACTGACTACTCTTCCGAAGGTCCTGAGTTCAGGTCCCAGCAACCACATGGTGGCTCACAACTATCCATATTGAGA
CCTGACGCCCCCTCCTGGTGTGTCTGAAGATAGCTACGGTGTACTTACATATAATAAATAAATAAATCATCAAAAAAAAAGAAAATGGTCAATTTTT
AAAATCAGACATTCTAATATAATACAAACCATAGCTATACTAATTTGACACCTGTTGAGGCCTGAGAGCACAACACATTAAAGACTTGGCTTGCCAT 
AAGGAACCCGTCATGCCTCTGTGAGAGCAGAGAGCTGTGTACAAGATGGATGGACGTTGTACTTCTTACAATGCTGGTCTGGGTTGAGGGGGTTTCA 
TTCGGGTGCTGTGTGTTGGTGC^GATGGAATGACCTCGTGCAGTACAGAGCACAGAACCGAGACTGCATTGAAGTCTTGG

CGTCTAGAAGCACCCTTAGATCTGATTATGCATTTGATGTACTTCATTTTTAGACAGGACCTCACTATGTAGCTTTGGGTGGCCTGGCTGAGTAGAC 
CAAGCTGGCCTCAAATTCCAGAGGATGCATCTGCCTCTGCCTTCTGGGTGCTTGGAATCAAGGTATGCGTGCCACCAGGCCCAGTAAATTTAATTTT
TTCAAATTAATTCTTTTCTCATGTTTAGAAATCTTTTCCTGGTTTTTGTTGTTGTTTGTTTGTTTGTTTGTGACTCCTACATTGGGTTTTTGTGACT 
CCCATACAGAATTGGGATCCACAGTTTTTAAGAAACTGGGCACTAAATTACCCCTGCCTTATGCAAAGCCCACAAGGACAATCTGGAGAGACCGCTT 
ATGTGAACAGCAGGAGCTGACTCTGTTCAGGGAGCTAAAGGGTAAAACTGTGTATGCTGGACCACTCCATTTCTGGGGGTTCCTGGCAAGCCTTAAT 
GTTACTCATCTGGACCTCCCTCTATTTTCAGGCTGTCTTGGGTCTTTACGTTGTAGCTTCTCATGGCCTCCTCCTCAGGGTCCACTACCACCCACCT 
TAAACAATGCCAGTTCCAGCCTTTCAAGCTGAAATCAAGTGTAAAAGGCCCAGCCATGCTTGAGCAGGCCTTCCCTGCACAAGTTCAACCACAAGCA 
AACTTCTGAGTGAACTGGGTGGCTAGGACCCTTCCAGAGCAGACTTGGCAGCAGGCCTCTGAGCTTGAGAGCAGGTTCCAAAACTCTTGGTTTCAAG 
GGTCCGCACCCACTGGCAACAGTAAGGAACCTTCTGGGAGAATGGGCAAGCACTTCAAGTCTTCAATCCTGCAGACACTTGGTCTTTAGGCACTGAT
GCAACCTCTCCAATAGAGGACTAAAGGACCCACCATCCATGAAGCATGGGGACAGGCACTCCAGCCCTGCAACCCAGGAATGGCCAGGGTCTTATGG
AAAGTGCCTCTGGGAAGCTGACCACAGAAAGGCAAGGAGTGACTTCCCTTCTTAGAATATTAACCTGAGGTCACCCTAACTTCTAGGAAATGTGAGC
AAAGCCCTATT C AC CCC AGATGGGGCTCCAG TGACAGACCAA AGTATA C TT CTG CCACAGCCC AATTTGGTGAAC C AATGAGTTT CTAAGTGGTTAG 
TTGGTTATAGGAGTGTGGGAGACACCCCAGCTACAGATAATGATCTTGTGGGAGTTAAAAAAAAAGGGTCCCCACTTCTAGTATACTCCTGAATCCC 
CTAAGAGCTCTGCTGCTGGGGACAACCAGGTGAAGTAGTGGCTGGAATCCCAGGGTCCTCCCTGCCTTCTCCCTGTCTGTCACCCCCTTACCATGTG 
GTTATCTCTGTAACTAGTTGCC^C^GCAACCTACCCTACATCATCATTAAGATGGTGGTAGAGTGAACTGAACTACAACAC^GAGCCACAGGCTGCT 
GACCTTTGAGACAGGTTCCTGGTTGTCAGGGCTCTGGGGTCCCATGCCTCATTTTCTCTGTCAGCCCTGTGACTTCAGTTTCCTGAGATACACAGCA 
ATTTCTAGCTCTGCTCATAAAGCTGGGAGCCACTAAGGTCAGTGGGGCAATCTTTCTAGTCAGCACACTTTGGTCATCTCCAGGTGGAACCTCAAAG 
TGAACATGGGAGTCTCCCTCCCAGGAAGTCTCCCTGCCACCTCCTCAGGTAACTGAGTGCTCCCCTACCTCTGGGCTCCCAGAAGGTGGCAGCCATT 
ATC AGAG AG CC C AC ACGGGGTCTGA AGC ATCTGAC AGCCCTT C AAC ACTGCT CCTTCTCTATAAATCCATGCTTTC CTAGAC CTGCTAGATT C AAGA 
CATGCCATAGGCTCAAATGAGCAAGAAACTAGGCTGGAGCAGCAGGC^CAACTCTCACCCC^CTArGCTTTGTCCCGCCTGGAATTGGCCCTTTACT 
GTGACCTCCTGCAGTCTCTCCACCAGCCCACGTTGCTGACTACAGTCTGGTACATGTAGACTTAGTGGTATGGGGAAGTCAGGGCTTAGCAAACAGA 
ACTGTGGCCCCTTCTTCCAGTTTCCTCATCTCACTACCAGGCAGAGGAAGAAAAGGTTTGGGGTAAATAGAAAATAAGGAGGGAGTGAGCCAGAGTT 
TCTGGGTGTTTGGTCTCAACACAACTATCCTACAAACATCACCATCAGCAGGGCTGGGGGATGGAGAGGTGGTTAAGGGTGCTTGATGATGCTCTTG 
AAGAGGACCCTGCTTCAGTCCTGACCACCATATGACAGCTCACAAGCACATGGAACT 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 



NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNN23NNNNNNNNNNNNNNNAGGCAGGCGAATTTCTGAGTTCAAGGCCAGCCTGGTCTACAGATTGAGTTCCAGGACAGCCAGGGCTAC
ACAGAGAAACCCTGTCTCGAAAAACCAAAAAACAAAACAAAACAAAACAAAACTCCTCACTAACACTAGCTTTATACAGAACTTCAATCTGTATTTA
GCTCCAGGCTACAATTGGTCCTGCAGTCACAGTTAGGGATTACATAGTGTCTCTGCAG 
AGGACCATTCTGCCAGGCCCTCTCTCTGTTTAGTTACTTTTCTGTGTTGACAAAAAAAAA 

AGGAGTTTATTCTGTCCAGTTTATAGGTGGCAATCATTATGGTGGGAGGCATGACAGAAGGAGCCTGAGGCAGATGCTCATATCGCATCCACGGCCG 
GTGAACACTGTTGCTCAGCTCACTTTCTCCTTTCTTTTCAGTCCIAGGACCCCACTTCAT^ 

TTAAGCCTCCCTGAGAAGACCCTCCCAGGCACACCCAGAGGTATCTACGAGGTGATTCTAAATTGTCTCAAGGTGACTATCAAGCAGGCAGCAAGCT
GAGCAGAAAGACTTGGGGAAAGCCAGCCCAGAAACCTCTGCCTCAGACCTCCAGCA^ 

AGTCACAGGAAGCCAGAACAGGGTGGGATGAGTGATAGAAAACAGACCGGTAGTGGGGACACATAGCTCACAGGACATGGTCTGAGAAGGCAGGGGA 
TTGACCAGCAGCTCAGTGATTATCAAGGAC^CAGGTTCCACTGGTACCTGACTCTGC 

TATGGCAGTCCAAGTAACACCTTCATGGTACAGCATCCAAAAGCAGAGCATTCTTCTGACGACCTATGCCCTCCCCTTTCACAGCAGGTGTGACTCA 
GTGGTGGGAGGAGGCGGCATGCCCACGTGGCAGAATGGTCCTGTAGTCTGTCTAAACTGTGATCCAGAAGGCCTTTTGGGGTAACCAAAGCATTCTG 
GGGACTTGATGGGGACTTTGAGAAAGGGTCAAGTGGGCAGAGGTCAAGGCTTGTACAGGAACATGGAAAGTTATCAGGCAAGCGGCTGTGGTGATGC 
TGATGGGTGGGAGATCTGACACCTGATGTACACAAGGTGAGGGGATAGGAGGTAGAGATGAAGGGAGAAGAGAGGGGAAGAGACAAGAGAGGAACTA 
GCAACTGATTGATACCTGTAGTGAATGTGACAGGCACAGTAGGCCCATATCTGTGAACCTACATCTTGACATGCACCATCTTGGCACTGAAGGACTG 
CTGCAGCTGTCAGTAGAAATGATCCCAGACAGACTACTGCTTTAAAATACACTATTTATTCTTTTTTCTGAATATAATAGGAAAGTAAAATATGGTG
ATAACAGCAGCTCAGTTCTCACACCGAAGAGTCCCTGATCCTGGGCACCTGAGAGAGTGGTGCTGAATACCTGAGAGGTAGCCGCCAGGCCCAGGAG
GCCAGGTGCATGGTGCCCCAAGCCTCAGACTCAAAACATATTTACAAATCCTTCGGACATACAGCTAGGCTCTATTCTGACTAGATTATAACTGAAG 
ATAACCCATTTATTTTAACCTGCATTCTGCCAGCTGGCTGGTTACCTGTGCTCAGGTACCATGTGTCCATCTCCTCACATCCTCCCTGGTAGATCTC 
CCACCTGGCTCTATCCCAGAATGCTTTCTCTCTCCCAGATGTTCCGCCTATTTCCTGCCTAAGCCATAGGCCATAGGCTTTTTGCCACAGGTGAGGC
ATCCATACAACACAAAGATATTCTCTCTACGAAATGCCTGAGAGCACTGTGCCACTCCTGCCAGCTCGTCTTATATCTGCCTGTAAATACCATGCCA
GACACACAAGCTGTTGCTAAAGGAAGTACAGTGTCACCAGGCAAGCCCTACACATTCACAGTGGCGGCCACCAATTAGTCCTGCTTTGGTTTCCAAT
AGTTCCCCTAATGCCACTTTTTGTTTGTTTGGTTTTGAATTCACTTTGTAGACTAGGCTGAC^ 

ACTG-GGATTAAAGGCGTGCACCACCAGCACCCAGCTCCATTGCCACTCCTTAATGTATAGAGCTCACCCCACACCTGTTAGGAGCAGCATGAAGGCC
CTGCTTCAGCTTCCTCCCTTGTCTCGTCACCCTTGTCTTCTCAGCTGGGCATCCAGAGGCAAACCAAGTCCAGTGGAGCCTGAGTCCTGTGCAGATA 
GC^GGAATCAGGTCCTCCAGTGGTGTTGGCAATGACAGCGCATGTTGGAGGCAGAGGCACCA 

CAGCTCTGCC^CCAGGCCCTTAGACTAGATGGGTCTATTTTATCACTTCTAAGCCATAAGGTATCCTCCACCTCTGGGGCTGCCAGAGGGATGGAGT 
GAAACAAGGAGGCCCAAGCATGGCCAGCAGAGGCTCAGCAGGCAGCAAAGATCCTGTCACGGCTTCCCATGACCCATTTCACATCGGCCTGCCTCTG
TGAGTCCCACCCTCAGGGTAACCCTACCCAGGGCCACTCACTCTGGCCAGTGACAAGCGGTCTTTTGAGTGTGTTGCCCCTGTGCAGAAGGCAGGGT 
GGTCAGGTAAGCCACCTAGGCAGAGACTCAGGGAAGAGGCCAAGCTGGTCCTCCCACAGAACAATGGATTCCTCTCAGGCTTGCCTTTAATCACTGT 
GACACTTAATGTGCAGACATCTGCTCTTCTGTGGGTCCTCAAAACGCAAGTCC^ 

GTGAGCACAGCTGAAAGTACCCAAGAGCCCATGCCTTACACCTCCTGCACTTCTGCTCAGCTTTCCCTATCCCAGCACAAGCCCAAAGGCTATGTCG 
CCTTAACGTCCCCAACCCACTTTCAGTCATTCTACAAGGGCCAAGGAAACAAGTTCCTCTGGGTCTATATCTGGGCACACATTGCCCAACACAACCA 
TCACATGAACCCTAGGGCCCACGCCAACCTGGCCTTCAAAGCCCCCTCAGGCACACCCCATAGAC 

MOUSE SEQUENCE - mRNA 

GAGGGCTGTCGGCGCAGTAGCAGAGAGCTACAGACTCCGCGCGCTCCGGAGACCGGCAGTACAGCGCGAGGCAGCGCGCGTCAGCAGCCGCCACCGG 
AGCCCAACCGAGACCACAGCCCTCCCCAGACGGCCGCGCCATGGAACACCAGCTCCTGTGCTGCGAAGTGGAGACCATCCGCCGCGCGTACCCTGAC 
ACCAATCTCCTCAACGACCGGGTGCTGCGAGCCATGCTCAAGACGGAGGAGACCTGTGCGCCCTCCGTATCTTACTTCAAGTGCGTGCAGAAGGAGA 
TTGTGCCATCCATGCGGAAAATCGTGGCCACCTGGATGCTGGAGGTCTGTGAGGAGCAGAAGTGCGAAGAGGAGGTCTTCCCGCTGGCCATGAACTA 
CCTGGACCGCTTCCTGTCCCTGGAGCCCCTGAAGAAGAGCCGCCTGCAGCTGCTGGGGGCCACCTGCATGTTCGTGGCCTCTAAGATGAAGGAGACC 
ATTCCCTTGACTGCCGAGAAGTTGTGCATCTACACTGACAACTCTATCCGGCCCGAGGAGCTGCTGCAAATGGAACTGCTTCTGGTGAACAAGCTCA 
AGTGGAACCTGGCCGCCATGACTCCCCACGATTTCATCGAACACTTCCTCTCCAAAATGCCAGAGGCGGATGAGAACAAGCAGACCATCCGCAAGCA 
TGCACAGACCTTTGTGGCCCTCTGTGCCACAGATGTGAAGTTCATTTCCAACCCACCCTCCATGGTAGCTGCTGGGAGCGTGGTGGCTGCGATGCAA 
GGCCTGAACCTGGGCAGCCCCAACAACTTCCTCTCCTGCTACCGCACAACGCACTTTCTTTCCAGAGTCATCAAGTGTGACCCGGACTGCCTCCGTG
CCTGCCAGGAACAGATTGAAGCCCTTCTGGAGTCAAGCCTGCGCCAGGCCCAGCAGAACGTCGACCCCAAGGCCACTGAGGAGGAGGGGGAAGTGGA 
GGAAGAGGCTGGTCTGGCCTGCACGCCCACCGACGTGCGAGATGTGGACATCTGAGGGCCACCGGGCAGGCGGGAGCCACCAAGTAGTGGCACCCGC 
AAAGAGGAAGGAGCCAGCCCGGGTGCTCCTGACGACGTCCCCCTTGGGGACATGTTGTTACCAGAAGAGGAAGTTTTGTTCTCTTTGTTGGTTGTTT 
TTCCTTAATCTTTCTCCTTTCTATCTGATTTAAGCAAAAGAGAAAAAAATATCTGAAAGCTGTCTTAAAGAGAGAGAGAGAGAGAGATAGAATCTGC 
ATCACCCTGAGAGTAG^GAGCCAGGGGGTGCTACAAAAATAGAATTCTGTACCCCAGTAATCAACTAGTTTTCTATTAATGTGCTTGTCTGTTCTAA 
GAGTAGGATTAACACAGGGGAAGTCTTGAGAAGGAGTTTTGATTCTTTTATATGTTTTAAAAAAAAAAGCTTAAGAAACATTGCTTTAAAAAGGAAG 
GAAAAAAAATACAGCAAACCATTGTTAAAGTAGAAGAGTTTTTAGGTTGAGAAATGTACTCTGCTTTGCTGAAAAGCCACAGCTTAGGCCCTCAGCC
TCACTCCCTGGCTTGCTCAGTGCCTACAGCCCTGTTACCTGATACCTGTGCTTTATCCCAGGGGTGGGCAGACCTCTTAACCTTATAGATGGTCAGT 
GCGACCTCTAGTGGTCTCATGGCGTGTGGCACAACCCCCCTCCCCAGGGCTCAGCTTAATGTGCCCTCTCCCCCCAACAACCTGCAGGTTCACAGCG 
CCAGCCACACAGCGGTAGGGATGAAATAGTGACATAATATATTCTATTTTTGTAACCTTCCTATTTTGTAGCTCTGTTTAGAGAGATGCTGGTTTTT 
GCCTGAAGGCCCTGCAGCCTGCCCACATCAGGTTAAACCCACAGCTTTGTGTGTGGTTTGTTTTGTTGTGTTTTCTTTCTCTATGTTCCAAAACCAT 
TCCATTTCAAAGCACTTTTGGTCAGCTAGCTGGAGGCAGTGTTGCTGGTGTGTGTTGGGGGGAGGGGTTCTAATGGAATGGATGGGGATGTCCACAC 
ACGCATTCAGATGGCTGTACAACAGGTTGTAGGGCTGGTAGTATGAGGTGCTTGGGAAGTTTTGTTGGGTCAAGAAGAGAGAACTCTGTTCTCGCAC 
CACCGGGATCTGTCCTGCAAAGTTGAAGGGATCCTTTGGTGCCAGCTGGTGTTTGGAAGTAGGAACCATGATGGCATTACCTGGAC^AGGAGATTGG 
GGACAACTCTTAAGTCTCACACAGGAGGCTTTTAAACACTAAAATGTCTAATTTATACTTAAGGCTACAGAAGAGTATTTATGGGAAAGGCTGCCCA 
TGACCAGTGTGACTCAAAGCAATGTGATCTCCCTTGATTCAAACGCACACCTCTGCCCTGCTGGAGAAGGTTTAGGGCCATGTCTGAGAGATTGGTC 
TTTTATTGGGCAACGGGGGGGG<3GGGGGGGGGGTCCTTAAAAAAAAAAACCACAAAGACAGAGATTTGGTCTGCTTGACTTTTCCCAACCCAACCCA 
ATTGGCCCCATTGGAGAGCCATCCAAACTGAGGAAAATTAGGGGACTCCAAAAGAGTTTGATTCTGGCACATTCTTGCCGCTGCCCCCAAGTTAACA 
ACAGTAGGTAATTTGCACACCTCTGGCTCTGTGCCTTTCTATTAGGACTTTTTGGCAAAAGGTGGAGAGCGGGAGGCTTAAGAGGGGATGTGAGGGA 
AGAGGTGAAGGTGGGACCACATGGGACAGGCCACGGCTCCTCTCATGGCGCTGCTACCGATGACTCCCAGGATCCCAGGCGTTCAGAACCAGATTCT 
CATTGCTTTGTATCTTTCACGTTGTTTTCGCTGCTATTGGAGGGTCAGTTTTGTTTTGTTTTGTTTTACAATGTCAGACTGCCATGTTCAAGTTTTA 
ATTTCCTCATAGAGTGTATTTACAGATGCCCTTTTTTGTACTTTTTTTTTTTAATTGTGATCTATTTTGGCTTAATGTGATTACCGCTGTATTCCAA
AAAAAAAAAAAAAAAAAAAAAAGAGGTTCCTGTTCACAATACCTCATGTATCATCTAGCCATGCACGAGCCTGGCAGGCAGGTGGGCGGTCTGCCTC 
CAGGGATCCTGGGACCCTGATGGCGATCGTCCTGTCATGCTGGGCCCTTCATTTGATCTGGGA 

TCTGTTATCGATATTGTTACTTGTAGCGGCCTGTTGTGC^TGC(^CC^TGCTGCTGGCCCGGAGGGATTTGCTCTGAGTCTCCGGTGCATCATTTAA 
TCTGTTAGGTTCTAGTGTTCCGTCTTGTTTTGTGTTAATTACAGCATTGTGCTAATGTAAAGACTCTGCCTTTGCGAACGCAGCTGCAGTGCTGTAG 
GCCCCCAAGTTCCCTAGCAAGCTGCCAAACCAAAACGGGCACCACCAGCTCAGCTGAGGCATCCCAGCCAGGCAGGACCCTTGAGGGCCGCTGTGTC 
CATGGTGATGGGGTGAGGTTTTGGCCAAAAGGCCATAGACTGGTGGTGGGTCCACGGAATCTGCCCTGTGACATGAAAGGCTTTGAGGACTCTGGCT 
GGTGGCCAGGTTGGCTTTTTGTATTTCTGGTTGACACACCATGGCGCTTCCCAGCACAGACATGTGACCAGCATGGTCCAGGAAAAAAAAAAGACAA 
AAAATCTAGAAAATAAAATTGGTAAAATCTCAAAAAAAAAAAAAAAAAAAA

MOUSE SEQUENCE - CODING

ATGGAACACCAGCTCCTGTGCTGCGAAGTGGAGACCATCCGCCGCGCGTACCCTGACACCAATCTCCTCAACGACCGGGTGCTGCGAGCCATGCTCA
AGACGGAGGAGACCTGTGCGCCCTCCGTATCTTACTTCAAGTGCGTGCAGAAGGAGATTGTGCCATCCATGCGGAAAATCGTGGCCACCTGGATGCT
GGAGGTCTGTGAGGAGCAGAAGTGCGAAGAGGAGGTCTTCCCGCTGGCCATGAACTACCTGGACCGCTTCCTGTCCCTGGAGCCCCTGAAGAAGAGC 
CGCCTGCAGCTGCTGGGGGCCACCTGCATGTTCGTGGCCTCTAAGATGAAGGAGACCATTCCCTTGACTGCCGAGAAGTTGTGCATCTACACTGACA 
ACTCTATCCGGCCCGAGGAGCTGCTGCAAATGGAACTGCTTCTGGTGAACAAGCTCAAGTGGAACCTGGCCGCCATGACTCCCCACGATTTCATCGA
ACACTTCCTCTCCAAAATGCCAGAGGCGGATGAGAACAAGCAGACCATCCGCAAGCATGCACAGACCTTTGTGGCCCTCTGTGCCACAGATGTGAAG
TTCATTTCCAACCCACCCTCCATGGTAGCTGCTGGGAGCGTGGTGGCTGCGATGCAAGGCCTGAACCTGGGCAGCCCCAACAACTTCCTCTCCTGCT 
ACCGCACAACGCACTTTCTTTCCAGAGTCATCAAGTGTGACCCGGACTGCCTCCGTGCCTGCCAGGAACAGATTGAAGCCCTTCTGGAGTCAAGCCT 
GCGCCAGGCCCAGCAGAACGTCGACCCCAAGGCCACTGAGGAGGAGGGGGAAGTGGAGGAAGAGGCTGGTCTGGCCTGCACGCCCACCGACGTGCGA 
GATGTGGACATCTGA

HUMAN SEQUENCE - GENOMIC 

CATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCGTCT 
CTACTAAAAATACAAAAAAAAAAAAGAAAAAAAAGAAAGAAAAGAAAAATAGCT^ 

TGAGGCAGGAGAATCGTTTGAACCCAGGAGGTGGAGTTTGCAGTGAGCTGAGACCGCGCCACTGCACTCCAGCCTGGGAGACAGAGCAAGCCTCTGT 
CTCAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGGAAAAATCACTCCAATCATC 
ACCAGCTATGTATTTATTTTGCTGAATTTATGCTGGTTTTATCTTTTGTATC^ 

AATTTGCCATTCCCCTCTTCTTGAACATGAGATTGTCTCTAGGGTATCTCTTGCAAATAGGATAATTGTCAATGATGTAACGGGTCTCTATGCAACT 
TGCAAAAGTGATCAGTCTTGGTGTCCTCCTCATCCTAGTGTTTGGTTCCAAGCAACAGAAACCAATGCTGGCAGATTTCAGCAGAAAAGGAAAATCA
GATGCAGAAGATAGGCAGGAACAAGGGAGGCAAGACAGCACGAATCAC^UVTGGAAGCATGCTGC^ 

TACCCATCCCAACACCAGACCCTCCAGTAAAACTGCAGCGGCCCACACCACCTTGTCACCAGAGCCCACAGCTCTATGATGGGCAGCACAGATACTG
CTGCTCATAAACTTGAATGCCTCACCCAGTTCTACACTAGAAAACATTTCCTACCTTTTCTGATTCT 

GATCCAAAAAGATGAAATGTATAAATAATACTATTTATAATGACATCAAAATATCAAGTACTTAAGAAGAAATCTAATAAAAGATGCACAACACCCA
ACACACACACACACACTCACACACACACACCTGCAATTAAATTAAGGACAATCTAAATAAACAGAGGTCTAT^ 
TGTTAGTAAAAGTGTCAATTCCCTCAAAATTGATCTATAGATCCAATACAATCCCAATCAAAATTCCA 

ATTCTGAAATGTTTATGGAGCCCC^AGGGCCAGGAGTAGCC^GGGCAACCATAAAAGAGAAGTGGGTTGGAGGATTTAGCCCGCCAGATATC^GAA 

TTCGTAGCGTATAGCTTCCATGATGCTTCTGGGGTAAGGACTGAAAAAGAGACCAGTGGAACGGGATAGAAAGTCCAGAAACAAGAGATTTGTGTTA

AGTGAGCAGATTTTAGCTGCTCTGTTCACAAAAAAAAGTAACTATGTGAGATGACAGGTATGTTACACTAATTCACTATGGTAACCATTTTACTATG

TATATGTATCTTATAACATCATGCTGTCAAGTTTATTTCAAAAATAAATTTTACATAAAATCTTAGCAGTGTTATTCATAATAGACAAAAAGTGGGA

AAATTCTATTTTTTTAAAGTACAGAAACTTTATTCCTCAAAGGGGAAATAGGTTAATAATAATAAGAAGAAGAATTAAGGCCAGGTGTGGTGGCTCA 

TGCCTATAATCCCAGCACTTTGGGAGGCCAAGGCCAGAGGATTACTTGAGCCCAGGAGTTTGAGACCAGCCTGGGCAACATAGTGAGACCCCATGTT 

TACAAAAATTAAAAAATTAGCTGGGTGACGTGACAAGTGCCTATAGTCCCAGCTACTTGGGAGGCGAAGGCAGGAGGATTGCTTGAGACCAGGAGGT 

CAAGGCTACAGTAAGGTATGATCGTGCCACTGCACTCCAGCATGGGCAACAGAGCAAGACCCTGTATCTAGGGGGAAAAAAATGATAAGCCTTGTGA

AAGAGAAATGAAAACAGCATCCACACATAAACTTGTACCTCTGTGTTCATCGGCAGCATTGTCCACAGTAACCAAAGGTAGAAACCATCCTAAGGCC

CATTGACTGATGAATGGATAAGCAAAATGTGGTCTATCAATACGATTGATAGTATTCAACCATAAAGAGGAATGAACCATGGACACATGCCACATGT

GGGCCTTGAAAACGTGATGCTAAGTGGAAAAAGCCAGATGCAAGAGGCTGCATAGCATATGATTCC^TCTGTTGGAAATGTCC^GAAAAGGCAGATC 

CACAGAGAAAGATTAGTGGTTGCCAGGGGCTGGGGAAAAGGGGGAAGAAAAGGGTGAATAACTACAAATGGGTACAGGGTTTCTTTGGCGGATGATG

AAAATGTTCTCAGATGGTGATGGTGGCTGTCCAACATAACAAAATCCATCGAACTATATACTCTAAACGAGTGAACTTTACGGTATGTAAGTTATTT

TTCAATAAAGTGGTTTCAAAAAAATAAAAAGGTGAACCCTACCATTTCTAGGTGCTTTGGAGTTTACCGTTTTTCTCCTTGCATTCTGGAGTCAGAG
TGAG ^ GGCATCTCAGCTOTACCACAGCTGAATGATGTGTTGGGCTTCGGTT

CATGAAGATTGAATTAGATGCCCTCTGTGAAGATGCAGGGCAGGTGGAGAGCACAGTAAGCGTCTCGGGGGGCCACGCCATCACCGCCAGCAGGGCT
CACCAGTATCTTTGCCCAGAGAATAGCCGGGGTCTAGGGAGCAGCCAGGGTCTAGGGAGAAGGGCTGCTTCCCTGTGGAGGCTCCTAACTTTCCCAT 
CTCCTCCCTGTTTCTCCCCTGCCCACCCCTTACTCCTGGAAGTGTGACTCGCTGCCTCCTCCCGTGACTCCCCTACATTGCATCTCCCCAGTTACTG 
TCGTTATCTCTCATCTCTTCTTTCCAAACAGCCTCTGTCTAGCACCTTGGAGCTCGCATGGGAATTCATGGGTGGGTGGATTCCAAGCTTGGCTTCA
TCAGATGACAACTAAAAGGCAGAGGCCTGGGGGGGCAAGGGAGCTCCAGGATCTGTCTCCCTTTGGAGAAGCCCAGGTTCACCTTCCCAGCAAGCTG
CTCCGGCCCC^CGCCCCTGGCCCTCAATAACCTTCAGTGGCTCCCTTCTTGCAGCTATAATTCCCCTCTTGGGATGCTACAAAAGGGATTCCAGAGA
TTTTTTTTTCAATAACTCAAAAGGGATGGAGTAAACGATATATGTGCCCAGGACAAGGCAGCTAAGGGAAGGCCTCAGGCTTCCTTTGCCTTTGCCT
GGAAGTATACCAACCCGCAGCCCGGAGGGCCAGCAGAGAACCAATGCTCAGAGCCTGGGGGCCCAAGGAATGCTTTCCAGGCTGGCGGTCCCAGGAG
CCCCCTCTCCCTTCCCTATAGCAAGATACTAAGAGCAGAGGCTGGCCTGGC!AGGCCGT^

ACAGTCGTGTCTTTTTGTTTCCAGCAGTGACACTCACTGACCCACACCTTTGGCCAGCGTCCCTTGTGACTGCCCCCTCCCGCTTGAGCAGAATGGC
TCCTATTACACAATGGGTGCCAGGAACCTGGCCAGTCCATCCAGAGACAGGGCAGAGCCCTGGGCTACCGACCCTCTGGGGCCAGCAGAAACCCTCC
CTATGGCTCTGCCTCTCATCCTGGCATTGACACTTGTGCCCTTAATTAAAGTGCTTCTTTTTTAAATCAACATTTCTACTCCTTTTGCAGAGTAAGA 
GAAAGAAGAGTTAGGGGAGAAGAAAGAAGGAAGGATTCATGGGGTTATTTTTTTTTTTTCATGTAAAAGTTTGTTCCGTGATTTAAATCTCATGTCC
ACAAACAGCTCAGGAGAAGTCAGCATAGAGAAGCAAGCTGACAGATTATAAAGTCATTTGTAAAATCCTCTTTCTGCATATAAACTGAAATGTGTAA 
TTAATTCCACATACTGCGTGTGGCACATGGCCTTCATCCTGCATAAGAACAAATACATC 

CTGTGCTAAATGCTTTCTACATATTACCTTAATTACTCCTCACCGTGGCCTTGCGAAGAAAACGATCACAGCCCGTTTTACAGATGGGGACAGAGAT 
TTGTCCAAGGCCACACAACTAGTGGCAGATGTAGGTCTGTATGTGACATTCTTGTTCAGTTTCATAAATGTGGCTGCCACCTGCCTGAAATTTTGAG 
GGAAAAAAAAGTGGTTTGGCTTTGCATGAATTCTTGAAAAAGTAGATTTTTATCTTTACCAGCTTCTAAGCAAGCTGGATTTATTCTGACAGTGTCC 
ATGAAATGGCTACAGACCATCACCGTTGCCAATGTTGTGAACCTGAACAAGGCACCTGGTTTCTCTGTGCCTCGGTTTCCTTACCATTAAACGCATT 
TCCCTGAGATTAAGTGTGTGAAACTATAAGTTGGCTGGAGGAAACACCTCATAGAAGTG^ 

CAAACCCTACAGGAGATGGACCTCAAAGCAAATGGCCATGCTTCCCTTAGACTGGGTGAACTGGTGCTGCTCAGGGCTAGAAGTGTGTCTAGAAGCA 
TGGGGACCAATGCAGCCCCCTCCACCCTAGGGCACATAACCTGTCTCTCTCCCATAGGATAGCAGAGACAGCTCTCTAAGTCAGGATTCAGCGAGAT 
GGTCCAGGAGGGGCCTAAGGATTCCTCTGACTTATCCAAGCCTGGGGAGGCAACACAACCTAGCCTGTGTCTCAGGGGGCCCCAATCATGCCCTGAA 
CCTCATGAGCCCAGGCAGGCATCCTGTTGGTTAAGGTCGGCCTCAGGGTCCAGGGATGCACTAGATTCTTTATCTTAAAGTGACCCTCAAGAAAAAA 
AAAATAAAATGTAGCTATAGGCAGTTTCTGGACGTTTTCCTATTTGTTAAGCACCTTCTTGCTCCGAGCTTTCCAGCCACAGTGCACTGGACGCCAT
CGGAAGGCTGCAGCTGCTATTTGGTGCTTTACCACCGCTGAGAAGGTGGAATGGTCACGGC^ 

TATTACGGATTCCAGTAGGGCCGAGCACACTCCTTCCTGCCCGTCTTTACAGATGAACATGCTATCGCTCCAGCAGTACAACCCGCCTTATTACGTA 
AAAAAGCAGCCCCTATCTACCCGCAGGGAAGGAGTATGTTCGGCACCACAGCTGGACTGGTGCTCGAGTTAAGCGTCCTGGGAGGTCCGCATGCGCT 
CCGGAACCGTAATGCGCGCTTTTTCTAAGCCTTACGGTAAACGCGGACGCAGGGCAACCACGTGGCGGTGGAACCGAGGCCCGGCGGGAATGCGCGG 
AATGCGCGGCGCGGCCTCGCGCGGTTCCCGAGCCACGGCCCAGGGTCCGGCGGCGCGCGCTCTCGCCTCCTCCCCTCACCTCTCCCAGCCGCACCCC 
GGCCCTGGCCCTGCCACCCAGAACTCGCTGGGCAAGTCGTGCCCCGCGTGAACACACAGAAGGGGCTTGGGGACCGAGCGCGGCCCATCAGTCCCTC 
AGACCCTGAGGACCCAGAATTCCCTAAGGGGTCCGAATCCGAGTCCTGCCCCC^GCCCTTAAGGCACXSGGCTCCAGGGACCCCAGGGGAAGGGCGCG 
GGGCATTAGGTACGCAACCCGTTTCCCCGCACCTGGAAAAAAACTCCCTTTCCCTCCCCTCCCCTGCTTGTTGAGTGTGCGGATAACCAGAACTCTA
AGGCGCCCCGTAATAACGACCCCGCTGTCCCTCCACCCACCCCCAAGTGCCAAAGCGAGGGATGGAAGCGCTTTCMGCGTTCC^GGGCATTGAGG 
AGCGAGCTGGAGAGGCGCGGGGATGCGGGGTCCTCCCCGCAGTCTTCCGGAAAGGGCGGGGGAGGGCGCGGCAAGTTCCGGAGTGGGGCATGCCGTG 
GGAGCCCACGAGGGCCTCAGCGCGGATCCTCCGCCGGAAAACCGGCTCCCGCGAGCCGCCGCCGCAGGTTTCCTAGGCCCCGCGAGTCCCGCAGCGA 
AGCCCTGCGTCTCCGTCCGACGCGGGGGTCTGCTCAGCCTCGGGTGGGCCGCGGCCAGGCCTGACTGCGGGGGAGAGGGCCGAACGTGACCTCCGAG 
GTCACCCCCAGCCAGCTTTCTCTCCTGTGGTCGGAAGTGGTTTTCTTCTCGATCTGGGCGCCTACTCCCCACCACTTGGTCTGAGAGGGGCTGGGGC 
CGGAAGGCCAGGGAATCTCTGGTGGATTTGGGGGTTCATATTGCTCAGGGTACCAGCCGATGCGTTTTGAGGGGCGGGAGTCGAGGAATTAGAATCG 
CCTTTAACCCTCAAGAGTTGCGCCTTCAGCCTCGGGATCCCAGATGCGTCGTTGGAGCCAGGGCCGCCCCCCTACCTGTTGGGTTTGCGTTTTAACT 
CCAGCGCACACCTTGCCGGCAGCCCTCGGAGCTAGGGGAGGGGTCTCGTTTCCCCGCAGCCCGCCGGACAGACGACTGGGGCACGGGAGGGGCGGTG
GCAGGGTGGTCTGTGTGTGGCTGAAACTAATTGATCTGGAGCGGAAACGCACGTCTGCGGTTGGGGCGATGGGGGGGGCGGTGCGGCTGTCCATGTG 
CCGAGCGTGTGGCTGTCTCGGGTGGGC^CTGGGGCCGGAGTTCGCCCCGGCCCACCTCGCAGTTTTGGGGCGCCTGGGATCGGCGCTACGTAAGCGA 
AGCAGAGCTGCCATAGCACGTGGGCCGCCACGCGCACCCCAAAAGCAAGCAGTGTGGGGGGAAGGGGAGCTCGAGCGCCTTCGGAGCCCAGGGGCCG 
GCTTTCGGAAGCGTTTTCCCGGGCGACTTAAGGGCTTAACAATGGAAAACTCGCGGAGCCTGAGCCAAGTCCTTTCAAGTCGCCGCCAGGTATGCGG 
CTGCAGGTGACCCCACCTGGGTGCGCCCGCCCGCCAGCCGCCCTGGTGGAAAAGCGGGTGCGGGAGGTCGCTGGCGAAAGGTCGGGACTGGTCCCTG 
CACCACCCGCCCCCAACCCAAGCCCCGAGCCCCGCGGCGCGCAGCCGCGCTGAGTCCCGGGGTCTGCGTCGCGGCGCGCCGGTTCCTGAATGAACGC 



CCAGGCCGCGGTCACCGGGGCGGGGGCCAGGGGGCGAGGAAAGCGTGAAGGTGATTTCAGTTAATTTTGGATTTTCTTTCAAACAACGTGGTTACCC 
TCCCGACTGGGCCACTTGCCCTTTGTCTCCAAAT^ 

GGTGGGTTTTAAGAAGACAGTGTAAAGTAAAACTGCAACCGAAAAGTTTTTTAAAGTTGCTTTTCTCTTTGGAAAAAATAAAATCAAAATGCTTTCT
CTGCGCTTCTTGAAGQU\TGACCCTCAAAAGCCCAG 

GGGCAGCTGGCGCTCCCCGCCGGGCCCCAAATTCCAGCGCCGGGCCCGCAAATTCCAGCGCCTCCCCCGCGGGTTCCTGGACGGCTCTTTACGCTCG 
CTAACCGGGCTTGCAATTTTGCGCTCGTCCCTGAGCCGGGAAATCAACGAAGTTCCTAGTCGAGATCTGCCCGGTCCGCCTAGTAACAGCGCCGCGC 
CCCCATTGGCTCATGCTAATTCCAGTTTCCTCTGTCTTGCGCCCGGGATGGGGGGGTGAAGCTCCCTCCTGGACCCAGAGCCGGTTGTGCCGGAGTG 
GGCGAGCCTCTTTATGCCCTGCTGCCCCTAGCCGACTTCGGCCCGCTTCGCGCCTCGGGCTGGGCCAGGGCGCACGCGGGGCTCGGGGCCCCTCGCC 
CCACGGGATGGGAGAGGCCGGGTGATAGCTCCGGGCCCCATAAATCATCCAGGCGGCCGCCGGGTCGGGATTTTATGAATGAAAAAGCAGCTGGGCC 
GCCCTTGTGCGCGGGCTGATGCTCTGAGGCTTGGCTATGCGGGGGCCAACGCGATTGTGGGTGCTCGGGGAGTGGGGGGGGGCACGACCGTAGGTGC 
TCCCTGCTGGGGCMCCCATCGCTCCCCATGCG^ 

GGGGGCGCGGGAGGAATTCACCCTGAAAGGTGGGGGTGGGGGGGGTCGCATCTTGCTGTGAGCACCCTGGCGAAGGGGAGAGGGCTTTTTCTATCAG 
TTTTCTTTGAGCTTTTACTGTTAAGAGGGTACGGTGGTTTGATGACACTGAACTATATTCAAAAGGAAGTAAATGAACAGTTTTCTTAATTTGGGGC 

AGGTACTGTAAAAATAAAAACAAAAGTTAAGACAGTAAAATGTCCTTTTATTTTTTAATGCACCAAAGAGACAGAACCTGTAATTTTAAAAACTGTG

TATTTTAATTTACATCTGCTTAAGTTTGCGATAATA^

ACTCGTTTGTTTAATTGATAATTGTTCTGAATTATGCCGGCTCCTGCCAGCCCCCTCACGCTCACGAATTCAGTCCCAGGGCAAATrCTAAAGGTGA
AGGGACGTCTACACCCCCAACAAAACCAATTAGGAACCT^

TAATTAAAAAAAATGAGTCAGAATGGAGATCACTGTTTCTC^^ 
GGGGTGCAGTTGGGGACCCCCGCAAGGACCGACTGGTCAAGGTAGGAAGGCAGCCCGAAGAGTCTCCAGGCTAGAAGGACAAGATGAAGGAAATGCT
GGCCACCATCTTGGGCTGCTGCTGGAATTTTCGGGCATT^

GGTTACCCCCTTGGGCATTTGCAACGACGCCCCTGTGCGCCGGAATGAAACTTGCACAGGGGTTGTGTGCCCGGTCCTCCCCGTCCTTGCATGCTAA
ATTAGTTCTTGCAATTTACACGTGTTAATGAAAATGAAAGAAGATGCAGTCGCTGAGATTCTTTGGCCGTCTGTCCGCCCGTGGGTGCCCTCGTGGC
GTTCTTGGAAATGCGCCCATTCTGCCGGCTTGGATATGGGGTGTCGCCGCGCCCCAGTCACCCCTTCTCGTGGTCTCCCCAGGCTGCGTGTGGCCTG
CCGGCCTTCCTAGTTGTCCCCTACTGCAGAGCCACCTCCACCTCACCCCCTAAATCCCGGGGGACCCACTCGAGGCGGACGGGGCCCCCTGCACCCC
TCTTCCCTGGCGGGGAGAAAGGCTGCAGCGGGGCGATTTGCATTTCTATGAAAACCGGACTACAGGGGCAACTCCGCCGCAGGGCAGGCGCGGCGCC
TCAGGGATGGCTTTTGGGCTCTGCCCCTCGCTGCTCCCGGCGTTTGGCGCCCGCGCCCCCTCCCCCTGCGCCCGCCCCCGCCCCCCTCCCGCTCCCA
TTCTCTGCCGGGCTTTGATCTTTGCTTAACAACAGTAACG^

GCTGTCGGCGCAGTAGCAGCGAGCAGCAGAGTCCGCACGCTCCGGCGAGGGGCAGAAGAGCGCGAGGGAGCGCGGGGCAGCAGAAGCGAGAGCCGAG
CGCGGACCCAGCC^GGACCCACAGCCCTCCCCAGCTGCCCAGGAAGAGCCCCAGCCATGGAACACCAGCTCCTGTGCTGCGAAGTGGAAACCATCCG
CCGCGCGTACCCCGATGCCAACCTCCTCAACGACCGGGTGCTGCGGGCCATGCTGAAGGCGGAGGAGACCTGCGCGCCCTCGGTGTCCTACTTCAAA
TGTGTGCAGAAGGAGGTCCTGCCGTCCATGCGGAAGATCGTCGCCACCTGGATGCTGGAGGTGCGGGGCTTCGGGCGGCTCTCTTAAGACTTCCCTG
CAACTTGTTGCCCAGACCCACGTTTCTTTGCTACTCACCCCCCTCCCTTCTCTCCCGCTAGAACTTTGAAGTTTGCCGTGGTGTTTCTAGGGATCCG
TATTTTCAAAATAAAAATTGCGGGTATTTTCTGAAGGAGGAAGGGGTGGGGGTGGGGGTGCTAGAAGTAGCGTTTCGTGGGAGGGGAGAAGGGGGTC
CGGGAGGGGTGCCTTCGGGAGAAGCCAGTGCCAGGGGCACCCCAATGGGCCCGAGGGTGCGGGCTGGCAGGCTGGGTGCGCTTTGTGTCCCCCGCCT
GCGCCCCAGCCCGGCTGCGCCTCAGCGGCCGGGAGCCGCCAACTCCGGGGGGAGGGGGCATAGATTTGATTTTTAAATTAATATCCATGGACACGTA
TGCAAGGGCCGCTCGTGCCAGTATTATGCGCCATCTTTGCTCTTTTATTGCAAAGCAAAAGTGTTTATTAATAATTGGGGGCAGGGTGGGGGCGGGG 
AGCGGCCGCCGGGCGCTGGGGCCGCAGCTAAGGGCCGCGCGGCTGCCGGGAGCCCGCGGGAGGGGCGCAGGGACGCGGCATGGGTAGTTTTGGGGGG 
ACCCCGCTAGGGAAGGGGGGGCCTTTGTTCAAGCAGCGAGTCCCGGGGCGCCCCGAACGGGCAGCCTGGGCCGGAGAGCACGGCGAGCTGCAAGGTC 
GCGTGGCCCCCAAGACGCCAGGGCTTGATCCCCGTCTGCAGGGATATCGGCTTGGAGGACCTTCTCCGAGCGAGCCGGGGGCCTGGGAGCACATTTT 
CAGACCTTCGGTGGGCGCCTGAGGGGCCCGCAAGTATTTTAAAATAATTTTTGAAAGTGCGGCGTGGTGCCCTTGCGAGAGGGAAACGCCGCCCGCG 
CCCAGGGGGAAGGGGGGGCCCCGGAGTTTGAATTCCTGGGGCTCCCCCCGGAGCCTGTAACGAACTCCCAACCCCCGGCCTGGGTAAAGGGTCGCCC 
GAGGGTCATTTTCAGGGTTTTTTTATGCACTTAGTTATTTTTTTAATATTTTTAAATATTTTTTGAAAAGATGACGTCTGGGGAAATGCGGCGCGGC 
GGCCTGGGACGCCACCTTTGTGTCTCGCAGGCGCGGCGCCCAACCCCGCGGCCCGTTCCGCGGCCCCGCACCCCAGTTGGTGTCGACCCCCAGTCAG 
AGGGACCACGGAGCTCCAGGGCGGGCCAGGGTCCCGGGGGCCGGCAGCCCGCGCCGCCGCGCACGCCGCCCAGCTGTGCCCGCTCCCGCCCCCACCG 
TGCCAGCCTCGCGGGGACTTTCCCTTTCAGTTTCGGGGAGGGTGGGTACTGGGGACGCGCGGGGGAGGGGGCGCATCACGGGAAGCTCCTGCCGCCC 
CCAGCCCCGACCCCTCGGCGCCCTCCAGACCTGGCGGCCCTGCCAAGCGCGATGGGGGGTGCGGGGGCGTGCGGGGGGGCGGCGCGACCTGGCGGCG 
GCGGTCACGGGCCCCGTGCCTCCGTAGGTCTGCGAGGAACAGAAGTGCGAGGAGGAGGTCTTCCCGCTGGCCATGAACTACCTGGACCGCTTCCTGT 
CGCTGGAGCCCGTGAAAAAGAGCCGCCTGCAGCTGCTGGGGGC 
GAAGCTGTGCATCTACACCGACAACTCCATCC^ 

GACCACGGGGCCGGGGAAGGTGCAGGCGGTGGCGGCCGGCCCGCCTCTGACATATCTGCTCCTCCGAGGGAGGGCGGCCCCGCCGCCGGGCGTCCCT 
GTCCGGGGAGCGGGCGGGATCCTAGCCGCCCTCGTCCCGCCGCC^ 

AAAAGTGGGCGGOSCGCGCCCTCCAGCGGCTGCACGAGGAGCGCCGCGCTCGGCGCTGAGCCTCCAGTTCCAGGTGGTGGGAGGTCTTTTTGTTTCC 
ACTTGCAGAGTCTTTTCACGCGGCGGGCGCCTTTTCTGTTTTGATCTGGGATTGCGTGTTGCCCCAGCTCCCTTGAGTCCCCAGCATTCGCCAGCCC 
TCCCCTCCAACATCCAGGACCGCACGAGACGCAGGGGCCAGTGCTCTGAGCCGGAGGTGCGGCGTGGCCCGGCCCCCGTGCTGCCGGCTTCCCCGCG 
CCCCCGGGCTGGCCCGCACCTCCCCTGATGGCCGCTCACCCTGTGTTCGCAGCAAATGGAGCTGCTCCTGGTGAACAAGCTCAAGTGGAACCTGGCC 
GCAATGACCCCGC^CGATTTCATTGAACACTTCCTCTCCAAAATGCCAGAGGCGGAGGAGAACAAACAGATCATCCGCAAACACGCGCAGACCTTCG
TTGCCCTCTGTGCCACAGGTAGGGCAGGCCCGGCAGCCCCCGGCCTCCCCTTGAGAGCCGGCTCCTTAGGTGACCCTGGCCGGCTTCTTGCTCTCCA 
CCTGGGTGCTGTCTGGGAAGATGTCCCCAGACCCCCTCCTGCGCTGGAGAGCGCTCTTCCAGCTCTGGTGAGCAGAGGCCCTGGATTGTTTGTCGCG 
CTGGATGGAGGGAGATTTGCTCCCTCACGGCCACCATGCAGTACCTTGGGCATTGGTGTGGACGGCTCAGCCTGCCTGTGTCCCGTTACTCTGGCCT 
CGTCCTTCAGGCC^GGCAGCCTGTGGCCACTCCATGCTGAAAGGGGTTTACCTTGGCCACAGGGCCGCCTCCTTTCTCCACCCACCTCCAGCCCTTC 
TTGTGTCCTTAAGGAGCCTGAGCTGCAGAGGCCCCCTCCTGGCCTCTCCCAGGCTGGGCCACCTGCCAGAGGCGCCTCCAGGGGCGGGGAGAGCTGT 
CGGCCTGCCTGCACCACGTGCTCTGGGCAGCCGAGTGCAGGGGTGTCCAGCAGAGGAGCTCGGCTGCCTGAGGCCCTGCCAGGGGTGCCGGCAGCCA 
GCCGGGCTCAGCTGAGCCCTGAGGGGGCGCTTCAGAGCACTCTCAGCTTGGGCCGCCACCGTGGGCAGCAGAAGCACCCAGTCCTCACTTCCCCTGG 
CATGGCCCCAGAQGCCCCTCCCTGACATGGCCTTGGCCCCAGAACCCAGTGGGGACAGACTCGCACATACACAGGGTGCCGCCTCCTGCTGTCCCCA
GCCCTGCCTCTGACCCCCCTGTGACCGCCTCCTTCCCTGGCCCAGGAGGCCTGGTTACCTTCATGGGGGAGCATGGCCCCATCCCACCCAGCTCTGC 
TGTGGCCCACCTTTGGTCAAGCCTCAGTTGTCACATCTGTTTGGGGGCTCACTCTGGGTGACCTAGGCCACAAGGCCCACGGGGCATCAAAGAGGCA 
GTAGCATCTTCTCCCCTCCCCAGAGGGCAGAGCCCCCCAAGCCTACTTCAGAGCTCCCTTCTGACACCGGTAGCCCGCAGCCGGTATTCCAGAATGG 
GTTCTGGTTTAGGCGTGAGGCCTCCCCCACCTCCTCCACCTGCTTGGGGCATGAACCCCTCCCCCACGTTTCCAAGCGAGTCCCCAAGGTGGGCAGA 
TGAAGATGCCAAGGATGTCGACCAGTCTGGATGGGTCTGGGGTGGGGGGGCATGCGGCAGACAGGGAGGCATTCTCTGGCTGGTGCTCCTCAGAGGA 
GAGAGGCCTCCGGAGACTCCAGACAGCCTTTTATGGAGCTGAAAGTGGCTTCAGAGAAATGCAAAGTTTCCTGGAGAGAACGTGGGGCGTGGTTCTT 
G<^CAGCCTCCCTACAGGGTGGCTCCAGCAGTGGAGCTCCCCTCCCAGGACCCCTGGGTGCTAGTGGGAGGCAGTGGGCAGGTGCAGATTCTCGTCC 
TTCCCACTACTGCACACCCTTTGTCTGCGAAGGCGCCCCCAGCGGTGGGTGAAGGAGGAGGGACACTTGGGGACCCAGCTGTGCACGTGCTCTCAGT 
GACTGTGGAGTCCACTCCAGGGTGGGTCCCGAGGGAGGGGCAGGAGACCAGGGGACCCACCCCTGCAAAGTGCTCCGGGTCCTGACCCGTGGCCACC
CCATGGAACGTAACTGAGCAGCCAGTGCCTTGTTCCTGCTGGACATCTGTGGAGACAAGAGTGACTTACGGCTGCTTAAAGTCAGAAACAGGTTGAA 
GGAGGTGGAGGCGTGGGAAAGAGTCTAGGAAGGTGTTTTTGCCCTCC^CGTGGCAAAGGTTACATTTAAAGGTGATGCTGGGTGTTCTCCCTGCACT 
AGGCATTCCTGGCCCCZAGGTCCCCAGCAGGTGTGCACATGCTGCATA 

GCCAGGTGAGGAACGTCCAGTGCCAAGGAGCTTCCGGGACAGCTGTCACTTCCCTTTACAACC^GGCAGCGGATAGGGTCAAATCCTGGAGCTTTGG 
TGTCTAATTCTGGGTGGCTCCTAATCTAAGCACAGACAGCACCACACACTGGGGTGGGGGCACGAGCTTCTGAAACAACGTGGCCCCAGTGACTCCA 
CGCTGTGTGTGCCCCTGGAGACGGGGGGGTGCACAAGGTGCGGAGCCAGCTAGAACCTGTCGCTCCCTGCAGAAGCGGTTTCTGTGTGCGGTTCTGA 
TTTGCCTCAATGAGAAGGTTTTCATTCATGGCTCCCGGCTCTCAGACTGGGTGGAACTGCTCCCATTTAAAGGGGAAAAGAGGTGGCTCGGCTCGTT 
AAGGATTTCTTTTTCTAAGTTGTTACGGCGCCCAGCAGCCGGCTTTGTCTCCCCTTCAGGGTGGCTGCCTTTCTTCCCGGCCCCTCGCCGGCGGCCC 
TCTCTTTAACAAGGCCGAAGTTGTTTATTCTCTCGGGATGAAGTCTCGGATGGGCCGCCACACCCCTGGCGGCCCGTGGGGGCCCCTCTCCCTTTGT 
GCCTGGGTCGGCTCCCATTCAGCTCCCCCGACCCCCCTTGTTCCCGGGCGCTCAGTGGCGCGAGATGAGGCGATGGGGCCGACAAAGATGCCACACT 
CATCCCTGCCX3ACCTCCGGCTCCCAGCCCAGGGCCCCTGGTTCCTGTGCAGAATTCCTCGTGGGTGTGACAAAAGGCTGCCCCCAGGCTCCGCTGGG 
GTGGGGGCCAGGCCAAGAGGCACATCCCACACTGGCCCACCTGTCCACGGTAGGCGCATGACTGCCCTGAGGAGGGGAGGCCGGCATTCCCCGCCAC 
AAACCAGGACGTAATTGGTGGCAGGGCTCTCTGTGGAAAGAGCCAGTCTGCTGTTTGTCTAGGAGGTCAGTCACAGAGGCCCCGAGACGCCCACTAC 
TGCAGCCTGGCAGGCGGATGAGCCCAGTATCTGGCAGTGACCAGAGGGAGTTTTGTGCAGACCACAAAGGCTGATGGGCTGCCCTAGATTGGTGTCC 
CTCTTGGAAGTGGGCCCAGATGTGCGGGACAGTCCCCAGGAAGCCCCAGGTGAGGGCACTGGTGCCCTCTTGGGAAAGCTGCTCCCTCCTGGGGCCC 
GGCTCCCGGCCCAGTCCTCCAGGGGTGTCCCATGGTGACTGGTGCTAGGAACCCCACACCTCTTCCCTTACTTGGGAAGTCACTGGAATTGTTGGGC 
TACATCAGACGGCCCAGAAAAGTGTTTTTGTCATCGGCCAGAAATAGGAGAGTTGTGAGTAGAGGGCCCGGGTGGAGTTGGGGTGTACTTGGTCTGT 
GCTCTGAAGGTCACTGTGACAGTCATGGTCCCATGGTAAGGGGCATGGGTTGCTGGAAGAGCTCTTCCTTCCCGAGTGAGCCAAGCCGGGCTCTCCT 
GGCGCCAGGGCCTGAGCCGCAGCCACACCACAGCCGCCCTGAAGGCTGCCGGCCAGGGCTTACCCCTCAAGGGACACGGAATGGCTTCATCAGTACC 
CTGCAGCCCCGTGGCCTGGCCCGGGTGGAGGCCTAGGCTTCAGCCATGCGATGTCCCTTCAGAATATGACTTGTCTGCAATCCCTGCTGCTGGGGGG 
TGGCAGGTACTTGGGGTGAGGGTTAGGGTCATAGAAGCGACATCTCTACGTCCTCATATTTGCGTCATCTAATTTTGTTTTTGTGAATACGTGATAA
CATTCACAAGGCTCAAGATGCTAAAAGGATGAGAAGGCAGTGATGTCCCCATCACCTGTCCTGTGTCTTCCCGTGGCTTTCTCTTTCCTTGGTTATG 
TTTGAGTCAACAGTGGGGCTGACGTTCCAGGAGGGTCCGTGGGCCAGGCTCTTGCTCTCCGAGTGCCCAGGGATGGCTGGAGGCTGAGGAGGGCCTG 
GATGTGGAGCCTCAGATACCAAGTGCTTCCCTTCAGGCCGGGCCGCTTGCTCAGAGCCAGCACACAGGGATGCCCGGATCACGGGGGCCCTGAGAGG 
GTCCCCTGCTCACAGCCTCCTTCCCTCTCTCCTTCTGCCTCAGATGTGAAGTTCATTTCCAATCCGCCCTCCATGGTGGCAGCGGGGAGCGTGGTGG 
CCGCAGTGCAAGGCCTGAACCTGAGGAGCCCCAACAACTTCCTGTCCTACTACCGCCTCACACGCTTCCTCTCCAGAGTGATCAAGTGTGACCCAGT 
AAGTGAGGGTGATGTCCCAGGCAGCCTTGCCGGGGCTTACAGGGGGAGACACCTAGTGCCACGGAAATGCCGAGGCTGGTGCCAAGGCCCCCAAGGG 
TGACAAGGTTGGGGCTGGGGCTGGGCCCCTCGGACCCCAGGCCACAGACTGACAGGGCACCGGCTTCTTCCACTGCTCCTAGAACTTACTGACTGGC 
TGGGAGGTCCTCACAGCCTTCTCACGTCCCCTGGGGCTTCCAGGAGCCGTAGAGTTTCTGGGCGAAGCGTCCGGGACGGAGGCCCCAGGCGGCCCCA 
GCCAATGGTCTGTGTGGTGATGGTGTGTGGGGTTAGGCCCAGGCGAGCTTTGTTTGGGCCACAATGTGCGTGGCCAATAAATAGATGCTTGAAAAGG 
GCTCCTGTGAGGTCCGAGACACCGGACAACGGGCGGATAGAGACAGCCTTGTTGTTTACGGCCTCTTTGAGAGGCTGCTGCTGTTAAACCCTGGGAT 
GACTGTGTCTTTCTTCTTAAAAATGCCATTGTTTT^ 

CCAAATTAGACCAGAGAGGTGGCCGGGTCAGCCGCCGGCCCCGCGGTGTGTGAGGGAGTGACCGCCTGACCCCAGCTTGGGGCTGGGTGGGCCTGCA 
AGACCCGTTTTGGCTCTGGCCTGGGCCGCCTCTTGGTGGTCTGCCCTCGAGCCTCCCGGGGACTCCGCACGGGTCTCAGCAGATGCTATCTAGGGTC 
CACCTGCCTGTCCCCTGCCTAGTGGTGCCTCTGTCCCGGGGACACTGGGAGTAGCGGCTGCCCAGCCCATGTGTGTCTCGGAAGAGGAAGAAGCTTT 
TTTGCCGTGGGACACCGAAGTTGGCAGGGGCCTCCCTTCTGTGTTCTCGGCCATGGCCTCCCTTGCACCCTGCCCCGTGTTATCCTTTGGGGGTGGT 
GAGGTGTCCTCACCCGCTGTAGGGTGGAGGCCAGCAGCCCGCAGCTCTCTCAGGAAAATGGCTCAGAAACACCATCGAGGCCTCCAGAAGCCCAGCA 
AAGAGAAAGCCCCTCCATCAAAATGAAACTCGCGTCTGCACTTTTCATTTCGAACTCCACGCCCTGAGTGAAAACCGCTTCCCCGCCAGGGGTGACT 
GCCCTGGGATGTTGCTGTCTTCGGGCAGTTGTGGGAAGTTGGGCGCTGGCCCTTATTTGAGTAGAGACCATCTTAACTAGATTGGAGGCACACGTCT 
CACAGCTGACAGACACACGGGGTGAAGTTACCCGAGGCGGAGTCCACTCTGCCTGATCAGCTAGTGACCAACGTAGCTGAGCCCAGACTCAGAAAAA 
CCGTCCACAGCAGAGGCCCCTGCATTTTCTAGGGCGTGTTCTAGAATTTTCTTTGGTGGGTGGAATGTCCATCTGTGCAAATCGGGTGCGCAGTGCC 
ACACACCAGTGACTTTTCGCGGAGGAGCGTGCTGCCrTTTTGGAGCTTCTGGCTGTGGGAGAACAGCTTTGTCC^CCGGGGTAGCCTTGCAGGCAGC 
TGTGGGGCCAGAGGAATGAAGGAAGGTCCTGGAGTCTAGCTGCATGTGTGACCCTGGAGTGGGTCATGGGCGAGGGACGGGCCGCAGGTGAAGAATC 
CCTGGATGGAGCTGCCAGGCCCCTGGGGCTGAGAATTGAAGCTGGCTGGTGTTTTAGGTTGAACGTCAGGAGTCTTGTATCTCACCCCAGGCCTCTG 
GCCTC^GTTTCCCCATCTGTACAGTGGGACrGTTTGTG<^GCCAGCCCGGCCAGCTTCATTTGCCATGATGAGAATTTATCTGAGGGGCGGGAGAGG 
AAAGCCCTCCCTATAAAGGTACAGGCGCTAAAATGTCATGACCTCAGTGGTCCACCTAAAAGTCGTTCTGGCCTGGGTCATCGCCTGTCGTGCTATG 
CCTTTGTCCAGCCCCTTCTGGTTGGGAGTTAAGTGGCACCTGTGCGGCACGTGGTGGGGCTGTGGCCCAGCCCTGCTCCTTGTGGAAGGTCTGTTTC 
CTGGGCTGCCTAGAGACTTGGCTTGAAGCCCTAGCGTGGCTTCCTGGCAGTTGGGACACACACAGCCCCAACACATGGAGCCGGTTCTCCATCCAGA 
AGCCCCCGGGCAGTAAGCAGCCACTTCAGGCTGCGTGGGACTTGCCCGTGGTGGAGCCTAGGAGAGGCCCCTGGCTGGGCGTGGCGTTCCAGATTTC 
ACGGCTGCTCTTTCCCACTGACAGTGTGGTGTGGACGCTGCCAAGGGAGTCTGGAGCCCCAGAGGGTGGAGGTGCAGGACTTCCAGGAGCGTCCGTC 
GCACTCCACCCGAGGGCGAGCACCTCAGTGGCCGCAGTGGGTGGATGCATGCTGTGCCAGGCTGATGGCTGGCCCCGGGGCACAGGCCTGAGCGGGA 
GAGGATGGAGGGGAGGGATCAATGGTCCAGGTCCCCCTGGCCACCCAGCATTCATCCTCAGTCATGCACGGCCCAAGGCTTCGACAGCCATTGATCA 
TGGAAGGCCAGGTTCACCTCAAGGGCTGCCACATGGAGAGGTTAAGTCTGAAAAGGCTGAAAAGGCAGGGTTAAAAGGGCCTCCTGTCCAGATCAGA 
TGGCACTGAATTCCCCAGGGAGCTGGCACGGCCAGTGGGAACAGGCGGTGAAGGCGCTGTTGGACATGGGGACGGGCAGGGGGTGTGCAGGGTGGGC 
GGGCAAGCATCTGGTGTCTTGTGGCTCCAGAGACCAGGTGGGAGGTGGAGGCGTTTGGTCCTGAGTGTCCTGACAGGTGATGGCAGCTCCCACATCT 
CGCTCAGGTTCAGAGGAGGCAGCATGGGCCGAGGGACAGTTTTTGGCTTAGTCTTGCTCTTATAAAGGCTTCCGGGTCATGGCACCTGGGAAGGGGC 
CCTCGCTGCAGGCCCCTTCTAAGGACCCCCTCTTCCCACCTCTCCCCACCCTCTCTCTCTCAGGACTGCCTCCGGGCCTGCCAGGAGCAGATCGAAG 
CCCTGCTGGAGTCAAGCCTGCGCCAGGCCCAGCAGAACATGGACCCCAAGGCCGCCGAGGAGGAGGAAGAGGAGGAGGAGGAGGTGGACCTGGCTTG 
CACACCCACCGACGTGCGGGACGTGGACATCTGAGGGCGCCAGGCAGGCGGGCGCCACCGCCACCCGCAGCGAGGGCGGAGCCGGCCCCAGGTGCTC 
CACTGACAGTCCCTCCTCTCCGGAGCATTTTGATACCAGAAGGGAAAGCTTCATTCTCCTTGTTGTTGGTTGTTTTTTCCTTTGCTCTTTCCCCCTT 
CCATCTCTGACTTAAGCAAAAGAAAAAGATTACCCAAAAACTGTCTTTAAAAGAGAGAGAGAGAAAAAAAAAATAGTATTTGCATAACCCTGAGCGG 
TGGGGGAGGAGGGTTGTGCTACAGATGATAGAGGATTTTATACCCCAATAATCAACTCGTTTTTATATTAATGTACTTGTTTCTCTGTTGTAAGAAT

AGGCATTAACACAAAGGAGGCGTCTCGGGAGAGGATTAGGTTCC^TCCTTTACGTGTTTAAAAAAAAGCATAAAAACATTTTAAAAACATAGAAAAA

TTCAGCAAACCATTTTTAAAGTAGAAGAGGGTTTTAGGTAGAAAAACATATTCTTGTGCTTTrCCTGATAAAGCACAGCTGTAGTGGGGTTCTAGGC 

ATCTCTGTACTTTGCTTGCTCATATGCATGTAGTCACTTTATAAGTCATTGTATGTTATTATATTCCGTAGGTAGATGTGTAACCTCTTCACCTTAT 

TCATGGCTGAAGTCACCTCTTGGTTACAGTAGCGTAGCGTGGCCGTGTGCATGTCCTTTGCGCCTGTGACCACCACCCCAACAAACCATCCAGTGAC 

AAACCATCCAGTGGAGGTTTGTCGGGCACCAGCCAGCGTAGCAGGGTC^ 

ACGAAATAGTGACATAATATATTCTATTTTTATACTCTTCCTATTTTT^ 

CCAGCTCACGTCCAGGTTCAACCCACAGCTACT^ 

GTAGGAAATAGCGCTGTTTTTGTTGTGTGTGCAGGGAGGGCAGTTTTCTAATGGAATGGTTTGGGAATATCCATGTACTTGTTTGCAAGCAGGACTT 
TGAGGGAAGTGTGGGCCACTGTGGTGGCAGTGGAGGTGGGGTGTTTGGGAGGCTGCGTGCC&GTCA 

AGGATGATAAGTTCCTTTCCTTTTCTTTAAAGAAGTTGAAGTTTAGGAATCCTTTGGTGCCAACTGGTGTTTGAAAGTAGGGACCTCAGAGGTTTAC

CTAGAGAACAGGTGGTTTTTAAGGGTTATCTTAGATGTTTCACACCGGAAGGTTTTTAAACACTAAAATATATAATTTATAGTTAAGGCTAAAAAGT 

ATATTTATTGCAGAGGATGTTCATAAGGCCAGTATGATTTATAAATGCAATCTCCCC^ 

CACACAAACCTTCTGCCTTTGATGTTACAGATTTAATACAGTTTATTTTTAAAGAT^ 

AAACCACACAAAGACATTGATTCAGCCTGTTTGGCGTTTCCCAGAGTCATC^ 

AGTTCGGTTCCGATGAATTCTTATCCCCTGCCCCTTC^^ 

ATTTTTATTTAGGAAGTGTTGAAGGGAGGTGGCAAGAGTGTGGAGGCTGACGTGTGAGGGAGGACAGGCGGGAGGAGGTGTGAGGAGGAGGCTCCCG 
AGGGGAAGGGGCGGTGCCCACACCGGGGACAGGCCGCAGCTCCATTTTCTTATTGCGCTGCTACCGTTGACTTCCAGGCACGGTTTGGAAATATTCA 
CATCGCTTCTGTGTATCTCTTTCACATTGTTTGCTGCTATTGGAGGATCAGTTT^ 

TTCTCTTAGAACATTGTATTACAGATGCCTTTTTTGTAGTTTTTTTTTTTTTTATGTGATCAATTTTGACTTAATGTGATTACTGCTCTATTCCAAA 
AAGGTTGCTGTTTCACAATACCTCATGCTTCACTTAGCCATGGTGGACCCAGCGGGCAGGTTCTGCCTGCTTTGGCGGGCAGACACGCGGGCGCGAT 
CCCACACAGGCTGGCGGGGGCCGGCCCCGAGGCCGCGTGCGTGAGAACCGCGCCGGTGTCCCCAGAGACCAGGCTGTGTCCCTCTTCTCTTCCCTGC 
GCCTGTGATGCTGGGCACTTCATCTGATCGGGGGCGTAGCATCATAGTAGTTTTTACAGCTGTGTTATTCTTTGCGTGTAGCTATGGAAGTTGCATA 
ATTATTATTATTATTATTATAACAAGTGTGTCTTACGTGCCACCACGGCGTTGTACCTGTAGGACTCTCATTCGGGATGATTGGAATAGCTTCTGGA 
ATTTGTTCAAGTTTTGGGTATGTTTAATCTGTTATGTACTAGTGTTCTGTTTGTTATTGTTTTGTTAATTACACCATAATGCTAATTTAAAGAGACT 
CCAAATCTCAATGAAGCCAGCTCACAGTGCTGTGTGCCCCGGTCACCTAGCAAGCTGCCGAACCAAAAGAATTTGCACCCCGCTGCGGGCCCACGTG 
GTTGGGGCCCTGCCCTGGCAGGGTCATCCTGTGCTCGGAGGCCATCTCGGGCACAGGCCCACCCCGCCCCACCCCTCCAGAACACGGCTCACGCTTA 
CCTCAACCATCCTGGCTGCGGCGTCTGTCTGAACCACGCGGGGGCCTTGAGGGACGCTTTGTCTGTCGTGATGGGGCAAGGGCACAAGTCCTGGATG 
TTGTGTGTATCGAGAGGCCAAAGGCTGGTGGCAAGTGCACGGGGCACAGCGGAGTCTGTCCTGTGACGCGCAAGTCTGAGGGTCTGGGCGGCGGGCG 
GCTGGGTCTGTGC^TTTCTGGTTGCACCGCGGCGCTTCCCAGCACCAACATGTAACCGGCATGTTTCCAGCAGAAGACAAAAAGACAAACATGAAAG 
TCTAGAAATAAAACTGGTAAAACCCCAGCGTGGTGCCTGCCTCTTTGCTTCCTGGGCTGGCCGTGAGCCAGGGACGCGTGTCCTGGTGCCCTAGAAC 
CAGGGCAGGGTGGCAGGCTTGGCGGATGTGGGAGGCCGCAGCCTGTCCTGTGCGCTGTGGGAAGTTCAGCAGCATCCTGACCTCCATCCCCGGGATG 
ACAGTCACGCCACCCGCCGTGACAACCAAGAATGTCTCCTGACACTGCCACATCCCCGGGGGTGGGGACAGAATCCAGCCAGGAGCAGGCACACCCC 
TCCCAACTGGGAGGAAGCCCTCAGCACAGGTGTGTGAGGTGGGAGGCGGTGTCCTGTCCCCGGGAGGCTCCAGAGAATAATTTGCAGGCTGCCTGGC 
TGGGTGAGCCCACCTCCAACCACGCGAGACAACAGCTCCGGCCTGGGTGACGTGAGCGGTGCCCATTGATGGGGAACATCTTCCCCCTCTTCCTTGC 
CCCACCAGTTTGTCTTCCCGGGTTATTTGCAGATAGGAAAATAAATAAAGC^ 

ATGAATCATGGTCCTTTGGCGACGCCAGGCTCCGGGAGAGCAAAGCACCGTGTCAGGGCCATGATCCGGGGTGGCCTTTCACTGGGATCGTGGGGAC 
CTGGAGGCCGCCTTATAGGACACCCATGACGCCCACCTCTGGATTTCAGGTGCACGTGACTGGACTTAACTTCAAACCCCAGGGTGGAGGCAGGTAG 
TGGGAGTGCCCTGGGAAGGTGTCCTCGGACCTTGGTCACTGCTCCTGAACCCATCTGTGAGGCTGGTTTGTCCTCATCCCAAGCTAAGTGGAAGCTC 
AGGTCCCAAGCCACCGATGGGTGCTACTTGTCAGCTGCAGGTTGAATCTCCGTGGCCTTTATGAAGCACCTGCTGTCTACCCTTCCTGCCTTGTAGA 
GCACTCCTCCCAGGGCTCAACAGTGGGGCCGGGGTGGTCGGTGTGTTGGCTCCACAGGCGCCTGCCCTGGGAGGAAGGTGGGGTGTGGAGGGAAACG 
CTTGGCCCCTGTAGGTCTCCACCAGCCTCTCCCCTGAGGGTGGGGGCTCCGGGAGCCTTCCTCGAGGGAGTCCTATATTGAGTGGGTGGGGGAGCCT 
GCAAGGTGCCCCTGACAGGTCACATCAGAAAGAGCTCAAGGGACAGTCGGAGCCAGAGGTGACACTGGTGGCCACTCGGGTGGCTCACAAGGCCCAG 
CTCCTCCTTGCTCCTGGGCAAATTACTCTGAAGGCAGGGACCAGGTCT^ 

GGCTGGGGTCCTAGGGAAGCCATGTCCCCACCCCCGGCCTGCAGCTGGGTTTACATTCATCCCCCGAGAGCACATGGGTGTAGCAGGAGGCCTGTGC 

AGAGAGCTCCGACCATCGCACAGGGCACCTTTGGTTGTTTCACGGAGCAGGCAAGGGAGCCATCGGATCCTGTTAGGTTTGAGCAAGGATGTGGGGA 

AGAAGCTGGAGAGCCACTTTGCCATGCAGGGAGAGGAGCACATGGGTCTAGGGATCTACTTTAGTGTTTGGAAGGTTTTTTAAGATGAAAGAGGGAT 

GTGTAGGCTGATAGGTCTGGCAGAGCCAAAAGGCAGCGACATGTCTACTGGGAGAGATGGAGCTGAGCGCGGGGCTCAGGCAGGGTGGCAGGGCAGG 

GCCGGGGCCCTGGGTGGGTCAGGTGGGTTCACAGCCAAGTGTGTAGAGAGGGCTTGGGCCCAGAGTGAAGCAGTTGCAAGCTCTCCCACAACCCATT

CTCTCTGTCTCGGCATCTGTGGCATCCCGTGATGGGTGGGTCTGTACACACCCCACCCCTGGCTGTGCCACAATGGGGGTGTCTGTACACCCCTCAC 

CCCTGGCTGTGCCACGATGGGGGGGTCTGTACACCCCCCATCCCTGGCTGTGCCACGATGGGGGGTTCTGTACATCCCCCAGTGATGGGTGGGTCTA 

TACATCGTGGTTATGCCACGAGCTCCAAGGCTGTATCAGTCCGTTTTCACACTGCTAACAAAGACATACCCGAGACTGGGTAATTTATAAAGAAAAA 

GAGGTTTAATAGACTCACAGTTCTACGTAGCTGGGAGGCCTCACCATCACGGCGGAAGGCGAAAGGCCCGTCTTCCATGGCAGCAGGCAAGAGAGAA 

TGAGACTCAAGTGAAAGGGGAAACCCCTATAAAACCATCAGATCTGCTGAG 

TTCAATCACCTCCCACCGGGTCCCTCCCACAACACGTGGGAATTATG^ 

CAAAGGCATCGTCTGGCCACTCTAGGCCCTTGTTCCCA^ 

GACCTCGTCCCTTGTTGGAGTATCTGCGCTACCCAGAGGGTGATGGTAAAAACTGACCTTTGAGGAACTTCAGAGACAAAGCCTTCCGCAGTCAGCT 
CTCCCTGCAGCCCCTTCGCAGAAGCCTGTCCTCACACGCC^ 

TCCATTCTTCATCCTCTGCCCCCAACCCCCAGCCTTCCTATCTCCCTTCAGCTCCGTCCAAGGTCCCTGGGACTCAGCCCC^TCrCTCCCCAGCCTT 
GACCCCAATAAGCTCGATGTCTTTGCCCAGGAGCTCCTGACACACCAGACTGCATCCAGCTTCCTGTTCTGCGCTTGGCCCACCGCCTCGGTCCACA 
GGAACCTTCATCTCTGCTCTGACTGTAAGGATGGGCTCTCTGAGGGCTTCCCCACCCCACAGCAGGCAGGTGGTTCTGGGAGTCTGGG 
CCTGGGCACCTTCCTGCTCTGGCTGAGCCCCTCCAGGACGGAAATCT^ 

TAGGAAGGGGGCCGAATGGAAGGGAGGTCGTGTCGGCTCCTACAGGGCAGCCCCAAGCTGCTGGGCTGAGGTGCAGAGCCATGGGTCCAGCCTGACC 

CGCATGCCGGGAGCTGATTCCATGGCTGGAAGGGTCGGGGGTTCCCCAGGAGCCCAGACAGCACCTAGAGTTGTCACAAGGAGTGAAAGGCCCTGAG 

GGGACAGAGGAAGGAGGCTGGGCAGAGTGGGCACGTGGACTTCGGACCGCGCTTGTGGGATAACAGTGGCTAGGGCTGTGCAGAGGGTGGTGCACTG 

CCCATGGGAGTCCAGAGAAGCCAGGTCCAGTGGCAGTGGCATGCATCCGGCAATCC^ 

CCCAGGAAGCAGTCACAGCCTCGCTCTGCCCAGCATTGGGGGCCT^ 

CACCAGTCCCACCATAAGGCTAGTCCCCAAAGTCAGCACTGCAGCCAGGGCCAGCGAGGGCCAGCAAGGGCAGGCCTGTGCAGGGGCTGCAGCGTGG 
GGTGTGCGTCCCGCACTCTGGGCCAGCCTCGCAGCTCACAGGGCTCTGTCATCCCAGGAATGTCTTTGATCAAGAGTTTATGCAAGACGAGGAAAGC 
AAGCCCAGGACCTCTAATCCGTTCCCACAGTCCCCTGCCCGCCAGGGCCCCACAAGCCCAGCGCCACACCGCCAGGCCTGAGCAGACAATAGCAGGC 
GGACGGGTGGGCGGGCAGCTCTGGGGCCGCTGGCCTTGGGGCCCACATCCGGGAGCATTGTCACCGCCACTCACGAGGGCACACAAGGAGGGCTGGG 
GAAGAGAGAATGC^GAGCCCTCTGCTGAAGGGGCTTCGGTGGGCCTTGAGCCTTTCAAAGACGGCGGTGTCCATTAATCCTTTTAATTTGCACCTAG

CGGCAGCTGAGACAAGCCTCTCAACCGGCAGCCACTTTAAGAGCTGAGTGGTCTAACCCGAGGATCAGCTCCTATTCCAGACCCTGGGCCCTCTGAG 

CCTGTCGAGGCAGTCCTCCTTGCCTTTGTTTAATCACGGGAGGAATCGCAGCTGGAGGTTACTGAGCACGCACTGTGTGCCrTGTGCTCAACACaSA 

ACACAGTTTCTCTCTGAGTTTTGCAACGACCCCATGGGTTGGTGACAGGGAGGGGACACTCTGGTGGCAGCCACAGACACAGCCCTG 

GCCTACACTCTCCCGCACACAATCATCCCATCTGCTAGGTCAGGGATTGGCAGGTTTTCTGGAAAGGGCCACAGAG 

CCACAGGTTCTCTTGCCCTGTGCAAAAGCAGCTGCAGCCGCT^ 

CAGCAGACTGGATTCAGGCTGAGGGTGGTAGTTTGCCGACCCCTGTTGTAGATGAGTAAACCGAGGCCCAGAGAGGTAAAGTAACTGAGCCAGGACT 
GCACAGCTAGTGTGTGACCAAGCCAGCCTGGTGGGTCAGCCTCATGGAGAAGAAAAGGCTCAA 

GGCCATTTGCAGCCAGGCCCCACGCACACAGCTCAGGGCCCTGCTTCGCCAGCTCCACCTGTTTCGAGGCTCCCAGTAGGTTGCCCCTGCGGGCGGC 
ACTGTGTCTTGGCTGCTGGCCTGGAGAAGGGCAGCCAAACAGGCCGGCAGTGGACATCTGAGGGTGACCAGCTGTTTAAAAAGAAAACTCACCGGCA 
TGTATAGGCACACATTAAGCACACACGTTTTGGAGTAAA 

GGGCCTCAATGTTCACACCTATAACATGGGGAAATCCTGCAAGAACACAGCCCATGATGGGGAGTCGTCTCCATCTCCTACTCACGACAA 
CACAGAACGGGGGCTCCATAAATCTCTGTCAATAACTGAATTCTTTCTGGGGTGCTGAGGATTCTATGGCTCAATGTGTGTGAAGCACTGCTGAGCA
CCGAGCTTGGGGTGCAGTGATCACTCAGGGAACGCCATCATCCCCTCGGCGCATCCGCATAATCACCATCAGCATTCCATGCA 
AAGCACAGGCCTCC^TGCCACGGCGGCCGCCTGGGTGGTGGCACCAAA 

TTCTTGACACCACGGATGCCCCCTCCTGGTGTTGACAGCACCCAGACTTTGCTCCAGGGAACCACCATTCCCAGATCTGTGGTTTGAGTGGGACCGA 
ACCCATCCCCAGCTCCAGGCCAGGTCTTCATAGGCTGAAACCCAACTGCACATCCACA 

TCAAATAGAGCCATTGAGTCTTGAGGAGACATTTTCTGGAGCTTCTGAAAAAGATAATGCTTCCCCCTTCCTTAGGATTCCCAGCAGGGAAACACAA 
GCCCAGGGGTGGCACTGCCCCCCTGCAGCCCTAAGGGAAGACTGAGCTGCTCCCAGGACCTCACAGGGGAGGCTGAGCGTAAGGCCAGCCCTGAGGG 
AAAGCAGGGAATGGGTCTCCAGACAAACTGCTGGATCAAGCCTCTCCTGAAGTCATCCCCCTTTTGTTCCAGAAGCCAATCATTGTCCTTCATTAAG 
TCAGTTTGATATGGGCACTGTGTCATTTGGTTTTCAAAGACACTTAACTATTACAGCT^

CTCTCGCCCAGGCTGGAGTGCAGTGGCCTGCTCTCGGCTCACTGCAAGCTCTGCCTCCCGGATTCACGCCATTCTCCTGCCTCAGCCTCTCGAGTAG
CTGGGACTACAGGCGCCCGCTACCACGCCCAGCTAATTATTTGTATTTTAGTAGAGACCGGGTTTCACCGTGTTAGCCAGGATGGTCTCX3ATCTCCT
GACCTCGTGATCCACCCGCCTCGGCGTCCCAAAGTGCTAGGATTACAGGCGTGAGCCACCGCACCCGGCCGCTTGTGTGTGTTTTCTACCTCTTCTA
GATCTAACATGTTATGATTGATATAAACAACAGGAGCGACTGACAAAGTTACCATC^

GTTCCACAGGGTCAGGGAATGACTCCTCTTGGGGCCACCAGTGCCCAGAAGTATGCCTAGAACTGCTCTGTCTGAAGGGACAGGTCCCTCTTCCATG
GCACACACTCTGTGTGTAACAGAGCAC^CGTGTGACCAAACTGACCTGGAGAAAGGGCCTGACCACATCCATATCCCAAGTTCTGAGGCCGGATAGT
GCCTGCAGGGGCTGAGACCAGGGGAGAGGCTCAAGCCAGCCTTCTCTGGTGGACCCTGACTCAGTCCTCTGTGCAGGGCAGCTGCGCCTCCCAGAAG
GCAGAGGCCTCTTGGAGAGTGAGCCTTGCGGTCACAGCGCCTTTCGGGGGCTCCCGCCACTTGCCCTCAGTGGCGGCTGAGACCTGGGGGAGTCCAG
GCAAATCCCTGAGGAGGCCCCACATGCAAGGCCAGAAGTGTCTGGATGGTGTGGCCX5GCACCCTTACCCTCCAGACCAGCAAACGAAGCCCAGGCA
GGAGAGGGGCCTTGTAGTTTTCTGTGGCTGCTGTAATGAATTTTCACACACTCAGCAGCTTAAAACAACCAACATTTCTCTCATAGTTCCAGAGGCC
AGCAGTGTCTGCAGTGCCCGCTCTCTCCCAGCTTCTGGTGGTTTCGGGCGACCTTCGGCTCTTCTCAGCTGGTCCATGTGCA

HUMAN SEQUENCE - mRNA

GCAGTAGCAGCGAGCAGCAGAGTCCGCACGCTCCGGCGAGGGGCAGAAGAGCGCGAGGGAGCGCGGGGCAGCAGAAGCGAGAGCCGAGCGCGGACCC 
AGCCAGGACCCACAGCCCTCCCCAGCTGCCCAGGAAGAGCCCCAGCCATGGAACACCAGCTCCTGTGCTGCGAAGTGGAAACCATCCGCCGCGCGTA
CCCCGATGCCAACCTCCTCAACGACCGGGTGCTGCGGGCCATGCTGAAGGCGGAGGAGACCTGCGCGCCCTCGGTGTCCTACTTCAAATGTGTGCAG 
AAGGAGGTCCTGCCGTCCATGCGGAAGATCGTCGCCACCTGGATGCTGGAGGTCTGCGAGGAACAGAAGTGCGAGGAGGAGGTCTTCCCGCTGGCCA 
TGAACTACCTGGACCGCTTCCTGTCGCTGGAGCCCGTGAAAAAGAGCCGCCTGCAGCTGCTGGGGGCCACTTGCATGTTCGTGGCCTCTAAGATGAA 
GGAGACCATCCCCCTGACGGCCGAGAAGCTGTGCATCTACACCGACGGCTCCATCCGGCCCGAGGAGCTGCTGCAAATGGAGCTGCTCCTGGTGAAC 
AAGCTCAAGTGGAACCTGGCCGCAATGACCCCGCACGATTTCATTGAACACTTCCTCTCCAAAATGCCAGAGGCGGAGGAGAACAAACAGATCATCC 
GCAAACACGCGCAGACCTTCGTTGCCTCTTGTGCCACAGATGTGAAGTTCATTTCCAATCCGCCCTCCATGGTGGCAGCGGGGAGCGTGGTGGCCGC 
AGTGCAAGGCCTGAACCTGAGGAGCCCCAACAACTTCCTGTCCTACTACCGCCTCACACGCTTCCTCTCCAGAGTGATCAAGTGTGACCCAGACTGC 
CTCCGGGCCTGCCAGGAGCAGATCGAAGCCCTGCTGGAGTCAAGCCTGCGCCAGGCCCAGCAGAAC^ 

AGGAGGAGGAGGAGGTGGACCTGGCTTGCACACCCACCGACGTGCGGGACGTGGACATCTGAGGGGCCCAGGCAGGCGGGCGCCACCGCCACCCGCA 
GCGAGGGCGGAGCCGGCCCCAGGTGCTCCACATGAC^GTCCCTCCTCTCCGGAGCATTTTGATACCAGAAGGGAAAGCTTCATTCTCCTTGTTGTTG 
GTTGTTTTTTCCTTTGCTCTTTCCCCCTTCCATCTCTGACTTAAGCAAAAGAAAAAGATTAGCCAAAAACTGTCTTTAAAAGAGAGAGAGAGAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGGAACACCAGCTCCTGTGCTGCGAAGTGGAAACCATCCGCCGCGCGTACCCCGATGCCAACCTCCTCAACGACCGGGTGCTGCGGGCCATGCTGA
AGGCGGAGGAGACCTGCGCGCCCTCGGTGTCCTACTTCAAATGTGTGCAGAAGGAGGTCCTGCCGTCCATGCGGAAGATCGTCGCCACCTGGATGCT 
GGAGGTCTGCGAGGAACAGAAGTGCGAGGAGGAGGTCTTCCCGCTGGCCATGAACTACCTGGACCGCTTCCTGTCGCTGGAGCCCGTGAAAAAGAGC
CGCCTGCAGCTGCTGGGGGCCACTTGCATGTTCGTGGCCTCTAAGATGAAGGAGACCATCCCCCTGACGGCCGAGAAGCTGTGCATCTACACCGACG 
GCTCCATCCGGCCCGAGGAGCTGCTGCAAATGGAGCTGCTCCTGGTGAACAAGCTCAAGTGGAACCTGGCCGCAATGACCCCGCACGATTTCATTGA 
ACACTTCCTCTCCAAAATGCCAGAGGCGGAGGAGAACAAACAGATCATCCGCAAACACGCGCAGACCTTCGTTGCCTCTTGTGCCACAGATGTGAAG 
TTCATTTCCAATCCGCCCTCCATGGTGGCAGCGGGGAGCGTGGTGGCCGCAGTGCAAGGCCTGAACCTGAGGAGCCCCAACAACTTCCTGTCCTACT
ACCGCCTCACACGCTTCCTCTCCAGAGTGATCAAGTGTGACCCAGACTGCCTCCGGGCCTGCCAGGAGCAGATCGAAGCCCTGCTGGAGTCAAGCCT 
GCGCCAGGCCCAGCAGAACATGGACCCCAAGGCCGCCGAGGAGGAGGAAGAGGAGGAGGAGGAGGTGGACCTGGCTTGCACACCCACCGACGTGCGG 
GACGTGGACATCTGA 
TABLE VI



MOUSE NOMENCLATURE 
ICSGNM Ccnd3 
Celera mCG15576 

HUMAN NOMENCLATURE 
HGNC CCND3 
Celera hCG16683 

MOUSE SEQUENCE - GENOMIC 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

AAAGGCGTATACTACCACATACATCCCAAAATACATATAAAAATTAATAGAGCTAGTTGAGAAGTCTCTCCTCTCGTGATAGTCTCTGATTTGGGGA 
TGACACCTTTCTTGAATATTTGTATCAACCTATTTGCTGCAGTGTAGGTGAAGGCTAGCCCAGTGTGTAGATGAAGTCTAGCCCAGTGTGTAAATGA
AGGCTAGCCTGGTGTGTAGGTGAAGGCTAGCCCTGTGTGTAGGTGATAGAATTGAACTTTTACTTGAAGAGTGAAAGTGATTTAAACAATGAGGAAG 
CGGTAATGCTTTACTCCTCACAGAAAGTCCAGAGGTAGCCTCAGGACTGGTTCAGTAGCTTAGCAGTGTCACTCAGGCGAGGGATTCATTCAGATAA 
TCCACTAGGCCCCTTTCTTCATTGCAGAGATGGCAGAGGTGGCCCCAGCATCATAGTAGAGTGGTTACGGGCATAGCTTTGTGGTAGCATTTTTCTG 
TGATCCCTTTGGTTGGGAAGCTGAGGCAAGAACATTGCCAGCCTAGGCTACATGGTGAGAATCTGGTCAGGCAGGGCACACAGCAGAGTCCTGTCTG 
AGAAGGGTCAAAACAAAAO^y^GCGCAGGGCATATAAAACAGACCTTGTTTGCTAGAGCGTTTGCCTAGTTGCCCACCCTGTAGGAAGCCCTGGGT 
TCCCTCCCTAGCAGCTCACGAGCTGAGCATAGTGCTACACACTATAGTGTTCCACCATTCCTGCACTGGGGAGGTAGAGGCAAGAAGATCAGGAGTT
CCAGGCCCAATCAAGTTTAAGTCTAGCCTGAGGTCCATGATTCCCTGCCTCTAAGAGAGATTTGAGGTTATCCTCAACTGTACATCACATGAGAGGC 
TAGCTTGGGCTACGTGAGCCACTATCTGAAGAAAAAAAGAGCTAGGAAGATAGCTCAGTGAGATGGTTCAGCAAGTAAAAATGCATGGTGGACAAGC 
TTAAAAACCTCTGCTCCGGGGCTGGAGAGATGGCTCAGTGAGTAAGAGCACTGACTGCTCTTCCGAAGGTCCTGAGTTCAAATCCCAGCAACCACAT 
GGTGGCTCACAACCACCCGTAATGAGATCTGACGCCCTCTTCCACACAATAAATAAATCTTTACAAATAAACAAACAAACAAAAACCCCTCTGCTTG 
GTCCTGGATCCTACTGTGGAAGATGAGGGACACACCAGGAGGTTGTCCACAGGTGCAAACACACACATATGCACACCCAATGTAATTGGAACATTTC 
AAATATATCTGAGCATGGTGGTAAAGTGCCTTTAATCCCAGCACTCAGGAGGCAAAGGCGGGGCGATCTGTGTGAGTTTGAGGCTAGCCTGGTCTAC 
ATAGTGAGTTCCAGGACAGCCAGGGCTACACAGCAGTGAGGCCCTGTCTACAAATAAATAGACAGACAGATAAATAAATAAGCGATAAAGTATAACA 
GATCAGTTACACAGTAAGGCGTATAACAGATCCAGTAGTGTTACACAGTAAGGCCTGACTGATAGATAGACCGGCATACAGAATGATAAAATATAAT
GGGGCTTGTAAGATGACACAGCAGGTAGAGGTGCTTGCTGCCAAGATCTTCAGAACCTATGTAATGAAAAAAGAGAACTAACTCTCTCAAGTTGTCT 
GCTGACTTCTACGTACATATGCATGGCATACACACGCATGCGGAAACACATACACATTTAAAATGAAATAAACCAGTTATAATGGGGAGCTAGCTAT 
AGCTCAGTAGGTAGAGTGCCTGTGTAACCTGTAGGATACCCTGGGTTCAGTCCACAGCATTGTATAAACTGGTCAAAAGATGATCATGGTTGTAATC
CTCGAATTCAGGAGATGGAGTTAAAGAGAAGTTGGATGCCGTCTTCATCTACAAAGTGAGTAAGAGGCTAGCCTGGGCTGCATGAGACCTTATCTGA 
AATATAGTAATGGATATTGCTATCTCAGTTTTGCAGTAGGCACAAGGTTGTATATTCCTGCCGTCTCAGAGCCTCAGAGACAGAAACAGGATGGACA
TAAGTTTAGACCAGACGTATTCACTGTGAATTCTAGGCCAGGTAGACCTAGCAAGAGTTGCCTTAGAAACATCAACAAAACAGCAGCAGCAGCTTTT 
CATGAAACAGATTCCAGTCTGATGTACACACACATATCAAACCGAGCGGCTATTTGTTCAATTCTTTTTTGTTGTTGTTGTTGTTGTTGTTTGTTTT
TTGTTTTTTTGAGACAGGGTTTCTCTGTGTAGCCCTGGCTGACCTGGAACACACTTTGTAGACCAGGTAGGCCTCGAACTCAGAAATCCACCTGCCT 
CTGCCTCCCAAGTGCTGGGATTAAAGGCGTGCACCACCACGCCCGGCTCTATTTGTTCAATTCTTGAGCATGAGGCTAGCTCTATCAGGAGAGTTCT 
GACCTAGAGGGGCATGGGACTTGTCATTTCAGAGTGTGGCAGCCTCCTTCTGAGTACCAGGTACCTGGATTGCTTAGTGGCTTGTTCTATCGGAAGC 
TATTGTACCTGCCCACAGCAAGGTGGAAGTGTCTAAGAACTGAACCGATCCTGAGTCTGCCTACAGAAGCCTTCAACCAATGGCAGCTGAGAATGTG 
GAGACAAACATCTTGCCTCATTGTGCCTCAGCTACAATCATGATACAGACAGTTGTAAGCTGCCATGTGGTTGTGGGAATCGAACCTGGGTCCTCTG 
GAAGAGTAGTCAGTGCTCTTAACTGCTGAGCCATCTCTCGAGCCCTTGCCCAGCCTTTTTTCTTAAAATTGTATTTTTAAAATTTATTTTCTGTACA 
CAGGTGTGTGAGTGTGAACATGTGGATCCTAGGGGTTGAAATCAATTTGCCGGGGTTTGTATGGTGCCTTCTCACCAGCTTTAGAGTTTCTTGATTA 
CTAGGGAAACCTTCCTGGAAGCAGGATGCTACTCAGTATTTTGATCACTCATAAGACATTTCATTGTGAAGTGGAAAGAAAACCTTCGTATGTTGTT 
TTGTCTTGTTTGTTTAATATTATATTCTACAGAACATTATATACAGATGTCATAAGGACAATAAGGACATAGTTTATTATTTATTTACTTATTTTTG
AGAAAGGGTCTCACTAAGTAGCCCTGCCTGGCCTGGAACTCACTATGTAGTCTAGGTTGGACTTGAACTCACAGAGAGCCACCCACCTCTGCTTCTG 
GAATGCAGGGACTAAAGCCATGAGCTACCATGCCTAGCCAGATTGTAGTTTTGTTTGTTTTTGTTTTGTTTTTCAAGACAGGGTTTCTCTGGGTAGC 
TTTGGCTACCCTAGAACTCACTCTGTAGCCCAGGCTAGCCCCAAACTCACAGAGGTCCATCCATCTCTGCCTCCTGAGTGCTGGGATTAAAGGCATG 
CATCATCACACCAGCTGAGTCTTTTTTTGTTAGGAAAATCTCCAAGTTGTCTATCCTTGAGCTTTGTGGTATTCTTTTCCTGAGACTGGGTCTCATG 
TAGCCCAGGOTGGCTTCAAACTCAGCGGATAGCTGAAGATGCTCTTGAGCTTAAAACTCAGCTTCTTTATTTTCTTTATTTACTTACTTTTATTTTG 
AGGCAGAGTTTCTCTGGTGTAGCCCTGGCTGACCTAGGACTCTATTTGTAGCCCAGGCTGGCCTTGATCTCACAGAGATCTGCCTGCCTCTGTCTCC 
TGAGTGCTGGGCTGAAAGGTGTGTACTGTTGCACCAGGCTAGCGGCTTTTTTTTTTAAAAAAAAGATTTATTTTATTTATGTATATGAGCACATTGT 
TGCTGACACACCAGAAGAGGGCATCAGATCCAGTTATAGTCTGCTGTGAGCCACCATGTGGTTGCTGGGAATTGAACTCATGACCTCTAGAAGAGCA 
GACAGTGCTCTTAACCATTGAGCCATCTCTCTTCAGCATGGCCTTTTTTTCTTTTCCTCTTTCTTTTTCTTTTCTTTTCTTTTCTTTTCTTTTTCTT 
TTTCTTTTTCTTTTCTTTTTAAAGGCTTAATAACTATTCTTTTTTTTCTTTTCTTTTGGAGACAAGGTCTCATTTTTAGCTTAAGTTGGCCAGGAAT 
TCACTCCATAGCCTAGGCTGACCTTGAACTTGGGGAGATTCTCAAATTGGGATTCCAGGTCTGTGCTACCAGGCCCAGCTTATACAATTGTGTTTAT 
TTCTCAATTTAAAAAAAATCCATTTTTATTTTATGCGCTGGGGTATTTTGCCTGCATGTATTTATGTGTACACGCATGCCTGGTGCCTGCGGAGGCC 
AGAATGAGCCTCAGAGCCCCTAGAAATAAAGTTACAAATAGCTGCACAGGTGCTGGGAAGTGAACCTGGGGCCTCTGCAGATAGCAGCAACTGCTTT 
TAATCGCAGAACACTCTTTCTAGCCTTTAATTTCTCAATTGAGGTAAATGTTATTAAAAGTTATAAAAATGGGCCATAAAGGGTATGTGTTCTTGGC 
ATACAAGAAGCCCAGGGTTCAGCCTCAGCCACTATTATTATGTTGATAGTTGTAAATACCCATACACGAGGACAAATTTGGCTTTTACAAAGCCAAG 
CC^TAATCAGTTGGACIAGTGGTGGCGCATGCCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGGC 

AGAGTGAGTTCCAGGAAAGCCACGGACTACACAGAGAAACCCTGTCTCAAGAAAACAAAACAAACATCCATAATCACAACTCAACTACCTGGGCACC
CACCCCCCTGCATCCCTGCCTCCCCTGCCCCTCCACACACACACTGTGGCTCTGGGGATTGAACCTAGGGACTAAATGAATTCCGAATAAGTGCTTT 
CCCAGTGAGCTACCTCCAGCTCAGAGAAGAGTGTGTGAGAGAGTGGAGAGGGTAATTGTTAGTATATAAGTAACCTTTTTGCTTTTGCTTTTTTTTT 
TTTTTTTTTTTGAGGCCGAATCTCACTATTCAGTAGTCCAGGCTGTCCTTGAACCTGCAGCAATCCTCCTGCCTCAGGAGGATGAATGTTACGATTA 
CAGGTGTGGGCTACCACAATTGGTTTAGAAACTACCACATTTGCATGTTCCCTTTGCCCCGGATACCGTGCAAGATACTTTAGGCCCTTTGGTCATC 
AAAACATCCCGGTAGGGTACGGACTTTGTTATTACCCAACTGGATGAAAAGAAAAACAGACACGAGAGAACCAGTAGGTTTCAAACTCACTCCAGAG 
TGGCCTCATTCCGGTGTTGTGTCCACCTTCCTATTTAGGGGTGGTATCTCCTC^TCTTAATCCCCTACCTGGTTTCTCrrCTCTCrCTCTCTCTCTCT 
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCACACCTCACACACACACACAC^^ 

GGGTGAGGTGAAGGCCCTCACCTGTCACTCTGTTCCCCTTCTGGGTCTCAGTTGGAGAGATTCCAATGATAAAGAGGTGAGGGAGTGGTAGGAGGAA 
GGACGCTCAGAGTCCTTCATCCTCTGGCCTTTCCTTAGGGCTTGTGACAATTACATTCCAGCTGTCCTCTAATTTACCATTTCATTGAGTAACTTTG 
TAATTGGATGCACCTGGCTTGAAGCACTTTGATTTATTCCCCCTGGTTATGGGGAGTGTGAGCTGGGCTGGCTCTTTCTTTCCCTGTCTCTTTGTCT 
CTCTCCTCCTCCACTCCCCTCTCCTCCTCCTCCTCCTCCTCCTCCTCCTTCTTCTTCTTCTTATTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCT 
CCCACCGCTCTTGCTTGTTTGTTTGTTTTTTGGTAGTTGTTTGTTTTGTTGTTGTTGTTTTGTTTTTGAGGCAGTCTCACCTAGCCCAGGCTGGCCC 
TGAATGGCTGCCCACCTTGCTTCGACCTTCTAAGTGTTGAGATTCCCAGTGTGTGTGTGA

GCAGTGATGGTGAAGCCGGATAAGCAGACCTCTCTGGAGTTCTCCCGGGGAGGCCTCCTGCTCCCTCAGCAACCACCCTTCATGTAGCTGGCCAGCT 

CCTTGGCCCTGAAGGCCCAGGACAGTGGGGCTGTTGACCAGTTATCC^GAAACCTTCTAGGACTCAACTGTCTCTCTCTCTGGGTGCCCTTTCTCA 

GTAGGTCAATGAGGAATAGATCAATATACATCCTTGGGACAGCACAAGAGGCAATGGAATGAAGTGGTCAAGAGTTGGTACTGACTGCTGCTTTCAG 

ATCTCTGTGGCTCAGCACCGCCCCTGCGCCTTTGAGTTCTAACTCAGTGTGCTTAATTTTTTGTTTCCTTCATTCTGTCTTCCAAATTCTTGAGCAT 

GTTTGCCCCGGC^CTCTGGGATTTCACCCTGTTTGTTATTCTGAAGACACTCTCATCCTTTTCTATTCCCATCGGTGCATATTTCCATTTATCGGTG 

T T AGATTGTGAATGT C^GCTCAG AAGNNNNNNNNNNNNNNNNNNNNl^NNNNNNC CTTTCTTT AT ATAAG^ TAAAGAT 

AGGAGCTGAACCAAGTGGATTCTGGGAATATGGTGGCTGTTGTCTTGGCAACAGGTGTTTGGGGTCCTGCCTATGTGATGTCACGGGTTTTGGAGG 

CCTGATACCAACAGTCTCTCTCTCTCTCTCTCTCTCTCTGTCTCTCTCCCCCCGCCTCTTTCCGCCACCACCTTGCCACTCTCTCTCCTAATAAACC 

TCAGGTGGAACTGTTTGGCCTGGTGCAGTCCTTCTGAAGCTGACATTCACCAAACAGTTGGCAACAGGAGGTCCTGATACTAACACAAAAGGCAAGG 

CTTTCTCTCCTCCTCAGTTCATCTCTCTTCCTGGAAGCTTTCTTCCCT^ 

TAAAACAAAAAGAAGCCATGGGGGTTAGAAAGGTAGCTCAATGGTAGAGCGTACCTACCTAATATGACCAGCATCGATTTCTCCTGATGTCCACCCA 
TGCAAGGGCATCAACTCTGGAGCTAAACCAGACTGCAGCCTGCTTTTACCAACTCCGAGCTATGTGACCTCTCCGTGGTCACTTAGCCATCCATGGT 
GAGTATTAGCACCCAGCCCCTATTACACAGGGCAAAAATAATATACTTTCTGTAAGGCTGTTGTGAAGATAAAATGATTTATTACCTCTCAGATATG
TTCAGTTCTTGGACATGGTATGTGTACATGAAGACCTGTTACTGTTGTTGGTACTCACAGCCAGTTGTTGTGTCCAGAGGATACACAGCTACGCGGG 
ACTTTATCTCTGCCCTTCAGGACAGGAAGTTTGGTGGTGCATTCCTGTAATCTCAATATTTGAGAGGAAGAGGCAGTAGGATTGTTGCAAGCTGGGG 
GCTATTCTGGTCTACCTAGAGTCAGAGCTAGATAGAGTCAGGTCTATAAATGTCAGAGCTACATAGTGAGAACCCGTGTCATAAAACTAACAGAATG 
TTAGTGTGGGTAAGGC^TGCAATTGGGAGGTAAAAGCAGGAAGATCAGATCCAGCCTCAGTTATATACTAAATTCAAAGCTAGCCTGGGTGCCGTAA 
GACCCTTGTCTCAAAAAGTGTCCCCTCCCTCATTCGTTAAGTGTAAAGCTTGATGTATAAACATGGTGGTACGAATTTGTAATCCCAGCACTTGGGA
AACTGAGGCACGAAGCTGGAGAATTCGAGGTCAGCTATAAAATAAGACAAACTGTTTCAACGAGGAGAGAAACCACTGGGGCAGTGACCTTCTGTGA
CTGAAGCAATCCCATCATGCCTCCTTCCTGTATGAAAAGCCATCCCCAT^^

TGTGTAGACTAGTGTGTAGACCAGTGCATAGTCCAGTGTGTAGACTAGTGTGTAGATGGATCTCACTGCTTATATATTGTCCTGTGCTTTCCTGGGG
CTCACTTGGGCTGAGCAGAGAGAGCAGTGTCTTAGTGGTGGGCTTTCTTCTTTCTTTCCCTCCTTTCCTCTTTCCTGCCTTCCTTATTACATCCTTT

TTTCCCTTGATTTTTGTTTTTGAGACATAGTCTCATGATATTAGCTCCTCCTGACCTAGAACTCTTATTTAGACTAGGCTGGCTTTGAACTCAAAGA 

AATCTGCTTGCCTCTGTCTTCAAAGTGCTGGGATTAAAGGGGTGCTCTGTCATGCCTGTCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCT 
TCCTTCCTTGTTATTTGAGTCATCTGACATGGGTCTGGCTGGCCTGGAATTCTCCATGGGCTATTATCATTTGGTTTGCTTGGTTTTGTGATAAAAT
CTTATATAGCCCAGGCTGCTCCTGAACTCATTAGGAAGCAGACAGTGACCTTAATCTTCTGATCGTTTCTGGGTTCTGCAGTGCTGGGGATTGAGCC
CTGGGCTCTGAGCATGCTAAGCAAGAGCTCTACCACCTGAGATATATCTGCAGCCCACCCCCTGCCACCGTGTGTGTGTGTGTGTGTGTGTGTGTGT
GTGTGTGTGTGTGTGTGTGTGTGGAGAGAGAGAGAGAGAGAGAGAGAGAGACTCTAGGCAGGCCTTGAATTGTCTGTATAGTGGATAGTGGATCCTG

GTGTTAAGCCAACACTTCCCTGGGTCTGAGATTACTGGCTTGCTCCACCACAGCCAGCACCAAATGGCTTTTTATATGCTGATTTTGACCAACACTG 
CTCACACCTCCGTTGGTGCAACGCAGTGTCCCATTCAGCAGACCTGAGAAACTGCATTTCTTCTTTTGCTTTTTATTTTCGTTGTGTATGTGCATGA
AGGTGTGTGCGCACATGCTACATACAGCCTTGGTGTAGAAGTCAGAGGACAGCTTATGGGAGTTGCTCTCTTCTCTTCCATCATGTGGGTCCCAGGG

ATCACACTCAGGTTGCCAGACTTGATGCCAGGATAGTTTACCCTCTGAGCCATTGAATCGGACAGAAACTACATGCTCCCAGATGCTGCTGCAGATC 
TGCGCAATTCGTTAAAGTAGACGAGATTCTAAGTCAAGTTTCCCGACGGACCAACAATGAGATCTTTGCAGCTACTATTGTGCCTCAGTATCATGGT
CTGCAAAATAAAGCCCAACTCGAGGCCATTCCTAGAAGCCACTTTAGGGTAGGAGTGACCC^CTATGTACAAAGTTATGACCACATTCCCTAGAGC
AATGAAATATGACGTATACACGAAGGAATAACTTTTGGCGCCTAGTTCTTTCCTGGGTC

CACCCTTCCAGAAAGTTCTCTCTACCCTCTTGGTCAGAAGGGACCCGAGGGCCCGGAAGCGTGCGGGCCGGAGCCCAGCGGACCAATGGGGCTGCGG

p| GGAGGGNNNNNNl^NNNNNNWNNNNNNNNNNNJWNI^NN^ 
NNNNNNNNNNNNNNNNNmWNNNNNNNNNNNNNNNNNN^ 

Ms NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNCCATCGCAGGATGGAGCTGCTGTGTTGCGAGGGCACCCGGCACGCGCCCCGGGCCGGGCCCGACCCGCGGCTGCTTGGGGACCAGCGT 
GTCCTGCAGAGTTTACTCCGCCTGGAGGAGCGCTACGTGCCGCGAGCCTCCTACTTCCAGTGCGTGCAAAAGGAGATCAAGCCGCACATGCGGAAGA 
TGCTGGCATACTGGATGCTGGAGGTAAGGCCCGAACGGGTCCCCTCTCCTCACCCCACCCCCACCCCCGGCTCTGGGACAAATTGAACTTTCCCTGT 
CGGATCTTTGCGATCCCGACCACAATCTGCAAGATGCTTTGGCGGCCCAGACTACTGGGGCCTGGCCTTTTCACACAGGCGCCCTAAGCAGCTGCCT 
GCGTTCTGTCCCTTTCCTTTCCCCAATCCTGATCGGCTTCTTCTGTGATTTCCAGTCACTTGGCACTGTAACTAAACTTCACTTTGGGTCGGGAAAG 
CGGGCTAAGGGCCACTCTTTCCCCAGTGGCTTTCCTCGGTGTCTCCTGCTTCCTTCTTCACCCGGTGCTCTCTGGAGGGCGCGTTTGGAAAACCCTG 
GCCTCGGCCGCCTTAGCCCCAGAACTGGTTCCTGTGTCTATCGCGGGGTTCTGGCGGGGGAAACTGGTGGGTGGGGGTGGCGCAGGCTCGGTTCCGG 
TCCACAGGGAAATGAAAAGTGCGGGAGACCGCGGCAGGAGTTCCCTCCCTTCCCCCTGGGAGTGTGTGTGCAGTGGGTGGCGGTAATGAGGGTAAGG 
TCCCAATTCTCTCTAGGTGACCCTTCCGTCCCCCTTCCGCTCGCCGTGGGGGAGGGGAGTCTGGAATCCTTCGTGAGCATGGGCTTCTCTTTTACAC 
CCCCTACTCCGGGGTGGGGAAGAGACCATGCAGAGGAAGTGGGGGGGTCACCTTGGGGAGGGCACATCCTCCTCGGTTATTGATACTTGTCGCTCTA 
AGTTTATCCCTTCTCGGTTTATTTTTTGCTTCGCATTCCACTCTCCCTCCACCCCCCACCCCCACTTCTCCACGCTGCCGTGTTTCACTTGGTCCTT 
CCCTAGCAATGACTCGTGCGCCCTCCCTCAGGTGTGTGAGGAGCAGCGCTGCGAGGAGGATGTCTTCCCTCTGGCTATGAACTACCTGGATCGCTAC 
CTGTCCTGCGTCCCCACCCGAAAGGCGCAATTGCAGCTTCTAGGTACCGTCTGCCTGTTGCTGGCCTCCAAG<rTGCGCGAAACCACGCCCCTGACTA 
TTGAGAAGCTTTGCATCTATACGGACCAGGCTGTGGCTCCATGGCAGTTGCGGGTGCGTGCGGAAATCCTCCCCCCCCCCCGCCCCTCCACCAAACA 
TACGATCGCCCTATGTACTAGTAAAAGAAAAGCCAAGCTCACTCCCTCACCCACCAACCCTCCCTTCGCCCCTGGCTGGTATTTTGGGACAAAGGCC
TTGTAGTGTGGTCGTGGTGCTTCGGTTGCAAGCTTAAAACTAAATGAATCCCTTTCCTCTGGCCGCAGTGGGGTTCCACTCCTAACTCGCCCCCCCC 
CCCCAATAGCAACAAGTCCAATTGCGGGTCTATACACCAGTAC^ 

ATGTCTGCACACCCCCACCGGAGTGGAATGTGCTTCACGTGGGGGAGGGGGCTGCCTTGCATGAGCCTTTATCCCCGTGTGGACTTTGTGATGGAAA 
AGCCTCCCGGAGCTGATTAAAGCCAGGCCCTGCTCGCTGGAAGGAAGCGCTGAGGTGTTTTCATCCCTTTCCCTCCTATCCCCGAAGACACCGCTCT 
TTTCTAAACCCTCCCTTTCTTGTCTTCCTTCTGAAAAACTCGAGCAGCTGCTCTGGGCCCGTGGAAGAGTTTCTCCCTTTGCTGCTCAGCCAAAGGC 
CAGAGCCGAGACAGACAGGGAGCTGAGCTGCAGCTGCTTCCGATGTGATAATGATGCCGGTCCCCTGGCCCTCCCACTCAGACCAAGGGCTTTCATA 
ACCCTGTGGGGAGCTGCTGTCACTCCTGTGGTTTGGGAAGTGGCTCCTTGGGGCCTCTTGCTGGGAACTTGGGCTGGGCTCCTCCAGGCGGGGCAAA
AAAAAAAAAAAAAAAAAAAAAAAAAATCGGTTGGGTTCTGCCCTGCCCCGGAGGAGTCTAGGGAGGAGGTGGATCTGCTTGGATCTGGAAGGGAGCT
TGAACGTCATCTCCCACAGGGTCTGGGGGAGAGGGTGGGTGAGCTGGCACCTCCCTCCACCTTGTTCTGTGGCTGGCCAAGCCTTCTCTGACCCTGA
GAGGAGTCTCTGGGCCCCATCTCCTAGGGAGGAGGGCAGCAGGCAGACTCCTGTGATTGGCTGGCTGGGAAGCTAGGGTACTGGGGCCCCTGACTGC 
TGTGAGTCGGGGAACTGGATTCCTCTTCCTGTCAAGCAGCTCTCACGGATGGGAGGCCTGAGTTGGGAATCCGGGAGGCCCCACAGCTCTTTGTGAT 
GAATCTAGTTTCCAGAGCCAGGGAGCACGATGTTTTTGAGGGATTGAGGCTGGGAGGTAGGAGACTTACTCTTTGTGAAGAGATTAAGGGTGGTCTA 
CATGGCTGTGTGTGAGGGGAGTTTTTCAGAATAGGCAAAGCCCTCCTTCTCCTGCTCGAGGCCCTGAGCTGAGAGTGGCCTGCGGTCAGAGTTGTGA 
GGGACTCCAAGAACTCCAGCAGTCTAAATCCACACAGCAGAGCAGAGGCCCCAGAAAGGAGGGGAAAATCCGACCAGAGCGCCTTGAAGGCAGGGAT
GGGTTGCTGCTGCCAGCTCCTCTGATCTTGGGGTAGGGCTGGGGTTTTTCATGTCATCTGGATACAGGGGAGCGTGCCCACCTAGTGAGGACTCCCA 
CACTCCTGGTTGCCATTGGGGTGCCCCTTTAGAGGCTGGATGCTATTTGGTTATTAGGATAGAGCCCAGGGTGCAAGCTTCAAGCTTCCTTGCCTTA
CTTCTGGCTCTAGTCCAGGCTGTTTCTGTGGGGTTCTGTAAGAAGCCTGGGGCTGACACCTAGGTCATGGGACAAAGAAGAGGGACAGCTGTCACTG 
GGGGGGTCCTGGAAGATACTGGGTGGGGCTGACATTTCTACCTCAGTGAGGAGTCAAACCCATGAGTTCCTTCC^ 
CTCCTGTGTGCAACTCGGAAGTGCAGCTCGGGCCAGGGCTAGGAACTTCATATGAGTAGACTGAGGT^ 

GAATGGTTATOTAGGCTTGGACAGGTTTCTCAGGGGTCATCTGGATTCATCCTGTTTGGCTTGGATGCCCTTGTCTGGAGACAGGACCCACCCGTGA 
TGTCCCACAAAGGGGGGTGTGGGTTGGGTTGGGCAACCTCCT 

CGATGTTGGGGAAGAAGAGATTTCCGAAGCCTGTGGAATTTAGAGTTGGTCTGTTTCAACTCACGCCCGCCTCAGACTCATGGGAGGATTAAGAGGG 
ATTAATTCAGGTCGTTCTAGCACCTAGTAACCATCAATAGATGCTTAAGTGTACGGCCCTAGTTCTTCCCTGCCCCAAACCX^GTCATCCCGCGGCAG 
CAGCCCCTGAAGCCAGAGGCAGAACTGTGACCCTTCTGTCAAATTCTCGGCCCCCTCGGCCGACTGCTGACTCCATGTTCCTTCCCTAACCCCCAGG 
AGTGGGAGkSTGCTGGTCCTGGGGAAGCTCAAGTGGGACCTGGCTGCCGTGATTGCGCACGACTTCCTGGCCTTGATTCTGCACCGCCTGTCTCTGCC 
CAGTGACCGGCAGGCTTTGGTCAAAAAGCATGCCCAGACCTTTTTGGCCCTCTGTGCra 

TGGGAAGGATACGTAGTAACAGCCAGGAGATGGTTTCAGTACTCAGACCTGTGTACAGTGATCTGGGAATACTGAATAAAAGTTACTTGCCTAAGGT 
CTTCATAGCAGGGTTTTGGATATTTTGTTGTTCCTGGTGCCAGGTACACCTTGGGTAGATG^ 

GGGCTGTGGTCTGGTGGGTCAGTGATAGGGTGCATTCCTCAGTCAGATTGCCAACACTGTGAGCACACACCAACC^AACCCCCTCCTCCCTAGGATC 
TACCTTGTCCCCTTGGCTC^GTGTCACCCCATTCACCACTGGGGGTTC^ 

ATTAC^CCTTTGCGATGTATCCTCCATCCATGATCGCCACAGGCAGCATTGGGGCAGCCGTGCTAGGCCTAGGCGCCTGCTCTATGTCTGC 

GCTCACAGAGTTGCTGGCCGGGATCACAGGCACTGAAGTGGTAAGTCTCAGCCCCTCGCCCCCCTTCACAAAATTAAAAAAAAAAAAAAAAAAGCTG 

TTCCTCTGAATCTCCAGCAGAGGGCACCATCCACCCGGGCCCCCAACTATGATTGTACGCCCTTGCCTGCTTCTCTGGCCTTTTATCTCTGGCTCCA 

GTCTGGAGAGGAGTGGGTGGGGTGAAGGACACTTGGCATGTAGATACCCTTCTCTTGACATCTCTTGACAAGTGCTGTGTCTCGGACAGGGGTGGGG 

GACTTGGTGTCAGATGGAGGGTTGCTCTCACACCCTTGCCACATCCTCTTGCCAGTTTCTGAGAACTAAAGACCTGATTCCTGGGCTGGGCTGTGGC 

CTAACCTCCCCAGACTTCCCAGTGCAC^GGGGGTGGCCTTTCCACCTGCTTGCTGTCAGTGCTGTGAGGGAGAGAGAGAGGCCCAAGGATGAATGAC 

CTTGTTTTTTTTTTTTTTTCTAGGACTGCCTGCGAGCCTGCCAGGAACAGATCGAAGCTGCCCTCAGGGAGAGCCTCAGGGAAGCTGCTCAGACAGC 

CCCCAGCCCAGTGCCCAAAGCCCCCCGGGGCTCTAGCAGCCAGGGGCCCAGTCAGACCAGCACTCCCACAGATGTCACAGCCATTCACCTGTAGCTT 

GAGACAGGCCCTCTCAGGCCACCAAGCAGAGGAGGGGCCCCTGCCACCCCCTCCCTGCCTCTAGAACAATCCATGCTATATCTGAAGCCCGAGGGGG 

GCTCTCCTTCCCCCTCACAAAGCCCAAGGGGCCAGGTCCTGCCTATCCCCACAGTGTGCACTAAGGGGTTGCATGGTCATGAGGGGCGTCTTCA^^ 

CCAGTCAGCTCCTCTTCCTTCCCACTCAACCAGCTTGGCTGTCCTGGGCCATGATGGTCAGAGAAATACAAACAGGTAAAATCCACACACCAGCATT 

TCTTTTGAGTCCCTCTTCTGTCCGGGGCTCCAACCTTCTCAGTTGCCAAAACGCCCCAGTACCTTCCAAAGGTGTTGGCCCTTGCAGGCTCCTTCGG 

GCATCCCGATAGAAGCTTATGAGGAGCTGCCCTAGArGGCCTCTGTGTAATCCGTACTCCAGCTGCTCTTAGAGGGAACAGCCTAGGCTTTGGCCAG 

AGCAAGAACCCATACACTGTTGCTTTGCTTGCTGCTTAGCTTCTGTGATTGTGGGGTCTTAAGGGTGCCGTGGTCATTTTAATTTATTGCTTTGAAT 

ACAACTGTAAGAGGGTACAGTGAGGCCTGTACCCCACAAGTGGTGGTAACCCTGGCGGTTGCTCTTTCCCTCCCTTCTTCCCCTCTGCTACTGCTTT 

GTGGCCCAGGAGCTGCTACAGCCTGGGATGGGGGTCACGCCTTCCTCTCCTCACCCCTCCACCTCATCTTCATCAGAGCAGGGTTAGGTTGGGATGG 

ATCGATGCCGTGGAGGTGACAGAGCTATCTGGAGAGAGGGCAAGCCCTAGGGTCACAGGTCTTTCCTCGGGCCACAAGGTTTGGGCTGGTGGCCCAT 

TTCTATCATGCTGCCTTAATAAAGATTTCGGAATAAAGGTGGCTCTTGTGATCCCTGGGGTTGTGGGAGCTGGGGGGCACCAGGCAGCTGTCTTAGG 

CAGAGCACTTTTCCTGGGAAGGGTTAGAAAGGTGCCGCCTAGGCAGCATGGCCTGCTTTCCTTGTGGACGAATTACAGCGCTAAGGAGCAGCCAGCA 

TTACCCAGAGGGCTCATCGGGAGACAGGGCTGGACCCTTACACAGAACCCTAAGATTCATACGACATCAGGCATGGTGCTTCCTTAGCACACTCAGG

GCAGACAGAGGAAGAGGGCTAGCCTGTGCTACATAGTGCAGAACTTTGGCTCAGCGACAGCACTGGCTACTCCTGAGAGCCTGGACTCAATTCCCAG 

CAGTTAACCATTTTAAACTCCAGTCTCAGGGATCTGATGCTGAGGCACAAAGCTCAGAATGTATTAAAAAAACAAAACCGGATGAATGGGGTTAGGG 

TGTGGCTGGAAGCACTTCCCCTAGCAGGGAGGCAGGCCCTTCCTCAGTGGTGCTGCAAAGGAGCAGCCCTGCATGGGAGAGGAGAGGAGAGGAAGAG 

GATGATCTACTGTGTAAGCAAATATGAACTCAACAGACCAAACCTTCTCAGATATACAAACTGGATTGTTATAGACACACGTTTTCAAAAGTGAACG
TGATACAGCAAGGTGACTTCAAGTCAGTCAGCTGTGATGCAATGGTCTAAGGGTTTTTGTTGTTGTTTTTGGTGAGGTAGTGTTTCACAGCAGTGGC
TGGCCTCAAACTTACAGATCCATGGTTGGCTGCTACTAAAATTTCATACAGATTTTCTCAGGCCATTAAAATAATACAAAGATTGTAAGCCTAGCCC
TCAAAAAGTTGAACCATCAGGCAGGTGGGTGGCCAAGTTCAAGGTCAGCCTGGTCTACAGAGTGAGTTCCAGGACAGGCAGGGATACACAGGGAAAC 
CCTGTCTTAAAAAACCIAAAACCAAAACCAAAACCAAACCAGACAGTATTCTTTGCTCATTAGTTATGCAGACACAGG 

CCAGCCTAAAGCTCTGGTTGCTTGATAGCTAGGAAGAAATCCCATTTTTTCCTGATTCTGTTGTGTCACTGTCCCCTAGCAAGGGCCTGCAATATAG 
TCTTCCCTGACAGTGAACCACTGCCACTTTTCCAACAGACACCTTTCCAACATGGCAGCAGAATCTGGCATCCCTAGAGTCGGCCCTGACTGTACGT 
CATGGTAGGAGTCAGGGGCGCGTGTGCATACATTTCCTGTGAAGAACAAAGGAAGCAAGAGTCTAGAGACACAACTTTGAAGAATAAGGTGGGGTGA 
GGTTTCTGTGGGAAGTAAGAACCCGCTCTTCTGAGCTGGGGACAGGGAGGAGCCAGCAGTGACTCCCACCTCTCCCGTGGGTGACAGGGGAGAGAGG 
CAGGGACCTCCAGGGACCAAAACACAAGAGCTGCGATAAATAAATGCCTGTTTTCTCCGCACCCCAGCGAGCTTACACAGTATCTGGAGAGAGCAGA 
CCTTATACCAACTGCCAGGTGGGTTGTGGGGTCGCTGAACCGAACTGAGGTAACTCCGGTTAGCTTCCAGAAGTGAGCAGGGACCAAAGCCTAAAAT 
GGCAGAGAAAGCCAACCAGATGCAGTCCTCCCTGCCCAGGATTAGGCCACGGCAGCCATAGTAGCCCTTCAGTGTCTTGACCTAGGTCTTTAACTAT 
ACCAGCTGGGGTCCTGGGGTGGTCCTGCGCCCCCTGGGCCAACACAGGTGACTGTTCTCATTCCATGGTGACACCACCATCCTCCACGTCTCTGGGG 
ACAGCGCTCTGAAGCTCACGCCTGATCTCAGGGGACAGCTGCGGGTGGGGCTGCAGCCGAAGCAGCTCCAGGAGGGCCTCCTTCTGCTCTGTGGCCA 
GGTCAGCCTTATAGCGCTGCGCTAGTGTCAGCAGGCACTGGTGCCAGAGGACAGGCAGCTGGCGCTTTTCCGTCCGGAAGGCCAGAAAGTGGAAGAC 
CAGAGCGTCGAGCACGCGGTAGGGCAATGCGTACTTCTTATCCAGCAGCAGGCGCAGGAAGATGCTGTTGGCACCGCTGTATTCCATTTCTGCGATC 
TTCAGCATGGCCGCACTGGAAGGGAAGGTGGAGTCACAGGAATTAGGCCACACCTATGGGAGTCCTCTTCTTACACTGAGCCTTACTTCAGTGTTCC 
CAAGGGGGCCACCACAGGCTGCTCACCTGTTGATTACTATGCTCCTGAGAGTTTCTACCATCTCCATCCATCTGCCCCACCCACACCTCTGCCTCTG 
GCTCGGAGAGCAAGCAATCCCTGTGTTTAACTACAGATCATGTCCAAATGTCTCACGATCCGCACATTTTAGCACACTTTCTTCCTAGATGCCCATT 
TACACACGCAGTCATTCAATTATACTAGCTACCAGGACCCTGGCCCCTGCTCTTGAAGACCTACTATTCTGTTGGAGTAATCATACACCCACTGACA 
TAAACGGCTAGACTCGAGGTAATCATAGCTCATTCTCCATTCGCACCCTCACGACCMGTTCCGGCCTCTCTATCACACGGCTCCTCTCTGACCATC 
CTCCTCCACCTCCCCACCACTACCTGGAGTGCAGCACGGGGATGGAGCACTTGGTGATGATGCTGCCCACGATGATGGCTTCACGGAGGGTACAGGT 
GCCGGACTCACACAGTGGGATCAGGATTCCTAGGCAGAGTCAGATGCAGTGGCTTTTAGACCGGGGTGAGGGAAATGAGGCGCAGCCCCCAGCAGCC 
CCAGGCATCAAGGACCCCCAATGATCTGCCAAGTAAGAGCCCCACCCCCTCCTTTCTCCCCTGCTTCTGCTCTACCTTTGAACCAGGCTCCAGGCTT 
GAACAGGGCCTTCTTGAGCGCCATGTAGAGGTGAAAGTTGAGTCGTTTGTATTCAGCAATGTCATCTCGCACTCGGGGAAGCAGGACAAGGTTGTAG 
AAGCGCTGAGCCATGCGCTCCTTCAAGTTAGAGGCAAAAATCCTAGCAGGGTAGGAGTTTCCAAGTTACCAGAGAGCCTGACCCTGGCCCTTTAAGA
ATCATCGCGGATGGTGCATTCGAGAGGTTTTATAGTCAGACACGGGGACAGCGTGTTTTATAGTCAGAGCTGTAGCATGGGAGTGAAAGCTCTTCCC 
ACTGAAGAGTACCACAGGACTTGGTCATGAAAAACCTGGTGAGATGGCTCAGTGGGTAGGAGCACCCGACTGCTCTTCTGAAGGTCAGGAGTTCAAA 
TCCCAGCAACCACATGGTGGCTCACAACCATCTGTAACAAGATCTGACGCCCTCTTCTGGAGTGTCTGAAGACAGCTACAGTGTACTTACATATAAA 
AAATAATAAATCTTTAAAAAAAAAAAAAGAAAAACGAAAGAGCAGGGGGCTGGAGAGATGGCTCAGCGGTTAAGAGCACGGCCTGCTGTTCCAGAGG 
CCTTGAGTTCAATCCCCAGCAACCACACAGTGGCTCATAGACATCTATAATGAGATCTGATGTCCTCTTCTGGCCCGTAGGCATATATGTAGACAGA 
ATACTGTATAGATAATAAATAAATCCTAAAATAATAATAATAATAATAATAAAATAAAAACCAATAGACTATAGTTTCCCCTCTAAAAAACTAAAAG 
CCAAAAACCCATAAGACTGAAAGTGACTGAGTTCAAGGCCAGCCTGGTCTACAGAGTGAGTTCCAGGACAGCCAGGGCTACACAGAGAAACCCTGTC
TCAAAAAAAAACAACAATAACAACAACAACAAAAAAAAAAGACTGAA 

TTGAATATGCAGTCACACCTGAAAATAGGTTTAAAATCCCCAGAAAGAACAAAGTTAAGGGAGCAGCAGGAGCGAGTCCAAGGGAGGCCCATGGGCA
ACTGAACAGCTCTGGCAGGCAAAGCCCGGACCCCCCACAGCCCACA 

AGGCCTCAGGCTCAGTGACATAGAGGATCTGCTCCCAGTTGGACAGTGCAGGGATGACCTTAAAAGCCTTGGGCAGTTTCCCACTGCGGTACTTACA

TAACACCTGGGGTGAGAGAAGCAAAATCGCCAACTGTGCTGGGGTCCTGGCCCTGTTGCTCACTCATCTCCAGAACCATCTAGAA^ 

CGGGTGACCGGAGTTTAGTAGACTTCCAGGGTCAATAAGAGACAGTGCAAGAACCTGAGCCTATCAGCTCTCCCCCAATCCTGTTCTGATCGCCATG

GCTTATTTGGTGGGGGCCATGGATGAGAGCCCACCCCGAGTCAGGATATCCCAGCACACACACCACTTAGCTCTTACCTCCCGGACTCCTCGGTAAA

CTTCCAGGACCCGGGGATCCAGCTGAGGCATAGGGAAGCCCGATACCTCTGACATGACAGTCTCCACCTCTGTCTGCTTCTCCGTCAACTTCTCCAT 

GATGATATCAGCCAGGGTGCGCCTATGGAGACACGGGAAAAAGAGAGGTGGAAAGAATTGGGGTCACTTTCCATGAAACTGTAACGTCCCCTCAACC 

TGTTTTTTATCTTTGTTGTTGTTATTTGACACAGGGGTTTTATGTCTAGCTTTGCCTGTCTTAGAACTGGCTCTGTAGACAAGGCTGACCTCAGACT 

CAGAGATCCACTTGTTTCTGCCTCCTTTGTGCTGGGATTAATGGTATGCCTGGCTCTGTTTATTAAAAAAGAAAGGCAGTAGTGGTGGTGTAAGCCT 

TTAATCCCAACACTTGGGAGGCAGGCAGATCTTTGTGAGTTGAAAGGTCAGCATGGTCTTCATAGGGAGATCTAGGCCAGACAGCAAGCCCCGTCTC

AGAGGGGAAAAAGAAAGTCACAAAACATCCCCAAACAAAACAAGAATGCTTCACAA 

TGTTCTAATTTTAGTGTACGGGAAGCTGGAAGCGAGCAATGTATAATTCCCCTGGACGCTGACAAATCTCGATCCCGCTGGGATGCACAGCCTCAGT

TCCCCTCTATGTCCATTTAGCCGCCTCAAAGTGCTGCAGAGCACGGGCCCTGGATGTCTCTGCATCTGACGCAGAGCCAGAGTGGTGGGGACGTTAG 

GCCCTTAGGCCCTGAAGTCAGCTCCGGCTCTTCTGGTCCTTGTATGATCCTGGGCCCGTCACTTGACTCCTCTAAGCCTCAGTTTCCTCATCTGTAA 

ACTGGGGAAGTAAAAGGACCCCCCTGGACAGTCAGCTGAAGGGTAAAATOTAAGACTGCAGCAGGCCCGACTCTCAGCCAGAGCCAGCAGGTCCTGA 

GGGCTGTTTACTTCAAGCTCACCTCCCACTGCTCACACAACGGGGAGCAACTTCTGAGTGCAGAGTGGAGCTCTGTGCCAGGCAGCGCTGGTGCACA 

CCTTTCATCCCAGCGCTCAGAAGGCAGCGGCAGGCAGAGTTTTGTGAGTTCAAGGCCAGCCTATCCTACTCACCCAGCACTTGCCTAGCAAGGATGA 

GACCTGGGTTCTGGCTTCCAGCACAAGGGGAAAAAGTTTCTCTGCAGTCTTTTTCTGTGCCGAGCAGCACTGAATCTGACACTTTCCAGACTTGTGT 

CCTGTGGTCTGCATGACACTGGTGGCAGGTGCTTTTATTCCCCCGCACTGCCCAGGCCGGGGAATCGAGACTCAGGGCCAACCAACAACTCTCCTAA 

GCCCTAAGACTTTCTCAGGGAAAGGGTAAATTTTAGTCTTTCGACTTCTAGTTTGGGTGCTTCGCAGCGTCTTCAATGTGCCTTAAGTGACAAGGCC 

TGATTGCTGTGTTCCCAAAGACCCTTAGCCTGGTTTCTGAAGATGCCAGCAGCCACTGCTCAAGAGTGCCCACGGTCCCCCTTTCCACCATGTTGTC 

ACTTCCGTATAGCAACTGCAGAAAACATTCTGCTAAGTTTGTCAGATCAAGAGCCACTGGGTATGAGGTGCTGCCCGAGAGCCTTTGGGGGGCACAT 

CACACCCCATGACCATTCTCAGAGCTGTCAGACACCTTCAGGGGACAGGGTTCTCCTGTGTTCTCTGAACCCAAGCTGCTCATCACCGTACTATTAA 

TGATAGTGGTGTACTACACCCCACTCTGAAGAATAAACTGGTTGAGCCCCTTTGCCCTAAGCCCAGTCCTCTCAATGAGGCCAGATCCATCTGAATG 

GAACCAAGAATCTGACCACAGGCAGCCTAAATAACTGCATGTTTAGAAGTAGCATTTATAACACATGGGGGTACGTTTGTGGTTCTACCCAGAGGTA 

GTTTTAACCCCTCTGGGAACATTTTGTCAGAGATAGCTGCTATTGGCCCCAACACTGTAACTGAGATGGAAAACCCTGAGTTAAAGACTAAGCTATG 

GGCTGGCGAGATGGCTCAGCAGGTAAGAGCACTGACTGCTCTTCCGAAGATGCTGGGTTCAAATCCCAGCAGCCACATGGTGGCTCACACCCACCCG 

TAATGAGATTTGGTGCCCTCTTCTGGTGCGTCTGAAGCCAGCTACAGTGTACTTATGTATAGTAAATAAATCTTTGGGCCAGAGCGAGTGGGGTTGG 

CCAGAATGAGTAGAAGTCCTAAAATTCAATTCCCAACAACCATATGATGGCTCATAACCATTTGTACAGCTACAGTGTACTCACATACATAAATAAA 

TTAATCTTAAAAAAAAATAAGACTAAGCTATGAGAGTAACGTACCAAGAATAAACACTACTAAACGGATCCCCAGCTATCACAAGGAACCGGTAATC 

CAGAGGTTCCAGATCAGATGGGCAAAGGGGAATCCGGGTCACTCCCGTTCAGGGGCCACTCTTGGCAATGGTCAGACCAACAGCTGTGGTCTGGATG 

AACTTTCTACCAGGACCCACTGGTCCAACCTTTAAGCCTTACCTCACGGGAGGATTCTTGTTCATGAACATTTCAATGGCACGCTCATCT 

CCACGATCACCTCTGCCTGGTGGTCCACCCCGGCCATTTTGGCTGCCTTCTCCAGGGTAGGCCACTCCTCATCCTCCTCATCTGATCCATCCTGTGG 

CAATCCCGGCCCTACAAGATAATCCCAGCCGCAGGGGCTTGTGAGTCAGTTCCCCATTCCACGTACAGCACACCCAGGGAGCAGGAGGGACACCTGC 

AGAGGGACTATCCTTTTGTCCCCAGAGACTTGGAATCTTGATAGAAAATACAGCACACGAAGCGAGAGATGATAAGGCCAGTAAAGACTGGATGACA 

TGTCCCCTCAGTGAATGGATGACATGTCCCCCCTCAATGACCAAGAAGGATCCATTCTATACTACACACATGTAACCTGGTCTGTCCAGATGACCTC 

AAGGAATAGGTACTGTTGCTATACCCAGTTCAAACCGGGGCAGAAGAGTTGATGTGCCCACAGTAGGTAGGCACTGTGGGTCCTGACTTCCCACTAT 

GTTGCTTGCACAGTGGAACTGCATCATGTGCTGAAACTAAAACCATGGTGGAGGCAGTGTGCACCTCAGGCACTTGGATGAGTACAGAGGCTGAGGA 

GGCTCTGGATGGGCAGTCTGGGAGCAAGAGCATGCTCCACGTCCTGGAAAGTAATGCTGTTAGAGTGTTGGGCACTAAGGAGCCAGAAAGGAAACGG 

AAAGCCAACCTGGAAGGCTCTAAGGCTGGGCAGTGGCATTCGGACTCTGATCAACCAACACATTTAAGTTACTGGCTTAGGTAGTCGAGGATGACTT 

GGGAGTTCCTTGTTCTCGCGCCTCCACTTTCCAAACACGCACCACAGTGCTGAAGGCTGAGCTTCTGATGTTGGGTTAATCCCGTGTGTTCCATGTA 

AAGGCTGCCTGTGTACTATTCACATGTTAATCTTTGTTCCAGCAGAGAGCTGAGTACAGTCGGCATTGGTTTTTTTTTTTAATGAAGCATCACCTGT 

CTCAGTGTTGTGGAAACTTCTTCTCCATCACCTTCTGATTGGTTTAATCAAAAGCTGAGTGGCCCATAGCAGGGCTGGAAAGGATAGGCGGGACTTC 

TGGGCAGAGATAGGAACTCTGGGAAAGAAATCAGATGTGAGGGGACTCAACAGCCAGACACAGAGGAAGAAACACGTGCTGTTACTGAAGAGCAGTA 

ACGAGCCCCGTGGTTGAATGTAGACTAGTATAAAGCAATTATGTTGTATCAGCGAGTTCGGAACAAGGCAGCTAAGGCCAAGCTTTCATAACTAATA 

AGAGGTCTCTGTGTCCTCACTTGGGTGATAGGTGGGTTGAGCCATTCCTGCCACATCCTGAGGTCCTGACCATGGAGTCTTGGCTGGGTACTTCCTG 

AGTGCTGGGATTAAAGTCATGTGCCAATACACCGGGCTGACTTTTGAAATACCGCACATAGCTGGCAAGGCAGTTGGGAAAGAGTCTAATAAACCAA 

AGTAGTTGAATTATTACCATAGCAACAGATCAGTTGCTATGGAAGTGGACTAAGGGTATATTATATAGAGCTCCAGAGAGTACACAGAGCATACCCA

GTGAGAGCCAAGGTTGTTTGTTTTAGGTTTATTTTACTTTTAATGTTTGAGTGCTGACTGCACGTATGTCTGTGCACAAGTGTGCCTGGTGTCTAAT 

GACATCAGAAGATGGCATTTGATCCCTTGCAAATGGAGTTACAGACAGCTATAGCCJTGCCATATGAATGCCAGAAACCAAACTTGGGTTTTCTGCAA 

GAACAAGTGTTCTGAAGAGCTGAGGACCAACTCTTGAGAGTTTTCCTCTGTTCTCTACACACATCATGCCACGCACATGCACACAAAATACATGTTA 

AACAAAGGAAGACAAGAGCTATGAAATACGACGTAGGGAAAAGAGAGAAAATCAGGAGACAGTCACTGACATTTGTGTGGTGTTTTGTTGTTGTGAT 

CATTTGCTTTTTGTATATGAGTGAACCCTGCATCATGGAAGTGACTCTTGGGAGCTGATGACCTTGTTTTTGAAACAGGATTTGACTGGGACCTGGG 

GATAGCTGATTAGGCTAGACTGGTTGCCCAGTAGGCCCAAGAGATCTGTCTTCTACACTGTAGCTCAGAAACTACAAGCTGCTGTGAGTGGAAGTCA

GGGCTCATGCTCTCTCAAAGCATGGACCTCACCAGCTGTGAGCATTATCTTATCCTGTCCCTAGCTTGTGGTCAGACCTGAAACAGAGACAGACCAG 

ACAGGACGCCATCAATCTCTAGCTGCTGAATGGACCACAGAGGTCACAGAATTCAAAGCACGATGTTTCCACAGTGATCCAGGGGAGAGACCCTGGT

AGCTTAATCAGGGAGTGTCTCAGGTGAATGGAAAGTGGTCTGGCTCTGAAATACTCTAAAGACAACATCACCAGGATTTACTTTGAAGTGAGAAAGA

GGAAGCCTGGACTCTGACTGGAGCAGTGGAGCTAATGTGCTAACAACCTCCAAGTAGCAAACAGAGAGCAAGTTTGGGAAGGAAGTCACAACTTCTG

TTTTAGATGCAATCTTCTTCTTTTATTTTTTTTTTTCCTTCTCCGAGAC!AGGGCTTCTCTAGGTAGCCTTGGCTGTCCTGGAACTCACTTTGTAGAC 

CAGGCTGTCCTTGAACTCAGAAATCCACCTGCCTCTGCCTTGTAAGTGCTGGGATTAAAGGCGTGCGCCACCACTGCCCTGCTTGGACGCAATCTTA 

CTAGAGTTAGATACTAGTATTCAACTTGGAGATGTCAGGTAGACCACTGGATATAAAGATGTGAGCTTTCTTGGACGAACTCTGAGGGAGAAAAAAC

TGGAAGTAGCTGTCAAACAAATGGCCTCAAAGTGAAATTCCCTGAC 

MOUSE SEQUENCE - mRNA 

AGCGCCAGCGCCAGACCCGCCCGCACCCCGCGCTCTCCCTCACGCGCGTCGCTTCTCCTAGGACTCGCTAACCCGCACTCTTGCTCTCACCCGTGAG 
CCATCGCAGGATGGAGCTGCTGTGTTGCGAGGGCACCCGGCACGCGCCCCGGGCCGGGCCCGACCCGCGGCTGCTTGGGGACCAGCGTGTCCTGCAG 
AGTTTACTCCGCCTGGAGGAGCGCTACGTGCCGCGAGCCTCCTACTTCCAGTGCGTGCAAAAGGAGATCAAGCCGCACATGCGGAAGATGCTGGCAT 
ACTGGATGCTGGAGGTGTGTGAGGAGCAGCGCTGCGAGGAGGATGTCTTCCCTCTGGCTATGAACTACCTGGATCGCTACCTGTCCTGCGTCCCCAC 
CCGAAAGGCGCAATTGCAGCTTCTAGGTACCGTCTGCCTGTTGCTGGCCTCCAAGCTGCGCGAAACCACGCCCCTGACTATTGAGAAGCTTTGCATC
TATACGGACCAGGCTGTGGCTCCATGGCAGTTGCGGGAGTGGGAGGTGCTGGTCCTGGGGAAGCTCAAGTGGGACCTGGCTGCCGTGATTGCGCACG 
ACTTCCTGGCCTTGATTCTGCACCGCCTGTCTCTGCCCAGTGACCGGCAGGCTTTGGTCAAAAAGCATGCCCAGACCTTTTTGGCCCTCTGTGCTAC 
AGATTACACCTTTGCGATGTATCCTCCATCCATGATCGCCACAGGCAGCATTGGGGCAGCCGTGCTAGGCCTAGGCGCCTGCTCTATGTCTGCGGAT 

GAGCTCACAGAGTTGCTGGCCGGGATCACAGGCACTGAAGTGGACTGCCTGCGAGCCTGCCAGGAACAGATCGAAGCTGCCCTCAGGGAGAGCCTCA
GGGAAGCTGCTCAGACAGCCCCCAGCCCAGTGCCCAAAGCCCCCCGGGGCTCTAGCAGCCAGGGGCCCAGTCAGACCAGCACTCCCACAGATGTCAC 
AGCCATTCACCTGTAGCTTGAGACAGGCCCTCTCAGGCCACCAAGCAGAGGAGGGGCCCCTGCCACCCCCTCCCTGCCTCTAGAACAATCCATGCTA 
TATCTGAAGCCCGAGGGGGGCTCTCCTTCCCCCTCACAAAGCCCAAGGGGCCAGGTCCTGCCTATCCCCACAGTGTGCACTAAGGGGTTGCATGGTC 
ATGAGGGGCGTCTTCATGGCCAGTCAGCTCCTCTTCCTTCCCACTCAACCAGCTTGGCTGTCCTGGGCCATGATGGTCAGAGAAATACAAACAGGTA 
AAATCCACACACCAGCATTTCTTTTGAGTCCCTCTTCTGTCCGGGGCTCCAACCTTCTCAGTTGCCAAAACGCCCCAGTACCTTCCAAAGGTGTTGG 

CCCTTGCAGGCTCCTTCGGGCATCCCGATAGAAGCTTATGAGGAGCTGCCCTAGATGGCCTCTGTGTAATCCGTACTCCAGCTGCTCTTAGAGGGAA
CAGCCTAGGCTTTGGCCAGAGCAAGAACCCATACACTGTTGCTTTGCTTGCTGCTTAGCTTCTGTGATTGTGGGGTCTTAAGGGTGCCGTGGTCATT 
TTAATTTATTGCTTTGAATACAACTGTAAGAGGGTACAGTGAGGCCTGTACCCCACAAGTGGTGGTAACCCTGGCGGTTGCTCTTTCCCTCCCTTCT 
TCCCCTCTGCTACTGCTTTGTGGCCCAGGAGCTGCTACAGCCTGGGATGGGGGTCACGCCTTCCTCTCCTCACCCCTCCACCTCATCTTCATCAGAG 
CAGGGTTAGGTTGGGATGGATCGATGCCGTGGAGGTGACAGAGCTATCTGGAGAGAGGGCAAGCCCTAGGGTCACAGGTCTTTCCTCGGGCCACAAG 
GTTTGGGCTGGTGGCCCATTTCTATCATGCTGCCTTAATAAAGATTTCGGAATAAA 

MOUSE SEQUENCE - CODING 

ATGGAGCTGCTGTGTTGCGAGGGCACCCGGCACGCGCCCCGGGCCGGGCCCGACCCGCGGCTGCTTGGGGACCAGCGTGTCCTGCAGAGTTTACTCC 
GCCTGGAGGAGCGCTACGTGCCGCGAGCCTCCTACTTCCAGTGCGTGCAAAAGGAGATCAAGCCGCACATGCGGAAGATGCTGGCATACTGGATGCT 
GGAGGTGTGTGAGGAGCAGCGCTGCGAGGAGGATGTCTTCCCTCTGGCTATGAACTACCTGGATCGCTACCTGTCCTGCGTCCCCACCCGAAAGGCG 
CAATTGCAGCTTCTAGGTACCGTCTGCCTGTTGCTGGCCTCCAAGCTGCGCGAAACCACGCCCCTGACTATTGAGAAGCTTTGCATCTATACGGACC 
AGGCTGTGGCTCCATGGCAGTTGCGGGAGTGGGAGGTGCTGGTCCTGGGGAAGCTCAAGTGGGACCTGGCTGCCGTGATTGCGCACGACTTCCTGGC 
CTTGATTCTGCACCGCCTGTCTCTGCCCAGTGACCGGCAGGCTTTGGTCAAAAAGCATGCCCAGACCTTTTTGGCCCTCTGTGCTACAGATTACACC 
TTTGCGATGTATCCTCCATCCATGATCGCCACAGGCAGCATTGGGGCAGCCGTGCTAGGCCTAGGCGCCTGCTCTATGTCTGCGGATGAGCTCACAG 
AGTTGCTGGCCGGGATCACAGGCACTGAAGTGGACTGCCTGCGAGCCTGCCAGGAACAGATCGAAGCTGCCCTCAGGGAGAGCCTCAGGGAAGCTGC 
TCAGACAGCCCCCAGCCCAGTGCCCAAAGCCCCCCGGGGCTCTAGCAGCCAGGGGCCCAGTCAGACCAGCACTCCCACAGATGTCACAGCCATTCAC 

CTGTAG 

HUMAN SEQUENCE - GENOMIC 

CACACCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTCTGTCTCAGAAAATAAAAAAAATAAAAAATAAGTGGGATCTATTATTATTACTACCTA 
ATAGTATTATTACTATTATTGCTCTGGTGTGCTATCAGGGGTGTTGTGCCACACTGGGGACATACTACTCCTGACCCCCCACCTAAACTTTCTTTGG 

GGAGTGCAGTGACATGATCTCAGCTCACTGCAAGCTCTGCCTCCCGGGTTCACGCC^ 

GCCTGCTACCATGCCCGGCTAATTTTTTGTATATTTAGTACAGACGGGGTTTCACCATATTGGCCAGGTTGGTCTCGATCTCTTGACTTCGGATCCA 
CCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCTGGCCTACTCCCACTTAATCTCTAAACTCCTGCAATCCCATTTCTT 
TCACCATGGCTGTTTCATGGAATTCAGGTGATGAGAACCACCAATCTCCACCAGTCCC^ 

TACTTCTGATGGGTGTTGCAAATGCCGAGGTAAAAGTCTTCTGTCTTCATGACATCITTAGGATTCCTGCATCATGGAGGATCATCATATCATGCACT 
TTAGACTGGAAGCTTAAAAGGGTGGCTTCTCGGCCCAGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGA 
GGTCAGGAGATCAAGACCATCCTGGCTAACACGGTGAAACCCCAACTCTACTAAAAATACAAAAAATTAGCTGGGCGTGGTGGGGGGCGCCTGTAGT 
CCCAGCTACTCGGGAGGCTGTGGCAGGAGAATGGCGTGAACCTGGGAGGCGGAGCTTGCCGTGAGCTGAGATCGCGCCACTGCACTCCAGCCTGGGT 
GACAGAGAGAGACTCCATCTCAAAAAAAAAAAAAGAAAAAAAAAAGGGAGTCTTCTCATCTCTGACCCATGTCCAGCATGTCCCAGGGCTTCCGAAG 
GCCTGCCTGATGTATGGAAGGCACATGATAAGGCACCTGATTGTTAATGACCACAGCTATCTTTATTGACCACCTACTATGAGCCTGGGACAGCTAA 
AGTACTTTTCCCTTATTAATCCCCTCATTCACCATATGAAATAGGAGTTGTTACCATCCTCCTTTTGCAAATGAGAAGGCTGGGACGGAAAAGTTAA 
GTAAACTGCCCAGGATCACACAATCACACAGCTGGC^AATAGCAGAGCTGGGCATCAACCTAAATCCAGAGCTCACTTACTTA 

CTAATGCCAGTGCTATCTCCTGCCTAGAATGAGGAATGCTTGCCAGCCAAGAACCGAAGACAGCTCTCTTTTTGCAGGTATATTGCTGTAGCAGGTT 
ACCCTTCCACACCTGAGGCCCTGTGTGCACCTCAGTCTTACCTGTGGAACAGGAATCTCAAGCAGAATGACAGAACAGGAAAGCCTCATTTATTCAG 



TGGAGTGCAGTGGCGTGATCTTGGCTCACTGCAAGCTTCCCGGGTTCACGCCATTCTCCTACCTCAGCCTCCCGAGTAGCTGGGACTACAGGCACCC 



GATCCGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCTTGAGCCACCACACCCGGCCCTGTAGGTTGTATTCTTATTTTATGAGGAGAGACT 

TGACAGTTTCAGGCTAGTACCTGAGGTTTCTTTTCCCAAGC^GCTTATAAGCTAGAAATTGCAAACTGCTTTTTGTCCCTCAGACCTGGTATATAGG 

CATATTATGTGTGGCTTGCACGGTTTTGTTACTGTTTTTCAACTCAAGTAATATAGATATTTTAGGAAATTTCTTTTTTTTTTGTTTTGAGACGGAG 

TCTCGCTCTGTCGCCGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGCGCCTGGCGGAAATTTCTTTTCTTTTGAGACACAGTTTCATTCT 

GTCCCCCGGGCTGGAGTGCAGTGGCACXjATCTCGGCTCACTGCAACCTCTGCCT 

GGGATTACAGGCGTGCGTC^CCATGCCTGGCTAATTTTTGTATTTTCAGTAGAGA 

CCTCGCTGGTCTTGAACGACTGGCCTCAAGTGATCCACTCACCTCAGCCTCCCAAAGTGCTGCGATTACAGGCATGAGCCACCACACCCAGCCTTTT 
TCTTTTCTTTTGTAGTAATGGGGTCTTGCTGTGCTGCCTAGGCTGGTCTTGAACCCCTGGCCTCAGTGATCCTCCCAACTTAGCCTCTCAAAGTGCA 
GGGATTGCAGGCGTGAGCCATCATGCCCGATCTAAGAAATATTATCTTGGGGAAACAGGGAATCTATGGCCACACTCATTACTATGGTGTAGCAAAA



TTTATCATGTGGTGTATGCTGGTGTTTTTCTTT1 
GTGGCACGATCTCAGCTCACTGCAACCTCCGCTTCCCGGGTCCCGGTTCAAGCAATTCTCTTGCCTTAGCCTACTGAGTAGCTGGGATTACAGGCAC 

GCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGG03TGAGCCACCGCGCCCAGCCTGGTGTTTTTCTTATATCCAGTTTGTTTTATTCATTAACT 
CAAAAAAGATTTATTGAGCCAGACGTGGTAGTGCACGCCTATAGTCCCAGCTACTTGGGAAGCTGAGGTGGGAAGACCACTTGAGCCTGGGAGATAG
AGGCTGCAGTGAGTCGTTATCATGCCACTGCTTCCAGCCTGGGTGACAGAGTGAGACCCTGTCTCAAAAAAAAAAACAAAAAAAACCAAAAAACTTA
CTGAACATCTACTATGAACTGGATACCATCCTGTTTTGGTGGTGTGGTGTGGGAGTAGACAAGCTGTAAACAGAATATGCAAGCAAAGTATATGCTA 
TACTAGTGATCAGTGGTAAGCAGAAAAATAAAGCAGGGAAAGAGCATAAGAAGTGTCAGGGCTGGGGAGCTGGCATTAAAAATTTAGGTAGGATGGC 
CAGGAAAGGCTGTAAGAGGAATGCAACTTTTGGACTTCCTGAAGGAAGTGAGGGAGTGGGCCATGTGGATATCCGGAGGCAGAACATTCCAGGCAGA 
TGGAACAGCACGTGCAAAGAGAACATGTCTGCCCGGTGTGTTCCATGAACCGTGTGGCCAGAGCAGAGTGAGGGAGAGTAGTGGGAAATGGGGTGGG 
AGTATAGATGCTGCAGGGTCTTTGGCAGTCACTCCAAATGAAATGGGGAGCCTCTGAGAGTGATGGGTTGAGACTAGACTGGAGGGCCAAGGGTGGG 
TCAGGGAGACCCGGCAGGAGGCTACCACTGTAATCCAGGTAAGAGATGATGCAACTCGGACCGGGGGGTAGAAGTGGAAATCGAGAAAAGAAGGGGA
TTGGAGATGTATTTTTGACGTAGAATCAACCAGAAGTGGGTCGCAGGTTGGATGTGGGGGACCAGTAAGGTCAAGGAAGAACTGAGTTTGGACAGAG
GAAGTAACAGAGGAGGAAAAAAAGAGTAGATTTTGGAAATATTTTGAAATTATTAATAGCTCCTCTGTGATCTGCTGATCAGAGTGTGGGGATTGTG 
TGGGATTTGAGAAAGAGGAGTCAGGAATGACACCAAGTGTTTGGCCTGAGCCATTGGAAGGATGGAGTTGCCATTTTCTGAGATAAGCCTGTAGGAA
GAGCGGGTTTTTTGGGGAAAGATTAGGATATCAGTTTTGGAGAAGTTGAGTTTAACTGGCCTATCGGACATCCTTGTGGTGTTGCCAAGAAGGCAGC 
CTCCAGTATAAAATGGAAACTAAAAACAGGTCAGTTCACACCAACTCTGATTTGGTAGTGCTGCCTAAAGTGCTGTGTTGGGATTTTAAATGCCCAT 
CCAGGCTGTTTCTTTTCTTTTCTTTTTTTT^ 

ACCTCTACCTGGTGGGTTCAAGCGATTCTCCTTCCTCAGCCTCCTGAGTAGCT^ 

ATTTTTAGTAGAGATGGGGTTTCACCATGTTGGTTAGGCTGGTCTCGATCTCCTGACCTCGTGATCTGCCCGCCTCGGCCTCCCAAAGTGCTGGGAT 

TACAGGTGTGAGCCACTGCACCCAGCO^TTTTTGTATTTTTAGTAAAGATGGGGTTTTGCCATGTTGGCCAGGCTGGTCTTGAACTCCTGACTTCA 

GGTGATCCGCCCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCGCGCCCAGCCGAAGAGTGCTCAATTTTTACTGATAGGAATGG

ACAAGCAAAGAGTTTTAAAGATTAATGTGAAATATTGTACAAATATTACAGATGCTCATAATAGTGACCTGGAATACTGTAAGCACCGAGATAG 

CTTGATTCTCAAAGAAAAAAGCTAAATTATATGTAGTTTGGTTTACTGCT^ 

CCAGGGTGGAGTGCAGTAGTGCGATCTGGGCTCACTGCAATCTCCATCTCCCAGGCTCA^ 

TACAGGCGTGCGCCACTGCGTTCGGCTAATTTTTATATTTTTTTTTTGGTAGAGATGGGGTTTTACTGTGTTGGCCAGGCGGTCTCAAACTCCTGGC 
CTCAAGTGATCTGCCTGCCCCTGCCTCCCAACGTGCTAGGATTACAGGTGTGAGCCACCGTGCCCGGCCAACTTTGTGCTTTTAAGTCCCGCAATCA 
CAACTAACTTTTTAAAAAAGGGTTCACCACGTGCCAGCCACTCTTCTATGTATTTTAAACAAATCAACCCATTTCCTCCTCACAACCACTCTGTGGG 
GGGGTACTACTGTTATCACCATTTTTAGAAAAGGAAACTGAGGCAC^GAGAGGTGGTTTAACTTAGCCAAATTCACACAGTGAGGGAGTGGAGTGGC 
AGAGGGGTGTGGCTCCAGGCCCTGGAGGGTCCTCTCCATACGCTAGACCATGTGTTTACCGTTAGAAGCCAGGTCTTCAGGATTTGCTAGAGATGCA 
ATTCTAGGAGTAGGGCTCTTGCTGCTCAAACTTTTGCTTTTCTTAGAGGGGTTATGGATGCATGTTCATAACAGAAAGGGTACTCCTTGAGAAAGGG 
AATAATAATAATCATTTCAGAAACTAOTATCTCTGAATC 

ATTATAGGGCTATTATATTATCCCCACTGGATAAAAGAGGAAGCAGAGAGGTTCCAGTAATTCTCTCAAAGTCATAAGCTGTGGAGCCAGGTTTGAA 
CTGAAAGCAGTCTCTGACTCCAGAACCCTGTTTAGCACCCCTGTTCTCAGCCCCATGGC^ 

TAACTAAGCCATGGCATGTGGACTGTGGGGTGATGGGCCTCACCCATCACCTTGTTCAGCCTCTCACTTCTCAGATGCGGACTTTTTTTTTTGGAGA 

CAGAGTTTCGCTCTTGTTGCCCTGGCTAGAGTGCAATGGCACGATCTCGGCTCACTGCAGCCTCTGGCTCCAGGGTTCAAGTGATTCTCCTGCTCAG 

CCTCCGGAGTAGCTGGGATTACAGGCATCCACCACCACACCTGGCTAATTTTTTGTATTTTTAGTAGAGACAAGGTTTTACCATGTTGGCCAGGCTG 

GTCTCGAACTCCTGACTTCAGGTGATCCACCCGTCTTGGCATCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCTGGCCCAGATGGGGGTA 

TTTTGATGATGAAGAGGAAGAGAAAGAAAATGTATCTTTCCTCCTTGGGGTTTCTTATTTGGCCTGTGACATTCCATTCCCACCCCTCCTCCAAT^ 

GCCATTTTATTGACTTTTTCTAACTGGATGTACCTGGGTAGAAACTCCTCATACTATTGGCCCTGGTCATTGCTTTGTCTTTAGTTTTCCTTTTAGA 

AAAACAAAAACAAAAACAAGAAAAACAAGAGGAGCCTCTATTTCTTTGAGGTGGAGGTAACTGGTGAACTTGTGCTGGTCCTGCATTGAGCTCACCT 

GCCTCCCCAGTGGCCAGTCCTTGGCCCTGAAAGTCAGGGATGATCTAGCTCTAGGCATGGCTATTCCATCCAGGGGTGAGGCAGAAGTATCTCTCCT 

AAAGTCCTGCTCACCAGGAGCTCCCGTCCCCATACTACAGGTTCACATCCAGCTTTCAGGACTAGTCAGTCTATGTGGCCCTCCCTCAATTAATAAA 

TCAGC^ACTAATTTGCCAGGTGCGGTGGTTTGTGCCTGTAATCCCAGCACTTTAGGAAGCTGAGGCAGGCAGATCACTTGAGGTCAGGAGTTCGAGA 

CCAGCCTGGCCAACATGGTGAAATCCCGTATCTACTGAAAATAC^AAAATTAGCCGGGCATGGTGGTATGCACCCGTAATCCCAGCTACTCAGGAAG 

CTGAGGCAGGAGAATCACTTGAAACCAGGAGGCAGAGGTTGCAGTAAGCTGCACTCCAGCCTGGTGACAAGAGCAAAACTTTGTGTCAAAAAAACAA

AGAAAACCAAAAAACAAAGAAAAACACAAAAAACCCTTCTATTTGTTAAAAAAAAAAAAATCCACCGTGAACCAAAAATTAGTAAAAACAATGAACT

AAAATTTTGTTTTTGCAAAATGTATGATAACAAAATGTTAAGGAAGGTCATGTGCCGTTATGGTTCACTGCAGCCTTGAACTCCTGGGCTCAAGCGA 

TCCTCCTGCTTCGGTCTCCCTAGTAGCTGGGACTACAGGCTTGTGCCACCGCACCCAGCTTATTTTTTTTTTTTATTTTTTGTAGAGATAGGAGTCT 

TGCTTTGTTGTCCAGGCTGGTCTTCAACTCCTAGCTTCCAGTGATCCTCCTGCCTCAGCCTCCCAAAGTGCTGGGCTGATGGGACATTTTTATACAT 

AGTGCCATGTTACTATAAATGAGAAGTTTTAAAAATACTGATTTTAAAAATTAATTTATGTCAAGAATTTTTATAACCAAAGTTAAAAAACCAAACA 

AAAAATATGAAAAGGGTTAATATCTTTGAGAGGTGATGAGAACTTATAAGTCAATAAGAGAAAACAAACATCCCTATAAATGAATAAGCTAAGGACA 

TGAATGGGTAATGTACATAAGAAATGTAAATGTCTAGTAATATGCCAAAATAGATTTATTATTACTAATAAGCCACTTTCACTCTCTAGTTGGCAGA

GTTGTTTTGAAAAATAGATATGTAATGATGGTGGAAAAGATTGGTTTAACTATTCAGCAGGAAAATTTGGCAATTAGAAGTGTATCAAAAGCCTTAG

AATGTTTCATAACCTTAGATTGGGAAATTCCACTTCTAGAAATTAATTCACTTCTAGAAATAATCATGAGTGTGCACAAAGATATTACCACAAAAAT 

ATTTTACAGTATTATGTCTAATAGAGAAGAACTAGAAATAATTTAAATTTCCACCAATACAGGTTTGCCAAAATACATTTTGTACATTCACCTAATG

GTATATTATGTCCCTATTACAAATTATGTCCTAGAATATTTAATAGCATGGAAAAGTGTTAACAGTATTTTTTTAATGAAAAAAGCTTACAAAACAG 

TTTGTGATGATTCCATTTAAAATGTGTGTTTATTCATAGAACAAAGATTAGAAAAATAAACATTGATATATTAAAGGGTTATTTCATGGCAAATTGC 

AAATGATTATTTCCTTTTTTTGTGGCTTATTTGTATTTTTGAAGTTTTCTACAATGTAAAAGAATATTTTATGATATGAAAACTACAATACAATTTA 

TAATATAAGAAAGAATAATTCGGCCGGGAACGGTGGCTCACGCCTGTAATCCCAGCACTTTTGGAGGCCGAGACCGGCGGATCACGAGGTCAGGGGT 

TCAAGACTAGCCTGGCCAACATAGTGAAACCCCATCTCTACGAAAAATACAAAAATTAGTCAGGCATGGTGGTGCGTGCCTGTAGTCCCAGCTACTC 

GGGAATTGCTTGAACCCGGGAGGTGGAGGTTGCAGTGAGCCCAGATCGCACCACTGCACTCCAGCTTGAGCAACAGAGTGAGACTTCGTCTCAAAAA 

AAAAAAAAAAAAAAAAAAGAATAATTAACAGAAAATGGTTAGACACTTCCTTAGTGTCTCCTAAGTCAGGAGGACCCCAGTAGGGCAGGGATCCTCA 

TGGCCTCCTCCCATTTGGAGCATTATTGGAGGTCTTTTTCGGCCTCTTCGTCAAGTGGAATCTAGCTTCCGGTAAAACTACAAAGTAACCAAAAGTT 

TGGGAGGTGGAAGAAATGCAACCGGTAGATCTCACAGAGTCTGTGCAAGAAACTGATTCAATGAGAATCTAGTTTCTCCGTCCACAGTTTCTCCAAA

CAGAAACTAAGGCCGACTTTAGGGGCTTGTCCAAACCTAGGCAAGCAACTTAACAAGGTGAGGCCATGACTCCATGGCCTTTCCGTTCTGTTATATG 

CTGACTTAGACTAAAGCTCTCATACTTTAAAGTGCACAGAAATCTAGTTAAAATGCAGATTCTGATTCAGGTTAGGGGTGGGCCTGAGAGTCTGCAT 

TTCTAACCAGCTCCCAGGCGATGACCACGCACGGGACAGGTCTGGGATCACAGTTTAACTAGCAATGGTGTAGAACACAGAATCTGCAGCAAGAAGG

CCAGCTTCCC^TCCTAGCTCTGCCACGGACCAACTGAATGACAGTTGCCTCGGTTTCCGAGTTTTCGTGAAGATGTAGTGAGTCATTACATCGTGA 

GGCTTTCGAGCAGCGTTCACTAAGAACTAGCTCTGACATTATTTATCGCATTCCTTAGAGCAAGCAGCCGGTGAAGTAGGGTTTGACGAATGAATAA

GTGAATGAATGACCTTTGGAGAAAAATTGTTTCCTGGGTGACTAGAGTCCGAGAAGCAAAATGGGAGGGCCCGTGGGTGGGTAGGAGGCCCACCTCC 

TAGAAAGTTCTCTGCACCCGGTGGTCCAGAGGGCCTGGAGTGCCCGGAAGCCGGCCGGCTTGCGCTCAGCGGCCCAATGGGGCCGCGGGAGGGAGGG 

GAGAGCGCTCAGCCAACCCTTTCCGTCCGGGGCGCCGCAGCCCCGCCCCTCGGAGCGTTGCGACGTCCGAGCATTCCACGGTTGCTACATCGTCGCG 

AGGGGCGGGGCGCCTGTCAGGGAAGCGGCGCGCGCGCGCGGGCGGCGGGCGGGCTGGGGATCCGCCGCGCAGTGCCAGCGCCAGCGCCAGACCCGCG 

CCCCGCGCTCTCCGGCCCGTCGCCTGCCTTGGGACTCGCGAGCCCGCACTCCCGCCCTGCCTGTTCGCTGCCCGAGTATGGAGCTGCTGTGTTGCGA 

AGGCACCCGGCACGCGCCCCGGGCCGGGCCGGACCCGCGGCTGCTGGGGGACCAGCGTGTCCTGCAGAGCCTGCTCCGCCTGGAGGAGCGCTACGTA 

CCCCGCGCCTCCTACTTCCAGTGCGTGCAGCGGGAGATCAAGCCGCACATGCGGAAGATGCTGGCTTACTGGATGCTGGAGGTACCGCCCGGACGCG 

TCCCCCTCCCCCACCGCTCCCGGGTCTGGACACACCGGGGCCGCCCTGTCGGGACATCCCGGTCCCCACAGCAATCTGCAGGATGCTCCGAGGCTAA 

GACTCCCGGGGCCTGGCCTTTCACTAGGGCACCCACGCCAGCTGCCTCCCTTGCTTTCCCGAGCCCTAATCGGCTCCTCCCGACAGTGACTGCCAGT 

CACTTGGCACCCTAACTGTGCCTCTCTTCGGGCCGAGCAAACGGGACCAAGGACACTCTCTCCCCTTCGGCTTCTCACAGCGTCTCCCGCTTCCCTC 

CGCCTCCCGGCCTTCTCCGGAGAGCGCGCCTGGAAAACCCTGGGCTCGGCCGCCCCCGCCCGGGAGCCGGTTCCTGTGTCTTTCGCGGGGTTCTGGC 

GGGGAAGGCGATGGGGGTGGCGCGCGCCCTGTTCCGGCCGACAGGGAAATGAAAAGTGCAGGAGGCAGCGGCAGGAGCCTCCTCCCTTCCCCTCGGG 
TGTGTGCAGTGGTGGCGGTGATGGGGGTGAGGTCCCAGCTCTCTGGCTACCATGGGTGACCCTTGCGTCCCACTTCCCCTCGCTGTGGGGGAGGGGA
GTCTCCGTTTGGAATCCTCCGTGAGCAAGGGCTCCTTGGCTTTACCACTCCCCCGTCAAGGGGAGAGGGGAGGAGGCCGCGCAGAGGAAGTGGGGGG 
CGTCACCTTGGGGAGGGCGCATCCTCCTCCTTTATTGGTCCTTGTCTCTCTAAGTAGCTTACCCTCTCTGCCGTTTTTGCTCCCTATCCCACCTCAC 
TTCAGCGCTCCCACCCCCCCGCTGCTGTGTTCCCCACGCTCCAGACCCAGCAGTGACTGTGCGCCCTCCCCCAGGTATGTGAGGAGCAGCGCTGTGA 
GGAGGAAGTCTTCCCCCTGGCCATGAACTACCTGGATCGCTACCTGTCTTGCGTCCCCACCCGAAAGGCGCAGTTGCAGCTCCTGGGTGCGGTCTGC 
ATGCTGCTGGCCTCCAAGCTGCGCGAGACCACGCCCCTGACCATCGAAAAACTGTGCATCTACACCGACCACGCTGTCTCTCCCCGCCAGTTGCGGG 
TGCGTGTGTAACTCCCCCCACCCCCGACACGTATTGTCTCCCCACTTTCCAGGAAAAGGAAAGCTTGAGATATCAGAGGCTTTCTGCCCCCTGGAGA 
GGAAGCAAGGGGATCAGGACCTTTGGGAAAGGAATTGGGGTGGAACAAACGATGGGGCATTGTGGTGTGATGGTTAGTATCACTCCCGCCCAGGCTT
GAGCTGCAGGCAGGCATTAAACCCCCTCCTCCTGACCTCCAGGGGATGCCACTCCTTCCT^ 

TCTACACTCACTAGTACTTTGCCTCTGGGTTTACGCAGCCACCATGAGTTATTTATTGGCAGCAGAAGGGCATGCCTGCACACCCCCACTGGAGTAG 
GGGAGTGGAATGTGCTTCATGTGGGGGAGGGGGTTTTGCCTTGGTTGAGTCTTTATCCTCATGTGGACTTTGTGATAGAAAAGCCTCCCAGAGCCAA
TTAAAGCCAGGCCCTGATTGCTTTAAGGAAGGGCTGATGTATTTCATCCCACCTTCTTTGCTTCCTCTTATATCCCGGAAGAAACAGCTTCTTCCTA 
ATCCTTCCTCGTCCTCATTCTGAGGAACTGGAGCAGCTGCTTTGGGCCTGGTGGAAACAGCTCCCTCTTCCCCCCTTAGCCCAAGGCCCAAGCTGAG 
GGACAGACAGGGAGGTGAGCTGCATCTGCTTCCGATGTGATAATGGTGCCCCCTCCCCTGGCCCGCCCTCTCTGACCAAGGTCTTTCATAACCCTGT 
GGGGAGCTGCTGTCACTCCTGTGGTTTGGGAAGTGGCTCCCTGGGCCTCTTGCTGGAAACTTGGGCTGGGCTCAGGCCGGGCAGGAAAGTCACCGGT
TCTGTCCCGCCTCAGAAGGAGCCTGGGGAGGAGGCGGGGGAGCTGCTGGATCTGGAAGAGGTTATTTAGGACATGCAGATTGTCTCCTAGGGAGGCT
GGGGGAGAGGGTGGGTGAGCTGGCACCTTCCTCCACCTTGCTCTGTGGCGGGCCAAGTCTTCTGCTGACCCTGAGAGGAGTCTCTGGGCCCCATCTC 
CCAGGGAGGAGGGCAGCAAGCAGGCTGCTGTGATTGGCTGGCAGGGAAGCCAGGGTACTGGGGCCCCCGACTGCTGTGAGTCGAGGAACAGGATTCC 
TCTTCCTGCCAAGTAGCTCAGAGATGGGAGTAGGAGGCCAGAGTTGGGAATCCAGGAGGCGCCACAGCTCTTTGTGATGAATCTGGTAGTTTTTGGC 
AAGTCTGAAGGTGGAGCTAGTTTCCAGGGCCAGAGAGCAAGATGTTTTTGGAGGGGTTGGGGCTGGAAGGTAGAAGGGGAACTTCCTTTTTGTGAAA 
AGATGGAGGTGGTTTGTGTGCCCACGTATGAGGGTTTTTCAGTACAGGCAAAAAGTACTCTTCCCTCTGCCTGAGCTGGGGGTGGAGTGCCCAAAAG
ACTTCCTAGAAGGTCAAGGTTGGAAGGGACCTCTGAGAACTCCAGTTGGTCCAAATCCGTGTGCAGCAGAGGAGGCTGAGGCCCAAGGAAGGAGGGC 
AAGGTCCTGCCTCAGCATCGCACTGAGCAGATGAGTGTAGAACCTGGGTCTTCTAGCTAGACCAGGTTGCCTTAAAAGTCAGGAATGTGATGGCCGC 
TGCCACCTACTGAAGCTAAGCACTCCCAGGGGTGTGTTCTCCACCAGTTAACACTGGGATTCTCTGGTTGCCTGGATATGGGGAAAGGTGGCCACTT 
TCCCTCAAACACAACCCCTTGCCTCCCCATAGGGAGAGTCCCACTCCCAGTTGCCTTCCAGATGCTTCCTTAGAAGCTGAGTCCCTTCTGTTTACTG 
CTGAGTTAGGAGTTGGCTCCTGGGACATGCACTAGGGGTAGCGTTGTAATTTATCTGTCTCACTTCTGGCTCCAGGCCAGGCTATTTTCTCTGGGGC 
TCTGTGAGGGTAGGGGTTGATTTGAGAGGCTGACACCTAAGCTGGGGCCAAGAGAGAGACAGCTGTGGCTGGGGCCTGGAAGAGGTTGGGTGGGGTT 
GCCATTTCTGCCTCAATAAGGAGTCAAACCCACAAGTTCCTTCCTGAGATAAAACAATGAAGAAATCCTCATGTGTGCAACTCAGAAATGGGGCCAG 
GTCTGGGAACCCATGACGAGTGGACTGGGGTGGAGCTCCCAGCCTTCTCCCGAGACCTAGTACTGGCAGGTAGGCTTGGGTAGGTGAGGATAGGGCT 
TGATCCCTCTGGAATCATTTGGAGCCATCCTGTCTGGCTTGGCCTCTACCTATCCTCCTGAAAGTCAAGTGGCATGGGGCACTACGGGTTCCAGCTA 
AAGGGGGTGGGGTGGGGTGCTATACCTCAGGACCCATCCCTGATATCCTACCAAGGGGATGTGGCTTGGGTTGGGCAATCTCCCCTTTGAGGAACCT 
GAGAAAAGGCTCTGTGGCTGTCATTGGAGGAGTCATACTCCCTTCTGTTGATTCAAGAAGTGTGGACTTGGCTGGGATGCAGAGTTGCAGGGGAAGC 
TAGTGGTGATCTCCCCTTTCACAAATCCTTGGGGTTGAGAGAGTCTTTCTGCTTCATCCCAGAAGAGAGCCCAGTTCATGGTGATGCTGTGGTATCT 
GCTGCTTCCGTCTTTCAGCCCTGGCTGGAGAAATCCTCAGAGCCCTGAATGAAGTCAGGAGAAGGCCTGGATTATTGGCCCCTTAAACATAACCAGA 
CAGGATTAAAAGGGATAATCCATGTAAAGCATGTTGTCCTGGCACCTAGTAAATGATCAGTAAGTGCTCAAAGCACATGCCTCCTCCTCTTCCCTGC 
CTTGAACTCCATGGTGCCCGGCCTTAAAATCAGAAACAGAACTGGGGTCTTGGGCTTTAAAAGTTTGATGGGCTTCCCTGCCTGAGACTTTGAGACT 
TCTGCTTTGCTCCCTCCTCTTCTCTTCTCCAGCCACTGACCCCACCCTTTCCCCGCTTTTTCAGGACTGGGAGGTGCTGGTCCTAGGGAAGCTCAAG
TGGGACCTGGCTGCTGTGATTGCACATGATTTCCTGGCCTTCATTCTGCACCGGCTCTCTCTGCCCCGTGACCGACAGGCCTTGGTCAAAAAGCATG 
CCCAGACCTTTTTGGCCCTCTGTGCTACAGGTAAGAGCCCTATGCACATTTTTGCCAAATTGGAAAAATCAGAAGACTTATAAGGGTGAGAGGTAAC 
TGAAGATTCTGGTTTATACTGTCAAAATTCCTTTGTGAAAATTATACCCCCTCTGAGCCCCCAGTCTCAATGCAGAAAAGCAGGTTATAATCAGCTA 
TATGGGATCTCAGACCCCTCCAGTTTGTAGTGGGGACACTGAGGGTATAGAGAGAAGTGACTTGCCCAAGGTCTCACAGTGAACAGGTTGGGAACAG 
AACCCCAGTCTCTTGGCCCTAGTTGTGTTCCCATACTAGGGAGTGTCTTGTCTATTCCCGGGACCTATGGAAACTGGCAGTTGGTTCTGATCTCTAG 
GAAAGATACTCTTTCCTGTCTTCCTCCCTTGCTGATATCTTGCCCCACCCACTGCTGGAGTTTTCCAGCATGTCCTCTGCTCTACTCATGCTGCCTG 
GCACTTTCCAAGGAGTCGGAAGTCCATGCTGGCCGTTATCCTTCAGGTTTCCTTCTCTCATGGTTCCCTTTCCTCCTCTCCAAGATTATACCTTTGC 
CATGTACCCGCCATCCATGATCGCCACGGGCAGCATTGGGGCTGCAGTGCAAGGCCTGGGTGCCTGCTCCATGTCCGGGGATGAGCTCACAGAGCTG 
CTGGCAGGGATCACTGGCACTGAAGTGGTGAGTGCTGGGTAGCTGGGCAGCAGCCTCTCCATTCATAAAGTAGTGCTATGCCCTGAGCCTCCAGTAG 
AGGGAAACCTCCAACCCCAACCTGGTCCCAAGTCCTGGTTTCACAACCTTGGGGCTTTGTGGCTTTTTATCTCCTGCCTTGGTCTGGGGAAGAGCAA 
GTATCTTGGGTGGGGTGAAGGGCACTGGCCCTAGGGGTTCTTTTCTTGCCAAGTGGTGTGCTGGAGTGGGGGTGGGGGGTTTGGTGTCAGATGCAGG 
GCTGCTCTCACACTCCCTTGCCACATCCTCTTGCTAGTTTCTGAGAACTAAAGAGCGATTCCTGGGGCCGGGCTGTGGCCTAACCTCCCCAGACTTC 
CCCATGTGTTGGGAGCTGTCCTTCCCCTGCACCTGCTGCCAGATGCTATGGGGGGAGTGTTTGCTCCTCACTCTTCTCCCATGTTCCCAGGACTGCC 
TGCGGGCCTGTCAGGAGCAGATCGAAGCTGCACTCAGGGAGAGCCTCAGGGAAGCCGCTCAGACCAGCTCCAGCCCAGCGCCCAAAGCCCCCCGGGG 
CTCCAGCAGCCAAGGGCCCAGCCAGACCAGCACTCCTACAGATGTCACAGCCATACACCTGTAGCCCTGGAGAGGCCCTCTGGAGTGGCCACTAAGC 
AGAGGAGGGGCCGCTGCCACCCACCTCCCTGCCTCCAGGAACCACACCACATCTAAGCCTGAAGGGGCGTCTGTTCCCCCTTCACAAAGCCCAAGGG 
ATCTGGTCCTACCCATCCCCGCAGTGTGCACTAAGGGGCCCGGCCAGCCATGTCTGCATTTCGGTGGCTAGTCAAGCTCCTCCTCCCTGCATCTGAC 
CAGCAGCGCCTTTCCCAACTCTAGCTGGGGGTGGGCCAGGCTGATGGGACAGAATTGGATACATACACCAGCATTCCTTTTGAACGCCCCCCCCCCA 
CCCCTGGGGGCTCTCATGTTTTCAACTGCCAAAATGCTCTAGTGCCTTCTAAAGGTGTTGTCCCTTCTAGGGTTATTGCATTTGGATTGGGGTCCCT 
CTAAAATTTAATGCATGATAGACACATATGAGGGGGAATAGTCTAGATGGCTCCTCTCAGTACTTTGGAGGCCCCTATGTAGTCCGTGCTGACAGCT
GCTCCTAGAGGGAGGGGCCTAGGCCTCAGCCAGAGAAGCTATAAATTCCTCTTTGCTTTGCTTTCTGCTC^GCTTCTCCTGTGTGATTGACAGCTTT 
GCTGCTGAAGGCTCATTTTAATTTATTAATTGCTTTGAGCACAACTTTAAGAGGACGTAATGGGGTCCTGGCCATCCCACAAGTGGTGGTAACCCTG 
GTGGTTGCTGTTTTCCTCCCTTCTGCTACTGGCTUU^AGGATCTTTGTGGCCAAGGAGCTGCTATAGCCTGGGGTGGGGTCATGCCCTCCTCTCCCAT 
TGTCCCTCTGCCCCATCCTCCAGCAGGGAAAATGCAGCAGGGATGCCCTGGAGGTGGCTGAGCCCCTGTCTAGAGAGGGAGGCAAGCCCTGTTGACA 
CAGGTCTTTCCTAAGGCTGCAAGGTTTAGGCTGGTGGCCCAGGACCATCATCCTACTGTAATAAAGATGATTGTGAAATAAAACTGGCTTTGGCTTC
TTGGATTGGTGTGTGGGTAAGTCTATTTCTGCACTGTATGAGGTAAGGAGGTGGAAGTCAATGGTCTAAAGCAGGGATCATTGAGCCTTTTCTGTAA
AGGGCTGGATATTTAGGCTTTGTGGGGTACATGGTCTCTGACCCAAATATAGTCAGTTCTGCTTTATAATGTGACCTATGCGTTCCTAAAAATCACT
ATGCCATGCAAAATTGCACAATAAAGACCACAGGTCTCATGGGAAAAATGGGGCTTGAAGCACA^ 

GATAGCTTGGTTTTTACATATGTCAAATGGTTGAGAAATTATATCACAATAAATATGGCCTTTTATCTTGAAGACATGCACTTTGCTTTTGGAAGTG
GACGTCGGAAGGGTTGCTCAACACAGGTGGGGAGATGAGTGAGGTAGCTGGTACATGTTTAGGTGTGAGTTTTGTGTACTACTTAGGCTTGGTTCAG 
CTAGATACAGTTTCCCATATTCIACCTACTATTTCTCGAGGATGAAATCACACACAAGCAAACATGAAGTCTGCGTTATGATCAAACTGTTCCCAAAT 

GCCACAGAAAACATGTAAATGGGTGAGCATGGCTGTTCCAACAAAACTATTTATGGATGCTGAAACTGGAATTTCATGTAATTCCTACCTATCACAA 
AATACTGTTTTAATTTTTTCCCCATTTAAAAATGGAAAAATCATTCTTAGCTCATGGGCCACACAAAAATGGGCACTTGGGCTGGGTGCAGTGGCTC 
ATGCCTGTAATCCCAGCACTTTGGGAGGC03AGGCGGGTGGATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACCTGGTGAAACCCCTTCT
CCACCCCGTCTCTACTAAAAATAO^AAAATTAGCCAGGCGTGGTG 

GAACCATGGAGGCGGAGGTTGCAGCGAGCCGAGATCATGCCACTGTACTCCAGCCTGGGCGACAGAGCGAGACTCCGTCTCAAAAAACAAATCAACA 
AAAAAAGGGGCACCTGGCCCTAGTTTACTAGTCACTAGTCTAAAGCTTTCATTGGCTGAAAAGCCAGAAAGGGAGCCTGTTTGATGCTAGCTCTGAT 
CTAACCATTCTTGGGCCCAGCCTTATTCTCTGGCCTTTTTTTTTTTTTTTTGAGACAGTCTTGCTCCGTGGCCCAGGCTGGAGTGTAGTGGCCCC^A 

TCCTGGATCACCGCAACCTCCACCTCCCAGGTTGAAGCGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAAGCACACGCCACCACACCCA
GCTAATTTTTGTATTTTTAGTAGAGAAGGGGTTTCACCATGTTGGCCAGGCTGGTCrCGAACTCCTGAACTCAGGTAATCCACCTACCTCAGCCTCC 
CAAAGTGCTGGGATTACAGGCGTGAGCCACCAAGCCCAGCCTCTCTGGCCTTTCTTGACTCTTATCTCTGCTTTCCCCAAGGAGTTTTCTCTTGTAG 
TCCTTTCATCCACTGCTGAGACACGACCATCTTTGGACAACCTGCTCCTGGCTGCTGTAGCCTGTGATGGGGGAAGGCAGAGGATGTGTCCCCTGGG 
GGCTAACAAGGTGAGTGCAGCCTGGAGAAACAAGCTTGGAGATCAAGGGAGGTGGGTCCTATATAAACAGGACAAGAATGTCCTCCGTCCTCCAAAT
ACTAACAGGAGTAATGCCTACCTCCATTGACTACACCAAACACAGAGGTTCAATAAATATCTGTCTTTGCCTTTATCCACAGTGACCTGACACAATA 
GCCATAGAAAAAGTACTCACACCAAGCCCCAGCTGGGGACTGGGAATGCTAGCATATGGTATCTCAAGTTGGCTCTCAGAACTAAACAGGGATAAGG 
GCCTAGAATGGAAGAGGGAACCAGCCAGACCCTCAGTCCTTCCTGTCCTGGACTGGGAGCCACAGATGTCCCTGTGATCTGTCACTGCCCTGATCTG
GGTCTTCAGCCATTAAAGCTCAGTGTCATCTTCAGTCACCAACGGGGGTCTTGGTGTCCTTCCAAACCCCTTTGGCCAGGACAGCTGACTGTTTTCC 
TCACTCCACGGTGATGGGAACATCTTCCACATCGCGGGGGACTGCACTCTGAAGCTCACGCCTGATTTCGGGCGATAGCTGTGGATGGGGCTGCAGC 
CGGAGCAGTTCTAAGAGGGCCTCTTTCTGGTCTGTGGCCAAGTCGGCCTTGTAGCGCTGGACCAAAGTCAGGAGGCACTGGTGCCACAGCACAGGCA 
GTTCACGCTTCTCTGTCCGGAACCCCAGGAAGTGGAAGACTAGGGCATCCAGCACCCGGTAAGGCAGTGCATACTTCTTATCCAGCAGCAGTCGCAG 
GAAGATGCTGTTGGCACCGCTGTATTCCATCTCAGCAATTTTCAGCATGGCCGCACTAGTGGGAAAGGGAGGTGGGGTAGAGTAAGCAGGGAAAACA 
TTGGAAGAAGGATCCTACTTCTCCCTCTATGCTTTCCCCTTTTTTTCTTCCCCTCTGCTGGCCCTTACACTATGGTATTCCCCAGGGACCTTCTCAC 
CTACACAATCATATTATACTCCCTTGACCTCAGTCACCATCTCTACAGGGATCCACCCAAATCTCTATCTCTAGCATAGACTGCTTGTTTCAGTAAC 
CAACTAGAGTATGGTCCAAACATCTCAACGTCTCCACCCTAACTACTTTCTTCCTGTGTTTCCTACTCATTTGTTTATCTATTTACTGAGAACTTAC 
TTTATGCCAGGCACTCGGGCCCTGTTGTTGCACAACTTACATTGAGGGGGATAATGATACACCTACTCATTGGCTTAAACAAGCACAAACCCAAGGC 
TTTCCTTAGCTCCTTCTCCACTCTGCCCATTCCCAAGTCCTGGCAATTCCACTATCTGGATATCCACTGACCAGCCCCCTTCCACCCCCAGCA^
TACCTGGAGTGCAACACAGGGATGGAGCACTTGGTGATGATGCTACCCACAATGATGGCTTCCCGGAGGGTACAAGTGCCAGACTCGCACAGTGGAA
TCAGGATCCCTAGGGAGAGTCAAAGGGCAGCCACTGTGAGCCCAGGATGAGAAAAAGGAGTTACAGCAGGAAGCATTCCCACCCAGCCCAGGGACAG
CTAGCAATTTCCCACACCAGGCCTGCTGTGTTCCCCATAGAAGCTCCAGCCCTTCCCTTTCTTCCGTGCCTCTGGGATCCCACCTTTGAACCAGGCT
CCAGGTTTGAAAAGGGCCTTCTTGAGAGCCATGTAGAGATGGAAGTTGAGTCGTTTGTATTCAGCAACGTCATCTCGTACTCGAGGGAGCAGGACAA
GGTTGTAGAAGCGCTGGGCCATGCGTTCCTTCAGGTTAGAGGCAAAAATCCTAGTGGGAGTTAAGCGGGGGAAGTTTCCAACTCACACGACAGTCTG
GACCCGGCCCATTAAATATCACCATGGATAGTGCATACAAGAGGTTTTAAAGTAAAATATGGGACCTTTCTTTCAGTGGGGGATGTGTGGGAGCTGT 
GTGCATGCAACACCCATCAGCTTAGAGAGAGAATAGGAGGGGCAATGAGATATACCCAAATGAGGAGCACCGTTTGTCTGTGGGAGAGCCAGAGCTC
GTGCAAACTGATGGCTACTCAGCATCAGTTCGATTCTGCTAGCCCTGAGCATGCCTGGGCAGTCACAGACTTCTGAGGGTCCTGCTACAGGGTTGTA
TCCCAGTGGCTATAAAAGCCTCTCCCCTAGAAATTCATCACAGAGGGTTGTATCCCAGTAGCTATAAAAGCCTCTCCCCTAGAAATTCATCACGGAG 
GGTTCCTAACTGGCTAACTGGGTCTTGGCCCCAAAAAATGACACAGGAAAAGCTAGAAAAATGATGCTATAAAGTAGCACAGAGAGGGCCCTGGGGC
AAAGCTCAAATGCCCTGGGGCCAGAGGCAAGGAAAGCCTGTCCGTCCCAAACTCAGCCTG(IACCTATAAACCCAAGGCCCCCGAACCCCCAGCTACT 
CTACCTGGTGGCCTGGTACATGGCAGCTGCAGTCCAGGCCTCCGGCTCTGTGACGTAGAGGATTTGCTCCCAGTTGGAGAGTGCAGGGATGATCTTA
AATGCCTTGGGCAGTTTTCCACTGCGGTACTTAGATAATACCTAGGGATGAGGGAGGTAAAACAATCATCGTCATCCACATATGGCAAGCTTGCTGT
GGGCTAGGAACTTGTCTGGGTGCTTTTAATATATTAACTCATTGCATCCCCACAATAACCTATAAGGAGGTGCCATTATTATGCTCATTTTACAGAT

GACCAGTGAGAGTTCCAGTAACCGACCCAAGGATGCCAAGTCATTTACAGGCAGAGCAGGAATCTGAACCCACACACTCTGCCTCTTAGGTCTGTTT
TCTACTCCCATGACTTCTTTAGTATGCGATGCAGGCAAAAGGCAAAAAAGCCTGTGGCAGGAGCCCACAGCCCCCAAAGGGGTCTTCCACCATGGCT
CCCCTCTCAGCTCTTACCTCCCGGACCCCCCTGTACACTTCTAGGACCCGGGGGTCCAGCTGGGGCATAGGGAAGCCCGACACCTCTGACATGACTG
TCTCAACCTCTGTCTGCTTCTCAGTCAGCTTCTCCATGATGATGTCAGCC^GGGTGCGCCTAAGAGGGGAAGTGCCAAGAGGAGGGGAGAGAGGGGG
CAAGGAGCACTGTGGTCACTGTCCAGGAAGCTCCAGACTCCCCACAGTCCCCTGTGAGCACCGCCATACTGCGATCCTGCTGGGATATGCAGCTTAG
AGTTCTCTCTCTGTGCCCATTCATCCACCACAGACAGAATCACTGCAGGGTGCAAAGGAACTGAAAGTCATTCCATTTAGGACAAAGCTAGCAGTTG

TACGGAGGTTATTATGAACCTCAACTCTGAAGTCAGTCTGCCCAGGTTCAAATCCTGGCTCACCCACTCCCTGGTATGATCTTGGGCATGTTACTTT 
ACCTCTCTATGCCTCTGTTTCCTCATTTGTAAAATAGGGATAATAACGGTACCCACCTTAAAAGATTAATGGAAGGATTAAATAAGTTAATGTCTGT 
TCATTACTTGGAATGGTCAGTACACAGTAAGCGTTATCTATGGAAATATTTCTAAGACTGATTATATTCTTTTCAGCCTCCTTAAACAGTATTGTTG 
GGTGTATCAAACATAATCTTATCTTAATGATTCAGAATTACCATCGCCTTGAGCTGTCATATAATACTAGCATCTTCAGAGGCTGCTGAAGTGTATT 
TATTTATTTTTTTTGAGAGAGAGTCTACCTCTGTTGCCCAGGTTGGCATGCAGTGGCACCATCATGGCTCACTGCAGCCTCTATCTCTCTGGCTCAA 
GCGATCTTCCCGCCTCAGTCTCTCGAGTAGCTGGGACCACAGGAACATAGTACAAGGCCCAGCTAGTTTTTTATTCTTCTGTAGAGATGGGGTCTCC 
CC^TGTTGCCCAGGCTGGTCTGACACTCCTGGGCTCCTGCCTTGGCGTCC^ 

GTATTTAAAGCTGCTTCTCAGTTGGGCGCAGTGGCTCACGCTTGTAATCCCAGTACTTTGGGAGGCTGAGGGAGGTGGATCACGTGAGGTCAGGAGT 

TTGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAATAAAACAAAAATTAGCCAGGCGTGGTGGTACGTGCCTGTAATCCCAGCTACTC 

GAGAGGCTGAGGTTGCAGTGAGCTGAGATCATGCCACTGCCCTCCAGCGTGGGTGACAGAGTGAGATCCTGTCACAAAACAAAACAAAAAACAAACT 

GCTCCTCTTTTCTGAGGTCCTACTCAATGCCAGGCACTGAGTTTGGGACTTTTCAGTCCTAGAGTCAGCGCAACACTTATAGGACACGGAAACTTAT 

GCAGACATGGGAACTGGGCCTCAGGTTAATAACTCACCCAAGGCCTCACAGCTGTCAGGTGGGGGTAAGAGTCAAACTCAGGTTTTTGGA^ 

AGTTAAGGTGCTTTATAGTACCTTATAATGTGCCCCCAAACTAAATGGCCTCTGATTGTTGAATTCATGTGGATCTAGGCTTGCTTTCTGAAGATGC 

CAGCTATCATTTCTTACGACTGCCAGGAGTTCTGATTACTTCAAGGAGCTTGTCTACTCTATTTACTACAAGCTGGAAAAGCATTCTGCTAAATTGG 

TCAGCATTATCAGTGAAGTAGGAGTTATGAGGTGCTGAGTGAGAACCTGAGAGGCTCTCTGGGGGGACAAGCCCAAGTACATACTTTATGACATATA 

TTTCAACTGTTAGGCATCAATCTGGGAGAGGGGCTC^TTTTCATTTCTTC^TTCTGCATTTTCAAATAAGCTGCCACAGAATTTATCAGCCTCCCTG 

TCCTTTTATAGTATGATTAGCAATGAGATGCATTAGCCCAAAACAGCCTAAAGTATCAGGGACACCATTATGTTAATATCACAGGGTCAGAATGAAC

TGGTTGAGGACCCTTATCCTAGACTCCACTCCCAAGGGCCAAAATTTATGGGCTCAGGAGTTTGACCACAGGCAGTCTAAATGCCTGCATGTTTTGG 

AGCTAGCATTTATTATAGATGGGAATAAATTAGTGGTCCTGACTAGGGGCAGTCTTGCCTCCCAGAGGACATTTGGCAATGTCTGATGTCATTTTTG 

GTTTTCACAACTGGGGGAAGGTTGCTACTGGAGTCTAGAAGTAGAGATCAGAGATATCTACTAAACCTCCTACACTGCACAGGACAGCCTCCACACA

CACACAAAATTACCCAGCCCAAAATGTCAATAATGCTGAGGTTGAAAAACCTTGGCTTAAATGAAGCTTAGGATATGAAGACTTGTCCAAGGAAGTT 

GAGCTTACCGCTAAAAAAACTAAGAATACC&AAACCAAATGCAATAGGA 

ACTGGGCAGTTGTACAACAAATGGACATAAGGACACAATGTTTTTTGAGGCTGGGTCATTCCTACTCACAGTGGTCCTTTGAGGATGGGCCAGTTTT 
GTCAAGGTGGTCCCCTACAGCTAGCAACTCCAGGGCCCTTTCCAAAGGTAGGGAATCATAAGTCTGCCAGGGAACTTGTTTTCCTGTGGGAACTTTC 
TACAAGGACTCATTATCCCAAATCCCCAGGCCTACCTGGCAGGAGGGTTCTTGTTCATGAACATCTCTATGGCACGCTCATCCTCAGGGTCCACAAC 
CACCTCTGCATGATGGCCCGCTGCTGTCATTGTGGCAGCCTTCTCCAGGGTGGGCCACTCCTCGTCCTCGTCATCTGATCCATCCTGAGGCATTCTT 
GGACCTAGAAGAGGGGAGACTAAATGGATGAGCACAGCAAAAGGTAGCTGAGGGATTTCGTCAGTTTAACTTTAGCCATTATGTAGTACAGGCCTCA 
CATGTGGTTGTGATAGACGCAGGTATGTGCAGTGAAACAAATGCATTAGCTCCCTGTCCCCAGGGGCTTGCAATCATGCTTGAGAAACAAAATGTAT 
AGTCTTGCTCTGTCACCCACACTGGAGTGCAGTGGCTGGATCTCAGCTCACTGCAAGCTCCGCCTCCCGGGTTCATGCCATTCTCCTGCCTCAGCCT
CCCAAGCAGCTGGGACTACAGGCATAAGCCACCAGCCCGGCTAATTTTTT^ 

CGATCTCTTGACCTCGTGATCCGCCCGCCTTGGCCTCCCAAAATGCTGGGATTACAGGGCGTGAGCCACTGTTCCCGGCCTGTAGAAGGAACTTCTA 
;^CACTATACCCATGTTAACTGTTTAATCCTTATAAACCCAAGAAGTGGCGGGCANNNNi^^ 

CCAGCACTTTGGGAGGCCGAGGCGGGCAGATCACGAGGTCAAGAGATCGAAACCACCCTGGTCAACATGGTGAAACCCCATCTCTACTAAAAATACA
AAAATTAGCTGGGCGTGGTGGTGGGCGCCTGTAGTCCCAGCTATTTGGAAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAGGTGGAGGTTGCAGT 
GAGCCAAGGTCACGCCACTGCACTCCAGCCTGGTGACAAAGCGAGACTCCGTCTCAAAAAATATATATAAAATTAAAAAAAATAAAATAAACCTTCT
TATGGAACAATGGCAAATAAATAAATAAATAAGAATAAACCCAAGAAGTGGATACTATTACTATACCCAGTTTCAAATGAGGACATTGAAGCACTGA 
GAAGTTGGGTAATTTGCCCAAGACAGGTAGTCTGGGTCTAGACTGCATTCTTCACCACTATTCTACACATATTCGGGTTTGATTATATGCCACAATG 
AGTAGTAGAGACAACATATGCACTTAAAGGATGAGTGGTGAGGACAGTTAGGAGCTGGATGGGCAGTTTGGGAGCAGAAGGAGTGTGTTCCAGACGC 
AGAGAAAATAAGGCTTGGATATAAGAGTCTCCTAGGCAATAGCAAGAAATAATGCTAGGTACGTGGGAAAGGTGAAGCCAGACTGGAGGGCCTTTAA
ATTCAGGCAGTAGAGTTCAGATTTCAATCCACCAGTCCATTTAAATACATGAAGTTGGCTGGGCGTGGTGGCTCACGCCTGTAATCCCACCATTTTG 
GGAGACCAAGGCGGGTGGATCATGAGATTGGGAGTTCGAGACCAGCCTGACTAACAGGGAGAAGCCCCGTCTCTACTAAAAATACAAAATTAGCCGG
GCTTGGTGGCGCATGCCTATAATCCCAGCTACTTGGGAAGGCTGAGGCATGAGAATCGCTTGAACCCGGGAGGTGGAGGTTGAGGTGAGGTGAGCCG 
AGATCGCGCCACTGCACTCCAGCCTGGGCAACAACAGCAAAACTCTGTCTCAAAAAAATAAATAAATAAATAAATAAAAATAAATAGGCCGGGCGCG
GTGGCTCACGCCTGTAATCCCAGCACTTAGAGAGGCCAAGGCGGGCGGATCACGAGGTCAGGAGACTGAGACCATCCTGGCTAACACGGTGAAACCC 
CGTCTCTACTAAAAATACAAAAAATTAGCCGGGTGTGGTGGCGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGTGTGAACC 
CGGGAGGCGGAGCTTGCAGTGAGCCAAGATCGTGCCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTCCGTCTTAAAAAAAAAAAAAAAAATGAA 
GTTAGTCCATTTAAATAACTTCTGTTTTTGGAAGGCTACATGTCTGGGCATCTGGAAAGGGGTCTAATAATCCAAGTGGGGAGCTATAATGCTACGT 
ATCAATAGGTCATTAACAACAAAGGCAGTGTGTGATAAACACTAAATAAGGGGGAACAGACATTTTGTCTCCACGTTCCAACGGTCCCCCAAATTTG 
TACTTTGTAGGTGGTTCTCCTTAGTGCCAGACCCTAACTCCAGGTGTCTCCTGTAAGCTGCTCTTGGATGACTCACAGGCACCTCAAATCTCACCCG 
TTATCAACTAAACTAATTTTCTTTCCCTCCTAGGTCTACACTGCTAACCATCCACAGCTCAGTAAACGGCACCATCGTCCACCCAATGCTAAGCCTG

GGAGG

HUMAN SEQUENCE - mRNA

GCGCCTGTCAGGGAAGCGGCGCGCGCGCGCGGGCGGCGGGCGGGCTGGGGATCCGCCGCGCAGTGCCAGCGCCAGCGCCAGACCCGCGCCCCGCGCT
CTCCGGCCCGTCGCCTGCCTTGGGACTCGCGAGCCCGCACTCCCGCCCTGCCTGTTCGCTGCCCGAGTATGGAGCTGCTGTGTTGCGAAGGCACCCG 
GCACGCGCCCCGGGCCGGGCCGGACCCGCGGCTGCTGGGGGACCAGCGTGTCCTGCAGAGCCTGCTCCGCCTGGAGGAGCGCTACGTACCCCGCGCC
TCCTACTTCCAGTGCGTGCAGCGGGAGATCAAGCCGCACATGCGGAAGATGCTGGCTTACTGGATGCTGGAGGTATGTGAGGAGCAGCGCTGTGAGG 
AGGAAGTCTTCCCCCTGGCCATGAACTACCTGGATCGCTACCTGTCTTGCGTCCCCACCCGAAAGGCGCAGTTGCAGCTCCTGGGTGCGGTCTGCAT
GCTGCTGGCCTCCAAGCTGCGCGAGACCACGCCCCTGACCATCGAAAAACTGTGCATCTACACCGACCACGCTGTCTCTCCCCGCCAGTTGCGGGAC 
TGGGAGGTGCTGGTCCTAGGGAAGCTCAAGTGGGACCTGGCTGCTGTGATTGCACATGATTTCCTGGCCTTCATTCTGCACCGGCTCTCTCTGCCCC
GTGACCGACAGGCCTTGGTCAAAAAGCATGCCCAGACCTTTTTGGCCCTCTGTGCTACAGATTATACCTTTGCCATGTACCCGCCATCCATGATCGC 
CACGGGCAGCATTGGGGCTGCAGTGCAAGGCCTGGGTGCCTGCTCCATGTCCGGGGATGAGCTCACAGAGCTGCTGGCAGGGATCACTGGCACTGAA
GTGGACTGCCTGCGGGCCTGTCAGGAGCAGATCGAAGCTGCACTCAGGGAGAGCCTCAGGGAAGCCTCTCAGACCAGCTCCAGCCCAGCGCCCAAAG
CCCCCCGGGGCTCCAGCAGCCAAGGGCCCAGCCAGACCAGCACTCCTACAGATGTCACAGCCATACACCTGTAGCCCTGGAGAGGCCCTCTGGAGTG 
GCCACTAAGCAGAGGAGGGGCCGCTGCCACCCACCTCCCTGCCTCCAGGAACCACACCACATCTAAGCCTGAAGGGGCGTCTGTTCCCCCTTCACAA
AGCCCAAGGGATCTGGTCCTACCCATCCCCGCAGTGTGCACTAAGGGGCCCGGCCAGCCATGTCTGCATTTCGGTGGCTAGTCAAGCTCCTCCTCCC
TGCATCTGACCAGCAGCGCCTTTCCCAACTCTAGCTGGGGGTGGGCCAGGCTGATGGGACAGAATTGGATACATACACCAGCATTCCTTTTGAACGC
CCCCCCCCCACCCCTGGGGGCTCTCATGTTTTCAACTGCCAAAATGCTCTAGTGCCTTCTAAAGGTGTTGTCCCTTCTAGGGTTATTGCATTTGGAT
TGGGGTCCCTCTAAAATTTAATGC^TGATAGACACATATGAGGGGGAATAGTCTAGATGGCTCCTCTCAGTACTTTGGAGGCCCCTATGTAGTCCGT
GCTGACAGCTGCTCCTAGAGGGAGGGGCCTAGGCCTCAGCCAGAGAAGCTATAAATTCCTCTTTGCTTTGCTTTCTGCTCAGCTTCTCCTGTGTGAT 
TGACAGCTTTGCTGCTGAAGGCTCATTTTAATTTATTAATTGCTTTGAGCACAACTTTAAGAGGACATAATGGGGGCCTGGCCATCCACAAGTGGTG 
GTAACCCTGGTGGTTGCTGTTTTCCTCCCTTCTGCTACTGGCAAAAGGATCTTTGTGGCCAAGGAGCTGCTATAGCCTGGGGTGGGGTCATGCCCTC 
CTCTCCCATTGTCCCTCTGCCCCATCCTCCAGCAGGGAAAATGCAGCAGGGATGCCCTGGAGGTGGCTGAGCCCCTGTCTAGAGAGGGAGGCAAGCC
CTGTTGACACAGGTCTTTCCTAAGGCTGCAAGGTTTAGGCTGGTGGCCCAGGACCATCATCCTACTGTAATAAAGATGATTGTGAAATAAAACTGGC 
TTTGG 

HUMAN SEQUENCE - CODING 

ATGGAGCTGCTGTGTTGCGAAGGCACCCGGCACGCGCCCCGGGCCGGGCCGGACCCGCGGCTGCTGGGGGACCAGCGTGTCCTGCAGAGCCTGCTCC 
GCCTGGAGGAGCGCTACGTACCCCGCGCCTCCTACTTCCAGTGCGTGCAGCGGGAGATCAAGCCGCACATGCGGAAGATGCTGGCTTACTGGATGCT 
GGAGGTATGTGAGGAGCAGCGCTGTGAGGAGGAAGTCTTCCCCCTGGCCATGAACTACCTGGATCGCTACCTGTCTTGCGTCCCCACCCGAAAGGCG 
CAGTTGCAGCTCCTGGGTGCGGTCTGCATGCTGCTGGCCTCCAAGCTGCGCGAGACCACGCCCCTGACCATCGAAAAACTGTGCATCTACACCGACC 
ACGCTGTCTCTCCCCGCCAGTTGCGGGACTGGGAGGTGCTGGTCCTAGGGAAGCTCAAGTGGGACCTGGCTGCTGTGATTGCACATGATTTCCTGGC
CTTCATTCTGCACCGGCTCTCTCTGCCCCGTGACCGACAGGCCTTGGTCAAAAAGCATGCCCAGACCTTTTTGGCCCTCTGTGCTACAGATTATACC 
TTTGCCATGTACCCGCCATCCATGATCGCCACGGGCAGCATTGGGGCTGCAGTGCAAGGCCTGGGTGCCTGCTCCATGTCCGGGGATGAGCTCACAG 
AGCTGCTGGCAGGGATCACTGGCACTGAAGTGGACTGCCTGCGGGCCTGTCAGGAGCAGATCGAAGCTGCACTCAGGGAGAGCCTCAGGGAAGCCTC 
TCAGACCAGCTCCAGCCCAGCGCCCAAAGCCCCCCGGGGCTCCAGCAGCCAAGGGCCCAGCCAGACCAGCACTCCTACAGATGTCACAGCCATACAC
CTGTAG 
TABLE VII



MOUSE NOMENCLATURE 
ICSGNM Wnt3 
Celera mCG19162 

HUMAN NOMENCLATURE 
HGNC WNT3 
Celera hCG27972

MOUSE SEQUENCE - GENOMIC 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 



NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNl^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNN^NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 



NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

GGGGCCAGTTCCCTCCGGGCAGATAGATCAGCTGGCCCCAGGGGTGCAGGATCCGAGGGGTCAAGCCCCCAGAGTGGCAGCTTTGTGTGATGGAATG
AGGCTTAAACTCCAGTTGGCAACCTTCAGTACATTCATGGGGGCAGCCACCACTGTGTGCACACAAACAAGCACCACTGTGCGCACACAAGCAAGCA

CCTCTGTGTGCACACGAGCAAGCACCACTGTGTGCACACAAGCCAGCACCACTC 
GTGTGCACACAAGCCAGCACCACTGTATGCACACAAGCCAGCACCACTGTGTGCACACGAGCACCACTGTATGCACACAAGCCAGCACCACTGTGTG

CAACCAAGCAAGCACCACTGTGTACAAACAAGCACCACTGTGTGCACCCAAGCAAGCACCACTGTGTGCACACAAGCAAGCACCACTGTGTGCACAC 
AAGCATCACTGTGTGCACACAAGC^GCACCACTGTGTACAAACAAGC^CC^^

TGTGTACAAACAAGCACCGCTCTGTGCACACAAGCAAGCACCACTGTGTACAAACAAGCACCGCTCTGTGCACACAAGCAAGCACCACTGTGTGCAC 

ACAAGCAAGCACCACTGTGTGCACACAGGCACATGTGCATGCTGCACCCCAGCTTTCTCTTACCCACCTCCTTCTGGGACCCTGAAATCTCTGTGAC 
TGCTTGGAACATGCTAAGAGGTCCATGGACACCACCCAGTCCTGTGAGTAGTAAGTACCTCCTGGGCTAGATATGACCCTCACCTATAGCAAGTGTC
CGCAGAAGGCTGAGGTAGCCACTTCTAGGGACCCACAGCTGAGTTCCTGTTCAGTGCCCTGGTCTGTCCTGCAAGGTAAGGACATCAAGGATAATGA
CCATTAGTAATGGTGGAGAAACGAAGCATCTAGTCTAGTTAGCTGTCACGGCCAAAAGGACTCTTAATTAGGCCACGTTAATCTATAGGAAGGGTCA
TAGTCAAAGAGGGTAAGTGAGCATGCAGCCCAACGACAGAAAATCGGCCAGGGACCCCTTGGCTCCCCACCCACATCCTCCCAGCTCTCACATTCTG
ATGGAGGACAACAGGTAGAGACTAACAGCTTTTCTGGGCCCTGGACTGTCTACCTTGTCTGAAGTCTACAGTGTCTGAAGTAGGTATCTGAACTTGA
CCCAGCCCTCTTCCTCGCCTTCCTCCTCCCACTCCTGAAGAAAGCTTCTCACAGACCAAAGAAGTGATCACTGGCATCGCCAGGCTCCTCTGGCAGC
TGAGAGGCCGTCACGCTTCCTCCTTACTGGGAATATCTGGGTCGCAGAGGCCTCCGTATGACCCATACTTTCACCCCACTTACTCGTAAGGCTGGAT
TGGTGTCCCCAAATATGTCCAATACTTTGAGACGGGGTGGGGGTGGGGCCTCATGGCCATGGAGATTCCAGATGCAGAGGAGGGAACTCAGTGCCAC

CTTTGGCTGTAAGGAACAGCCTCAATTGGGATTAGGAAAATACACCATCTTGAATCATAAACA^^ 

AATCACAAACAAACAAACATTCCAAGACGGGATTCTCAGGGCAGTTAATGTCTCAGCAGTGGCTCAAGAGACTCAGTTTCTCCCACCCTGGTCTTAA 
GTCTCTACTTTGGTCTCAGTCAGTTGTTCTCACAGCATCCAACCAAGATATAACAATACCCCAAAGCGGGGAAACTGGACAGCAATTTAGCGGACTG
CTCACTCCCCTAAATACTTTCTAGAAATCACAGGTCCGTCAGCTGACTCCCCTTCGTGTCTCCTTTGAGTCGGTGTAGCCCAAGCCCTTGCAAATTG 
ACTCAGACTAGTCAGCATTTATCTCTGAGCCAGGTGGGGGAGAACAAACTTGGTTCCTCATGTGCCTAAATAAAGCCTCTGCCTTTACATATACTTT 
TATATACATATATGTGTATATATATCATCCTAGTCAGGATTTCTACTGCTGCACAAAACATCATGACCAAGAAACAAATGGGGGAGGAAAGGGTTTA 
TTTATCTTATACTTCCACATTGCTGTTCATCCCCAAAGGAAGTCAGGACTGGAACTCACACAGGGCAGGAAGCAGGAGGCAGGAGCTGATGCAGAGG 
CCATGGAGGGTGCTGCTTACTGGATTGCATCTCCTAGCTTGCTCAGCTTGCTTTCTTATAGAACCTGGGACTGCCAGCCCAGGGATGGCACCCCCCA 
CAGTGGGCCCTCCCAGCCTTGATCACTAATTAAGAAAATGCCTTATAGCTGGATCTGATGGAGGCATTTCCTCAAGGGAGGCTCCTTTCTCTGTGAG 
AACTCCAGCTTGTGTCAGGTGGACACAAAACCAGCCAGTACAGTTACTTGCAACCATAAGTAACTCCAGTTCCAGGGATCCTGACCTCTGCAGGCAT 
CGAGCAGGTGGATGGCATACATGCATACAGGCAAAACACTCGTACGTGTAAAAGAAGATAAATCCAAAATAGTAATAATAATAAAAAGTAACTCAAA 
AGAACTTTCCTCCTCACTCAGCTCAGCTACTGCCAGGAACCATTTCAGAGCCAGCAGGCATCACCACTGCAGGCGAGGGAATAGGCTTAGAGAGCAT 
AGGAAACATGAAACATGCCCAGGGTTGTACAGTACGAGGCAGAGCTGGGCTCTAAGGCAACCACTCA^ 

TCCCATAGGACAGATCCTGAGAGGCCAGCAATGCTAAGCAGGGTTTGCCCAAGGACCGCCCCCCACCACCTGTGACAGCTATAAACACCTGTTCCAG 
GATCTTTGCTGGTGAAGTAGCGAGGGCCCCAGCGACTGTCCACTGAACCATCTACTTGTGTTCAGACTCTTAGCCCCTAGAGCTCATTTCTGGTCCA
ACCTCAGTTTCCCCACGGTGGCTGCTGTTGGTCATACTTGTCTGTGAAGGGATGGTTGGGGGCAGAGATCTGGAGGGTGCAAACCAAGCATGATGGG 



CTGGCTGGCTCTTAATTTTGCTGTTTAGAAACATTTGATTTTCTGAGGTATGCCTCTTCCAGGAGAAGGTAAGCCTCATAGACTCAGAACCCTGGGG 
CTCAGGCATGCTTACCTGGGGGTTACCTGCATATAGTCCTTCn'TAGTGAACAACTAACTACACAGGACACCTATGCAAAG 

TAATGGCATGCACATGTGTGCACACCACACACACACACACACACACACACACACAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGA 
GAGCGATCATTGTGGCTGGAGGGGCTGTCAAGTCTCAGATGCCTCCAGGTGTCAGTGACTTCTGGCCAGGTTGCTATGTTCTCATCTCAAGCCATCA 
GGAAAGGTGGGTGTTACCTTGAGGAAAACTGAACCCAACACTGGGGTGGACAGGTCTCCAGGAAGCATTAAAGTGCCTTAAAATACCACACGG 
TAAACCCAGGGCCCTCTCATCTGGGATCTCAACACCCTCTCTAAGTCTGTCAGCCTACTCTGTTCTTGGTGCCAGAGTTCTCCTGAGTTCAGTTGGA
TAAAGGGCAGCACACAGCACACAGGCTTAGGTGTGCCATGTGTTAACTCCATGGCAGTGTTAGGGGCTTCTGGCC^ 

CACTGATGGAGGTGGAGAGACTCTCCAAAGAGATGCTGATGTGGGAGTCCTACAAGGGGCGGGGGCGGGGTGGGTGGGTTGGCCTCTGATACTAGGA
CATGTTAGGGGGAGAGGAGTGCCCTTGTCACAGAAAAAAGAAGAAAGCCAGTGTTTTTTTGTTTGTTTGTTTGTTTGTTTTTTAACCATCCAAGTCA 
GCTTTGTGTAGTATTAATGTCTAATGATAAGGACTCCCTGTGAGATAGCTAAGTCCCATTCTCACACCACAATTGAGGGTACATTCAGGGAGCAGCT 
ACCTCCACTGGCCTTCATTTCTTCTTCCAAAGGACCCCTTCCAAATTACCCTGGGGACCTGCCGCTGCTGTTGGAGGATCACTGGAAAAAGGGGACA 
CCTGACATCTTCAGGTCTGGCAGACCTAGACCTAGAAAGGATACCTGTTCTGGGTGGAAGGCCTTGGGGTATCACAGGTCCACTCCCAGAGATGCCA 
ATCTGTATCCCATAAGCCAGTCTGCTGACTGTCTCTAGGGCCTGCAAGTCCTCCCAGGTGACTCAGGGGTGCCTTGAGCCTCTAAGTGGAGCACACT
TGATCCCCAGGTAAGCTCCGAAATTGCACAGTAGATTAAGGGAATTCACTCACAAGGATGCCTCCTTAGCTGAGATCCCCAAAGATCCAGGCTGCAT 
AGGACCTGAAGGCTCACTCACTTCTGCTTTCCAGTCAGCAGCAATGTAGCAATGTAGGAGCCAGGCAGCACAGGGAAGGCTCTGAGCTCTGGGACAG 
ACACCTGCATCTCTTTGGGTGCTTGGATCCCCCTACTCTTTGGCCTTTCTGCAGGCCACAGGCTTCGACTTCTGAGTCTTCACTTAGGATGATGAGG 
CCTACCTGCATGAATAGCGTCGTGTCTAATTTTTGTCTCCGCCAACCCCTCCATGATCCTCCTACTTCCAAGGCCACCTGCTGTCTGGTCTCCTAGC 
TGTCTCCAGGAAACTCACCCGAAATCGTTCGAGAGGCTCCTAGAGTTGTTAAGGGAGACCCTTAAAAGCTCCCTTCTCCTGGTCAGAGACCTTGGAG 
TCTCCCGTTCCTTCTGGTGTTCTTCTGAGAATCTCTGCTTCAGGTCCCTTGACCTTCGCACCTGATGCCTCCCTGCTTTGCCCACAGTGGTTTTTCT 
CCCTCCATTCCTACATTAGGCTAGGTTGGTTCTGAGCCCCCACCCCCATCCCGCAGTTCCCAGAGGGAACAGTCCCAGCAAAAATACATGGAACTCA
GTGCCTGAGTGGGAGATTTCCCCCTCCCTCCTCACGACAGACTGGCAGCCGAGTGAGGGCGGGTGGGGGTCTAGCGGCCAAATTCTTTGTCTTCGAG 
GTGCCTCAGCAGGGGTCCGGTGGAGCAGCGCAGAATACACACAGCCGCTGACCGATAACTCCTGGAGAGAGCGCACTTCAAAAGGAACCTCCCGGCA 
GCTACTGAGGTGGGGGTGGGGGTCCCCATCTCGGGTCTGAAGTCC^ 

GGTCCTGGGTCCCAGATGCCTTCCCCTCCCTCAACCAGGAGACCACCCTTCCTGTCCAGGGCGAGCCTCATATCCACCCTCATTAGCTGGTTTCAGA 
GTAAAAATAATCCTTTCCCGCCATGTGCAGCCTACCCTGATGTTGGGTGCCTATCAAGAGAGAAGATGTGTCTAGGAGATACTCGTGGCAAAGGGGC
TCCCTCACTGCCTGGGCAGGGTCO^TTCCTGCGTTGCCACCCGCAGGCAGGAGGAAGAAAATTAAGCCTGTGTTGAAGGAAGGACACCTGGCAAGAG 
AGACGAAAGGACTCCGGAATCCCCTCCTCTGGGGAACTTTCAAGTGGGGGGGAAATCCCCACGCAGGCTTCTACTCTGCGGCAGCAACATGCGCGGC
CACTCCCCGCTTCTGGTCCCCTCCTCCTTTCCTTCTTGCACCTTTGCCTGTTTGTTTCCTACCCTCAGAACAGCACCTGGGGCTGCCTCGGGCTCTG 
GTTTGCCTGTAAGAGGCACCTGCATACACACAGCCCTTCAAGGGACGACCAGTTTTAGCCACA CC^ 

AACATCTCACCCAAAGCCACCAAGAGCCTTTGCCCCCGCTCTCTTGGGCTGGCACCTGTGGAATCTTAAGGATCTAAATGACCCCAAGAAGCCCTCC 
AGATTAGCTCTTGCCTCCACACCATAGCGGTGTGTTGGAGGCCAGAGAAAGCAACGTCCCTGTTTTCAGAAACACTAATTTTGGACTGCCCCCCCCC 
CCGCCCCACGCTCTGATGCAGGACTAGAGATTTAATCCGCATAGATGTTCAGCGGCCTATGGGGAGCCCCCATGGGATGAGCAGCAATCAAGATCTC 
AGCTAAAGGCTACCTAGTAACCCAAAAGCCTGTGAAGCAAGCTCCTTGACAGAAGCCTCAGCCCCGGGGCTTTGCATTCCCCGATAGTGGCCAACCC 
CCTCCTACTGCCCTCCCAGGTGAATCAGAAACGGGGCCTCAGAATCAGCCGTGCCCCTTGGGGAGGAATCTTCTGGGAATGCCCGGCCCCCGGGGGC 
ATTT CT AGGGGTGCTTATCT ^ 

CCTTCCCCTTTCAACCCCCACCTCTTCCTCTGTCTGGGCGTTCACCCCACCCCCGCCCCTGTTTTCCGTCTCAGTGTCTCTATCCACTTGTCTCTCT 
CTATCTCCCCTTCCTCCCCTCTTTCATCTCTCCCTCTACCCCCCTCCAGCCGAGGATCGGGCTAGGAACTCGCAGAAGCTGTGACAAGCCCAGGCGG 
CTCTCTCGACCCTTGGTGCCCCCAGGGGCCAGTTCCTTGCTCTCCTTACACCAGCAGAAGCCATGTGCCGAGAACAGACCAGCAGGGGCAGTGGATC 
CGAGCTGTGGGCACCAGCCTGCCGGATCAAAGGCTAGGAATGTCTGCTCCCCGTAGGACTTCCCTCAGTCCCATCCACCCCTAACAGCAAGAAAGCC
CGCCATTCCTGGCTAGGCAGGTTTGGGTGGGCTCCCTCTCTTCACCGTCTACACAGTTAAGTCCCACCATGGCTTCTCCTTTGGCTTGCTTTCTAGG 
TCCCAAGCCAGGAGCATTTCAAGGGCTGAATGTCTTCTGTGTCCGGGTAGAGAGGCTGAGGTACCTAGGTAAGGGGACTGAATCCTAACAATCTAGG 
TATGATTGGGCTCAGACTTAACTCCCAAACTGTGGAGAACTGGTAAGAAGAGTGCCCCTTGGAGGCCTGAACAGTTCCCCTTGTATAAGAGCTTCAG
TAGGTCGTAGGGTGAATCTGGTGTCTGCACGGGACAACGCTTAAACACCAATCGCCAGGTTTTAGTTTCTCATCCX3TAGAAGGGTCGGGAATATAGG
TCAGAGACCTGGACATAGGGGAGGTAGATGTTCCTGAAGAGCCCTCGCCCCCAACCACTAATAGACAGAGCTATACCGTGTTTCACTAGTCCCAAAT 
GCTTCCATCCTAAGATCCAAGAAACTGCAGGTACTGCTGAGGTCCGGCCAGCACGCCTGCAGGAGGAGAAAGTGACCGAATCCTCCCGCAGGAAATG
CAAACGGCGGCTGGTGTCAGTTGCAGGAAATGCAAAGGCTGCAGGAGGTCGCGATACCAATCTGACTCTTGACAAAAGACTCAAATGACACTAGCTC
TGTCCCCCGGCGCCGCCGGCTGGCAGACCTTCAGAGCCTCAGCGGACTCTCCCGCGCGCCCCTCGGACACCCTGGAAGCGACCTGCGGCCGGGTGGC
AGGGGCCGCCAGCTCGCGAGCCCGAAAAGGCTGGAGACTGGGAGGGGTTGCATTTTCCTAGCCCCTGATCCCAGTCGCGCGATCTGGAAACAACCCC
TTTGCTTTCTAGTCTCGATTCCTTACGGGCCCAAGTGCCAAGGGATCACAGAAAAGGGAGCCACCCATCAATGCCCAAGCCGACCCCTTCACCACGT
GAACGCCGGGTCAAAGCTCCAGCTCCCTAAACCTGGCTCGCGTGGACTGGGAAGGGGGTGTCCCAGGCCAGCAGAGAATCCAGCGTGCCGGGTCCTC
CCGAGGCCTCCGTGCGCCCGGGGCGCAGGCTTTGCCCAAGTTTGCCGGGAGATGCCCTTCCCAGAGCCGGAGTTGTGGGCCGGGCGGGAGGGGCCCG
TGATTGACAGGCTGAACTACAGACTCATCTCTTACCTTAGGCCGCGGGCGCTGATTGGCTGCTCGCTGACATCCTCAAACCCGGCTGCTCGGCGCTG
GGCTCCGGAGGGGGGCXBGCTGCGGGTGGAGGTGCGCCTCTGACAAGCCCGAAAGTCATTTCC^

GTCGCTCCCCCTCTTCATGATCGCCGGCAAACTTCCTCCTCGGCGCTGCTTCTAATGGAGCCCCACCTGCTCGGGCTGCTACTCGGCCTCCTGCTCA
GTGGCACCAGGGTCCTCGCTGGCTACCCAATTTGGTGGTAAGACTTGCCTCTTGTCTGCCTGCGGCCCCGATCTCTCGGCCGCACCCGGGGACGGTG
GGAGGCGCTGGCCGGGTCTGGGCTGCAGCAGGCCTGGGGCTCTTTTCTCCTTGCTTCCCGCTAGCCCCCCAGCCGTCCACCCCTCTTGGATCTTCTG 
CTGGTGTCGCCTTCCGAATCCAACTCCTGGCTTCTCAGATAATCACCATTTTCCCTGCTACCCGCCGCAGAGGCGGGAGGGCTCTCCCTACCCCTGA 
AGGTCTCTCGTTCTTAGTCTTCACCCTCTCCCACCCCAATTTGGGGGTCCCGAAGAACCTTCGTTTGCTCACACTTGCATTTCTGTGCTGTCCCCGC 
GGCCCGGCCACTCCTCTTCTCCATTTTAGGGTCTCCGTGGGTAAAATTATCCTGTCCTTTTTCGCCACCGCACCTCTTCGGTGTCTCGGTCAAGAGA 
TTTCCCCCACAATACTCGCGTTTGCCTTCATATTTGAGACCTTAAGGGCTTTGGGAATCCCAAGCAGGGGACTTAGGGGGTATTTCAGTAGATTCTC 
CCCACTCCGTTCTAGATTAATTTTGAGCCTCTGAAGTGACTATAGAATTTGCTGATCCTAGATGCCATTGCTGATCCGGGTGTATTCGTTCCTTGCT

TCCATGTCAAATTCGTTGCTTTTGGAAGACGGGGAAGCTAGGAGAAAGGGCGAGGGAA 

ATTTTAGTAAATAAGGCTGGGCGAGCAGGCGGAGTACTTGGAGCTGATGGGTGTGTGAAGGGATTCACTGGAAGATAAACCGCCTTTTGGAAGCCCC 

AGCTCCAGAGACTCCCCTGTTTTCCAAGCCTCATTGCTGCTCCTCCTCTGTAGGTCCTCCGGTGGAAGACACTAAAACCAGTTTTCTAAATTACACC 

CCAGCCCTCTCAGTCTCCAGCCCCCTGCATTCCGGAAAACTTTAATAAAAGAACCCTCCCCTCCTGGGCAGTGGCTAGAGCCCTGGGGGGGGTGGGA 

GGTGAGGAGGAGGAGGAATTTGGGACCTCTGAGTAAGAAGGAGACTGGCCCCTGGGGTGAAAGTTAGAAGTGCTACCaGGGTAAAGTGCTGCCTACA 

TGGCCCCCTCCCTGCCTGCTACCTGCAAGCCTGCAGTATGAGGCTCAGAGCTTATAAAATACTGGGCCACACCCCAAGCCTTTTTCCAGATGCTTTG 

GGGTGTCTGAAGGCCAGGGGAGGGGGCAGTCATGAACCCAGGCACCCAAGCCATGGCCTGCCCAAGGTTCTGTTTAGAGGGTCCAGGAG 

TAGGTTGATAAATACGCCAAGGACAGACCATCTTTCTTTCCTGTGGACTTGGGCTGGTCTGGGGCTCCTGTGTGGAGTCTATAGAGCCTGTGCATGC 

TGGGGACAGCCAGGGATTCCACAGTGAGGTTCTGCCAATTGGTGAAAGGTT^ 

CCTTCTCAGAGACCCTGCGGTGGCGGGTGCATAGGAAGAGAGGGTGCCTGGAGCCCTAGGGATGGCATAGAGGAAAAAGGAGAGAGTGGGCTTGCTA 
GTTCTGTGAACTGGAAGAGAGCACTTCCTGGGGCAGTACCAGTAGGTTAGATGGTGGTAACAGTAGCCTCTGTGTGTAGTAATGTCCAGGAATATCT
AGGTAAGGTCCCCATCCGACAAGAAATATTAGGAAAACCTGGAGGGCAGAGACAGGTCCTAACATAACATACATGGCCTGTTTGGGGGTTAAGGG^ 
GACCAGATGGCTGCTTTTGTCTGAGCCATAGAGAGTATGTATGGGTATTTTTGAGATGGGTGGTTTGTCAGTGCCTGTGGCTACGACCTCTGTGTGT 
GTAATACAGTGTGTGCTCTAGGTCAGTGACCCTAAGTGTCTGTGTACCTGTATGTCTCTGTGGGTAGAAGCTGTTAGTGCCTGAGACCCAGGCAAGT 
GTTTTGTGTGAAACCTTTTGGTGTGTGCCTAGGGGATCTGCGGCCCACAGCAGGGTATGCTGTATCTGCATACAAGTATCAGCAAGTGGCTGGTTCT
GTGATGTGACACAGAAGTCGAGAAGTATCCGTTGCTTGTGTAAAGGGGGACCTCCTGGCTTTTGCCTGTGATACTGGACATGGCAGTATTTGTGTTT 
GGACTTGTGTGCTGTCTGTGCATGTGTCTGTGTGTGTCCCTTCACCCGCGCATGC^ 

GCGTGAGATCGGTGTTCGTGGCATGACCAGCCATGGTTGGAAGTATGAGAGTGAGCCTGGGAGGCCTGGCTGAGGGCTGTGTCCGGGCCCAGCTTCT 
GATCGTGACCCCAAATGCCAGTGACATAGTTTTGTGCCAGTGGTTATGCGTCTGTGGTCATGTTTGCTGACACCTGACTTGTGGGTGTGAAGAATAC 
CTGAAAGGGGTGCTTGGTGGCCATCAGTGGCCCTGAGGGGAAAGACAGGCAGGAAGTGGAGGTCTCAGAAGGTGGCTGCAGACACAATCGGTCTGGG 
ATCCCAGCTGGGGTGGGGGCAGGGCCTTCCTTGACACCATCAAGCCTTGCTTGCTTGAGGACTGTGGCTACGAGCCTTCCCTCCTCTAAGGTCATTT 
CCCTTTCGTCTTGGAGCTGOGGCATGAATC^CCTTCATGGGGCTGGAAGTAGAGAGTGGGTAGGCCTCAAATTCTTGATTCCCCTCCTCTGCCCCAA
GGCCGGCAGATTCCAGGCCTTTGAGTCCGTGACTGCCTGCCACCTCTTTGGCCCACCCGGCTCTCACGTTCACCCAGAGGATTTACAGGGGAAATGG 
CTTAGCTGGGATTTCTGTCCACATCACCTTCAGAAAGGACCCCTACTCCCAGTAGCCCATCCCTGCTTTTCCTTGCAAGGGCCCCAACTCTGAGACA 
AGGTGATGAGGAAGGGGCTCTGAGGGGCCTCTTACTCTAAAAGGAAGCCCTGTGCTCCAGAGCCTCCCAGCCTGGCAGTGGCTCCATTGGAGCAGGA 
CCAGTGACAACCAGCCTGCTTTCTAGAACTCTCCTTCATAGTACTAGTTAGCTCCTAGTAAAGAAATCTGTCTTTCTCTTTCTTGAGGAATAAGATC
GTTCTACGGCCTTGCCCTGTGCCATCTGTGCTCCATTGGAAGTGCCCTGAAAAAGGAGGCCTGGTTGGCTGGGTGGTGTTTGGCAGCTCACTTCA^ 
CTGTGTGGCTGAGGACTGGTGATTCCTCCCCACCCCCCCTTTTGGGGGTCACT^^ 

TTGCTGAGGGCTCCAAGCCTGTTAGGGCGCCTGGGTTCTCATTGCACCACTCCAGAGCCCATCCTCCACCCAAGCACCTATCTTAAGCCCTGCCTCA 
GTTTCTCTATGTGTCACATCCTGCTGCCTCCCCTG 

ACCCCCGTGGGGGTCAGTGGGCTGGGCCCAGGGAGGCACCTGGAGACTGTGGGGGAGGAAGGCCCCAGGCCAGGTGTTAACACTTGTAACAGGCAGG 

AATTACAGCTGGCTGGGGCTAGGGCTGGTGAGCCGAGCTGGCTGGAGGCTGTTGATCCCACAGACAGTCAGTCAGTCAGACAGACAGACAGACAGAC 

AGACAGACAGATGGGAGATACCAGATCGGATGGCCAAGCTTTAGCAGGGTGGTCGCTCTTCCAGAGAAGATCTTGCCGTGGGGGAATTCGTCTGGGG 

GGAAAAAAGAATGATGGGCCCCTTGAAAATGAGTTCACAGCCCACCCCTATTGTGGGCGGGGCCTTGTGCTACAAAGGGATCGCAATTTGGTGGT^ 

GTGGTAGAAGGTTGGTATCCTTCTAATGGCATCACACATTGCCACACAAACAACAAACAAGACAACTGGTGAAACACCAGCTAGTATCAGGAACT 

GTGTGTCCTTTAGAAAAGCGGACTGGACTTTATAGGGAAGAAGGCACGTGGTTGGGGTTGGGATAGTGGTTGAGGGGGGACACAGCTGTCAGAGGAA 

AGTCCTCCCCTGCCCCCTCCCCCCTTACAATCTCCAAGTGTCTTATTAACAGCTTTGGGACCAAGAGGTAAAAGCTGGGTGTGGCCCACT^ 

ATAGATTCTGAGGTCTTGGAGTTTTGGATGTCTAAACTGCATTGAATCCTGGTC 

ACCTGCTGCTGCTACCCCACTGTAAACTCAGCTAGAGCTCTCTAGGGCCCACGCTCACTCACTTCTGGGCCTCTGCTCCAACCTCCATTAAAGAGTC 
AAAACGAGGGCCCCAGGTAAGAGTCCCCAAGGGACAGACTGGTCCAAGGAAGATTTCCCCACCCTCTCAGGGCAGAACTCACTGCTCACAGGAGGAC 
ACAGTTGCCTGGAAGGTTAGAGCAGTATTCATGGGCCTCCTACCAAAAC^ 

GTTTTGTTTTGTTTTGTTTTGTTTTGTTTTGTTTTGTTTTGTTGAGATAGGATTTCTTCATGTAGCCTTGGCTGTCCTAGAACTTATTCTGTAGACC 
TGACTGGCCTTGAACTCAGAGATCCTTCTTCCTGCTTCTGCCTCCTGGGTGCCGGGATCAAAGGCACCTGGCCAGCACCCTTCTTGAAGGCTCGGCC 
ACCACATGTTCTGTTTCTATATATTCCCTCTGGGGAGAAATGTTTCAGCCTGGAAGTGGAGAGCCACGTCTGAGTTCTGAGCTTACCATCCATTGGC 
TGTGTGACCTAGGGTGACTCTGTCCTTCTTCAGCTCCCTGCCTCTCTCTTGACTATACTGAGAGGTGTCCATTCATCTTCTTAGAGCCTCGGGGATA 
TGATGGGGACTCCACAGCAGATACTTGGGACCTGCCTGAATCAGCACACCCCTTGATTTCTGTGGCTTGGTGTACCTGCCCTAAGCCCCACTTTCCA 
CCCAACACTGCCATGCCTTCTTAATTCACTTCCAAACCTCACGTAAGGGCTCTCTATTCCTCTGTCTCTGGGCTGCTGGCAAAGTGACCTCTCCCCG 
TCTCAGTTTCCCCATAGGAAAAATAAAGTTATGGCCGCTACCCCATGAACATCAGTGATCAAATTCAGGCCCCAGAATGCTGCCGGAATGCAGGAAG
TACAGTTCAATGTTTGAACCTCGGTCTGGCCCTGAATCCAGACTTGTCTCTTCAAATAAACCCTCCGAGATTGGCCCCCATGGCTCTCATTTCTTCT 
TGTTCCATGTGCTTGGCACTGGTGCCCACAGGCTTGTAACTTCCTTCAAAATCAGAAGTTCAGCTTTTTAAAATATTTTTTATTATATTCATGTTAG 
TGGTTTCCTACATGTATGCCTATGCAAGAGAGTCGGATCCCCTGGAATTAAAATTATAGACAGTTGTGAGCTGCCATGTGGGTGCTGGGAATTGAAC 

CCTAGGGGAGGTAGGGTAAGGTGGACAGAGGTGGTCTGGGGTCTCCCCTAATCCTCCTGATACCCCACAGTCTTCTGAGCTCGGAAACCTCTGTCCT 
CTCCAGCTGGAACACAGAGCTCCCGGGCCTGGTGGGGAGGGGTAGGCTTGGGGCCCTGGCTGCTGGAGCAGCCCCAGCACGAGGTGAAGTTGCTTCT 
GGAGCTGGGGGATGACTCAGAGGGTGATTTATAGACGGAAAGCAGAAAGCTTGGCTCCTGGCAGAAGCCTGCTGCTTAATTACATTCTTTGGGGGGT 
GGAGGTATGTGGGGCGACAGCAAGGGAGAGACTTCTACCTCGTCCCCTCCGGGCCTGGTACCTGTGATGTCCTAGCACCAAATAGGGTTCTTCTTAG 
AGAGTGCGTCCCCTGTACTGAATGCACGGTTAGCCACTGAGAAGAGGAAGAAGTGGGACAGAATCATCGGAAGGACAGAGTCCCGTGGGGACCACCC 
CAGGGACAGGCTGATGGGGCACACAGGAAGAGTGGGATGCTGAGAAGGATCGCTGAACCTCCACCCTGCTTCTGCCGAAGAATCCCTAAGCCACCTG 
GGCCTCTGGGTAGCCACTCTCCACTTCCCCTTCGCGTGGTTGCCTGTGAAGACCTTCAAGTTGGCTTCTTGTCCTGTGGCTTGGGTCCTGGTAGCCC 
ATGCGTGCAGGAGACCAGGCCTGGAACTCAGAGACTAAACTCTGCGTGTTTGGGTTAGATGACCTGTTGCGTGGAAGTGTGATGACTGCTGCAAAAC 
TGCAGCGGTTCTGGTAGCATCAGCCTCGTGCTGGAGGGAAGGAACCCCACTAACCCTGCCTTTAAACTAACTACAGCTAGCCTGTCTGCCCATGCTG 
ACCAGTGTAGTGACTCCAACACAGCCCCAGATACTGAATTTCTGGGTCAGGGGTCCCAAAAGGTATAGGACAGGCAGGGGTAAGGAGGCCAAGGTCT 
GCAGCACCTGGCTGCCTGTTTGAGCTTGGTTAGTACAGCTGGCAGGACTTGGCAGGGGTCAGGATGGGGACAGCAGACAGCTCTCCAGTTCCCCCTG 
AGGCAGGCCTTCTCAAAATGGCCTCATTAATGCATGTGGAGTGAATTCCAGAAATGTCAGGTATACTCACACATCTAAAGAAGGATGGGGACCCAGA 
AATGTTCTGTGCTTGGAGACAAGAGGAGTCAGAGTGAGGGATGGTTTCCAGGTCCGGGATCTGAACTGTTGGACATGGTCTTCCTATTGCAAAGTGA 
TGCTCTGTGCTCAGGACAGTGGATGTTTGAGTTTTCTGTGCCTAGTGGGAGACTGGAACTGATTCCGGGACAGAGTAGAGACCATAAACTTTATATG 
TGAGCAGCTTTGAGGTGCAAGCTGAATTGCCAACCTTCCCAGACCTCAGTTTTCTCATCCACCAAATGGCCAATAACATGCCTTCCCTGCCCATGGT 
ACTAATGAGAGCTCAGGATGTCTGTCTTACCAGAGTCAGCTCAGCAACTACCATTGTTAGAAAGAAGACCAAGATCCCCACCCCCTACCCAGAGGAG 
GTGCAAGAGTATCTGTAGTGGATCCCCAATCTCTGAGTAGAATCTCTGGGAGCAGAGCCGTTGCTTGGGTTTAAGACCCAGTGATGGCAGGTGTGCT 
CCAGTTTCAGCTCTGGAGGTCCACGAAGAGCACATAGGCCTGATCCCCACTGAGACCCGGTTAGGAGGTCTGGGTGGAGGTGAGGGCAAGGTAGGCA 
GCTGGAAGCACTCTCTCAAAGCCCGAGGCCATCAGGAAATCCAAACACCCCCAGGGAATGCCAAGGAAACCCACAGCGGCATAGCCCTGTTCCCTAA 
CTGAGCTAGAGAGACTTGAAGAGAATGTCCTGTGTCCCTGTGTTGGCAAGGGGGATGGGAGCAACAGGACCCTGTGACCAGGTTTGTTGGGAGGGCC 
TTTCTGCCAAACCTGCCCCTGGGCGCAGAGTCCCACTTATCTATCGGAGCCAGCCGGGATCAGCAAGGAAAACCAATCTTCCAGAGCCCGCTATGCA 
TGCAAATGGGGGCTTGTGACCATGTGAACCTGCACACCGCCTCTCTCTAGTTCCTCCACACTTCATAACGTGGCTTTCCGAGGGTGTCATTCTGTCC 
CTCTGAAACCrCTCATGGTTCTGATCTAGACCCCCTTTGTTTTATTTTGGCCTCTACCTGCCTATACAACATGCCTTTCAGAACCCCTCTCCCACTC 
CTTCCTCAAACCGAACACAAAATAGTACATGAACCTCTGCATGCCTCCTTCCTCTGACCTGGGGAAGGCAGTAGGACTTGACCCCTAGGTTGACCAG
GGCTTATTTCTGCTCTGCCAAGCACAGGACCTG^ 

TGGGACCCTCCTGAGCAGACTGCCGCAGCCGTGCTGGATACTGGGATGTCTCCACCATGTCCATTCCCTTGGGCTTTGTTGCTCCTCCTCAGCCCAG 

ATTGAAGAATTCTATGTATGTGTTAGAGCTTCTTCTATGCTCACCCAGCTCTTGCTATGCATATAAAAGTGTCAGGACCTAGGGACAGGTGCCTTTG 

CATGCTGTGTGGCTGTCTTTAGAGACCTAGAAGATATTCTTCCAAGACTAACGGTGAGCAGCTGCTTrGCACAGACCACACGTGGTGGGAGAAGGGT 

TGGAGCCCACATTCTGCCTCCACTCGGGGTCAGGAGAAGACTTCCT 

NNNN^NNNNNNNNNNNN^NNN^ 

Nl^NNNNNNl^NNNNNNNNNNNNNNNNNN^^ 

NNWNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNl^NNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNhTtfNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNI^ 

NNNNNNNNNNNNNTGTGTGTGTGTGTGTGTGTTGGGGAGAAGCTATGTATGCATTTGTGGGAAGGGGGTAGCTTATTCATGTGTGCATTTGAGCATG
GAGCCAGAATACAACCTATGGTTTCATATCTTGGGCACTCTGTTTAGCTTTGGAAACATAGTCTCTCACTGAGACATTAGGTTCCCTGATTCGTCTA
GGTTGGTTGGCTAGTGAGCCTCAAGGATTTCCCTGCTTCTGCCTCCCTAGCTTTGGGATTACAGGCACACCACCACACCCAACCTTTTCCCATGGGT 
TCTGAGGAATGAGCTCAGCAATTAGATAAGCCATCTTCCCTCCCTAGCCTCAGAATTCATGCCCATAGCCTAGCCCTGAGTTGGTGTTTCTGGGCTG 
TTGAGTCTTTCCTTGACTTTCACAGCTAGACATTCAGCACTATTTCCTGCTGAGAAGCCAGAATGCTCTCTCCTTTCTTATAAAGCATATTTCCTTT 
ATGCATCAGCCATTTAGGAACCCCAGAGCTAGACTTGATACTCGATGTTTGCACCTGTGCTGACCTTGGGTGACACATTCGTCTTCTCCTTTGCCTG 
ATCCAGTGTTTTCATCTCCATTCTGCCATTTATCCA

ACCACGCTTGGTCTCTGTGTAGCGGCTACTGATGCTCCATAAATATGTTTTGGCCGTCAGACAGGACGTTGGCTGCCAGCTGTGCTC5AGTTCTTTCT 

GAAAAGAGGCAAGGCAGAGGAAAATGAGAACGAAAAATTCATTTTGTCAGCATGGCAGCAGTATGCCTGTGATCCCTGCACTCCAGAGACTGAGGCA

GGAGGATTGCAAGTTTGAAACCAGCTTGGGCTCTACAATGAGAGTCTGTCGACACAAAAGAAAGAAAGAGGGAAAGGGGTGGGTAGGGGAGAAAAAA 

ATGGCTGTTTCTGAGAAGCGGCTCTGAGCTGACAGTTTTGAGTTGGGGGAAGCGCATCCTGTATCTTCCCCAGAAACCTTATCTCTAGGGATGCTGT 

TTGCAGGACTGACGGTGCTTTAGTCTAAGCTTGAAGGTGTCATCCCACCTCTTGGGAATCTCAAACATTGTAGCCAAGGATGACCTTGACCTTTTCC 

TCCTCTTTCACTGCCCCAGTGCTGGGCTTAGCCCGCACCACTGTGCCTGGGTTTATTTGGTGTTAGGGATTGAACCCAGGGCTTCATGCATGCTAGG 

CAAGCACTCTACCAACGGAGCTATGCCTCAGGTTGGGGACTTTGTGTCTGCCATGGCTGTGCCTTCTCTTTGTTTGCTCCCCCCACCCCATGC 

CTGTGGTGTTCTCCTTGGGAGATGACTAATTGATGCTCTTGATACCACTTCCTAGGTGACACACAAGTACCCAAG^TGCCCCCTTTCCATCTCCCCAG 

CAGCTATTGCAGAACTACCACCCTGTAAAAATTCAGGGCGGCATGCGGATGAGAGAGAGAGAACACAGGAGACCACACTCAGGACCACATAGGGTGG

CTGGCCTCTCTCAGGCCAGACGTTCCCATTGGCAAAGCTGTGGGGAAGACCCCAAGTAGACCTGGACTAAGCCACATCACTGCTGTGGCAGGAAC^ 

CACAAAAAGGAGAGGAGAACACTCAATCGTGCATTGTGGTGCCTTTGACCCCAACACTTGGGAGGCCAAGACAGGTAGATTTCTATACCTTCAAGAT

CAGCCTGGCCTATGCGGGAAGTTTCGGGACAGTCAGGGTGATGAGAGACCCTGTCTCAATAAACAAAAATAGAAGTAGAGAGGAGCTTCAGGGTAGG 

GGAGCACAGCCCGCCAACCACCTGGCAAGGCTTCAGTTCACCTCTATGCTCCTGCCCTTAGTTGGCCCTAGACCCATCCCAGTCCCTTGACTTCAGC 

CAAAAGAGCATCCAAATCTACTGTAGGTGGACCTCCTVTAAGGTCCAGA^ 

CACCCTGTCTCCAGAAAGTGGGCAGCCAGGGAGCAGATTTTAATCATCAGCGAGGAATCCATCTGTCCTTTTTGAGGTTCCTGCCTCAGCCTGGTGT 
GCACAGAGGGAACACATCACCTCTGTCAGCCTTAGAAGCATGCTCTTGTTCTCCCCATTCTACAGGTGACTU^CTGAGCCCTTGAGAGAACCACTTC 
AGGGCCTAAGGCTCCTCAGGCTGTGGACTGAAACTCCAATTAGATGCTGACTGTAGGAAGCCAGGCCCTCCAGGGTTCACAGCCCCTCACTGATAGC 
CAGGAAGCTCCCTCAAGGTCTATCTAGACTTAGTCCAAGCCCAGACAATTTCATTTGGTCTCTGGGATGTGAACATCACTGAAGGGTCTCAATCTGC 
CCTTGTGATTGGATATGTTGTTCCTGGAGTCCAGGGCAGTCCTTTGGAGGTAACCTGAAAGTGCCATTAGCCCAAGTTATGAGGGGGGTCCTGCCCA 
GGTCTTTGGGGCAGAGGGCTCTGCACCACTGTCTGACAATTGTGCCCCTTTCTCTGGGGACAGGAATCTGAAATCCCAGGGCCAACACCCCTAGGGC 
CCACACCCACCCTGGTGATGCTCCAGGCATCCCAATATCTGGGGAATCCTTCCCAGATGACTGTAGGCCTCCTGAGCTCTCTGGGTTTCTACCTAAA 
GTCTGTTCTATTAGAGAAGATGACAGTACTGATGACCCAAAAAGAGGTCAGGGCAGGGTATTTGCAGGGGCTGTGGGTACCTGGAGGGTTAGCTAGG 
TGTCCAGCATGGTGAGCCCTGTAGAGGCTGAGGGCTATTTCTGCCCATGTCACCATTTTCTAGAATGAACTCCTTGGAAAGACTGATTTTGCTTACT 
ATTTAAATTTTGTATTAGTAACACACACACATATACACACTTTCTGGAAGCACATGGCAAGACTAAATCAAGAAAGCACATGTGTCCTGTGGAAGTC 
CAGGACAAGGATCTCAGTTTGAATGAAGAGTATTGACATTTGTTAACCTGAGGTGAGGCAGTTACGTCTCTGGTACCTGACAAACACTCAGCCTTCA 
TTTAATAATCTCGTCCCACACACTCTAGCCAGGATGAAGGAGTTGGATGGGGAAATGGGTCTGTCTAGGTTCTCCCAGCACAGAAATCTCGGAGAAG 
GCTTGACCATCTTGGTGTCCCCCGCCCCCAGCCCCCTCCAGCCTTGCIAACATGGGCrCACCCTAGCAGGAGAGGAGCCCCTTGTTTAGCAGCAGTAC 
CAGGGACTGTCCCAGCTGTGTTCCTCTCACCTTCAGGGCCCTCTTTGCTTCAGGGCCTCATCTTGCCACTCTCGGTCTCAGAGGACCTTCCCTGTGC 
TTGTGGCTTTAGATGATGTGACCAGAAGGGCAGCGACTGCATCAAGGAACACACCCACCTGACTGTAGTGTTCTTTAACCCCAAGGCCGACCTTGAC 
CCTGCACCACCCTGCACCTTCCAGCCCCAGCCTGGCTCTCCTCCTGGCTACTCTCATCTCTAGGATGACAGTCCCGGGCTTCCATCTATGTAGCTGG
GTGACCTTACTTTCTCTGAGCCTCAACTTTTCCCTCTGTAAAATGGGGATGGCAAGGATTAAAGGATGTAACATTATATGAGACAGACAAGGTCTGG 
CGTCTACACAGTGAGTAATGCTCCCATTTGCCATCATTAATCTCGCTGTCTATTTTGTTGTTCTCGCCATGCATCACCGCTCTGCGCTTTTGTGCTC 
GCCCCTGCCTTCTTCTGGGGAGGTTCCGAGGACTCGTGTCCGCTCTGTGAAGTGAGGTTGGTACCATTGTCTCACCAGCACCTGCTCCACCTGAGCT 
CCTCCCGGGGTCTCAGTGGGACTCCTTCAACTCTACCCACCTTGCCAGGCAGCACGGGACACCACGCAGCATCTGGGGACAGGTCrTAAGCTGAGGG 
GCACTGCTGTGTGTGCCCTGCTTGGCAGAGGCCCAGTCTTCCTGCTCCAGGTGTCTTCCTCCTCCCAATGTCCTCCTCCTCCTCCTCCTCAGGAAGA 
CTGACTGACACTTGTCATCTTTCTTGACTTCTCAGTCACATCTCATTGGTATCTATCTTATCCTTCCAAAGAGGGGACCAGCTCTTGAGGACATGAA 
TTCTGCTGTGTGTCTTGGTGACCCAACTGAGCTTCATATAGCAACTCTGGGGTGCAGAGCACAGCTTGGGGAGCACTGTCGTTCTAGAGATTGTTGA 
GCCGGTGACTCAGGACCCCTCCTCAGAAGCTCCGGTGCGGCTTTCGCTGCTTCCCGTGAACTGGATAGCAGCTCCCCCTCACCCATGATCTCAGCAG 
GGCCCATCCTTGAGCTTCTAGCCTTTCGTGGACACATAGCTCTCTCGCTGCATTTGAAGAAGAGCTGTCTGTGGAAAATGGCACGTCTCAGACTTGT 
TGGCTGGTGGCTCTCAGACTTTCAGAGCCTCCATTTCTTCTATGTAAAACGGCGGCGGGGGTGCCGTGTTGTCAGGAAAAGTTGTGAGGTCCTGCTA 
CATAGGGGCTGGGCTGACAGGGACTGATCATGGGTGGTATGCAGGTCATTCCAGTCCAGCGAGGGTTGTCCAGGGACTCAGTACCGGGAAAGAATGC 
AGGTCATCCTGCCCTAGAGAAAGGGCCACGCTTGGGGAAAGCTATACGCCTTGTGATGGTGTGCCCCAATTATCACCTTATTTGAAAACCAAAATGG 
GAGTGGCTCAGAGCAGGATGCAAGTGATTCAGTGAGGACCCTGGTCTGACCCTGTGTCCTGGTGTCCTGCTCAGCTCCTGGGAAGAGGTAGGCAAGT 
CAGATGTGGTGGCTTCAGGGAGTTTTACTGGGGCTCCTGGGCTGGCCTGTGCAGGTTTAATATGGCTGAGGCATCGGTCATTTCATCCCTGAGAAGG 
GTGGTGAGTAAGCCTTGCCTGGGAGGCTGGGGCCGCTGTACACATCTGTCTTTAAGGGAAGAGGGCCCTGCTGTGCTGCTTAAAGTGTCACAGACAA 
GAATGGGTGGCCTGTGTGTGACTGATATGCATCCTACACAGACACCAGCTGTTCTGCTGAGAATTGGAGAATGAATATGCTTTTTATTATATAGTAC 
TTTATGCACACAAAGCAAACACATCATCTATGGGATGTCATAACAAGATGAAACCTTGTGCTTCCCCACACCCGGAAGAACCAGAACATCAGCATCT 
GTCCTTAGGCTTCTCCCTCCCCACCCCATCACCATCTCCT^ 

GTGGGGGGAATGTCTCTCACCCAGTCTGGAACTCACTAATTAGGCTAGACTATCCAGAAAGTCCCTAAAGATTCTCCTGTTGACCCTTCCCCAGTGC 
TGGAGTGTTACCACCACACTTGTTTGCATGGCTGCTGGAAATGGAACTCAGGTCTTAGCTTACAAAGCTTACTGATGGAGCCACCTCCTCAGGCCCT
GCACAGGGATTCCATGACTGAGATAAAAAGCCGGCTGGATGTATGGGCC<2^ 
AGTTGGAGTTCCAGGACAGCCAGGGTTACACAGAGAAACACTGTCTCAAACAAACAAACAAACAA 

CCTGAGATAAGAATGCCCACAATACATTTTTCACTTTGCTTGTTAAAAAAAAAAAAGAGAGAGAGATAGTTTGTGTGTGGTCT 

TTCTTTGTCTTATTTTCGAGATGGGGGGGGGGGGGTCTTATGATATAGACTTAGCTGGTCTGGAATTTACTATGTAGACCAGGCTGGCCTCGAGCTC 

ACAGAGGTCTCTGCCTGCCAAGAGCTGGGATTAAAGGTGTGTGCCACCACATCTGTCTTTGTTCTTTCTATTTTTTGTTTCTAAGTTCCTCATTGTT 

CCTGAGTCTTTCTATATTTCCTTCATTTTCCCCACTGTAGAGTAGTCACTGTGTCTGCTTCCTTCTTTTTGTGTACACAGGAGTCTCTCTGTGCCCA 

TGGAGGAAAGCTCTCCCTCCATACACTTGCCTTGGAGTGGAGCAGT^^ 

TGCTCCTATCTCCTTCCTCTCTTTCCTCTCTTTCCTCTCTTCCCTCTCTTTGCCCATGATCTTCCAAAGCTTTTTCAAACTCAGAGCCCCCGTTGTC 

CCAGGTCTCTTACCAGCTTGGCTTCTGCCTCGCCCCCACACTGCACATCCTCCAGCTCTTCGGAGCATCATTTGGATGGAGCACTAGGTCTCAAAGT 

GTGTCTCATCACACTGGTCTTATGTGATCCTGCTGGGAAAGGCTT^ 

ACCAGGTACACCACAGAGGTGACCCTTGCTTTCATTTGTCTAGT 

TTTGGTTTTTATGGTCTTTATTTTCCCAACTGGGAGGCTGCGATTTTTTGCAATGTATATTTTTTTTAACATTCTGCATGAATTGAGGAAGTTTA^ 

CTAGTATCTCCATCGCTTGGGGGTCTCGAGATGATGAATGAAAAAGCATAAGTGGTGAATGAAGGCAGCAGTTAGAGAAGCAGTTGGCTCGTTGCTC 

TGTGGTCCAGCTGGGTCTCTTGTCCTGAGGAGTCAGAAGCTATGGACAACTTGAGGTTGTTTGGCCCGTGTCCAACCCTGCCTACAGTGTTTCTACC 

CAGCCTTCCTCCAACACCCACGTGGAATGAGTCACTCCCATTAAGCCTCCTGTCACTGGTTTGGGGGCATCTGTTTCATAGGGAGGCCTGAAACCCT 

TGCAAGGGCAGAGGCTCTGTTCCATTTGCCCCCTAGGTTCTCAAGAGTGGCTGCTTACCCTCTGGCTGGTGGATGAGTGCAGTCGGAGGAGGTCTCC 

TGTGTATCCTTGCCAGGGACTAGAGGGTACTATGACAGTCTGCAGGCTCTGGAGTCAGACCAGATGAGCCCTCATAGACCTGCCAGTCTCCACGCCT 

CAGCTTGGGAACTGGGCATCTTCTCTTCTTTCCACCCACAATTACTCTACTGACTCTGGGAACAGGGTCTCACTTTGTAGCTCTGGCTGGCCTGAAA 

CTCATTATGTAGACCAGGCTAGCCCCAACAATTATGACAACCTCCCGCCTCTGCCTCCCAAGTGTAGTCATGTTCCACTGCCCTGGCTCTTTGTTTG 

GTGGAGTTCTGTTTGTTTAAATTGAGACAAGGTCTCATGGAGCGTGGGCTAGCCTGGGACTCACTATACCGTGGAGGATAAACTCTGATCTTCCTAC 
CTCTCTCTACAAGTTCTGGGATTAAGGGAGCACACTTAGTTAATGTGATGGTAGAGACAGAACTCAGGGTTTTGTGTGCATGAGGCATGAATTGAGC
TACAGCCCCAGACTGTTTGTCTGTTTTGAGACAGGGTCTCATATGTAGCTCTGGTTGCTTGCCAAGTAAATAAGACTTGCCTTGAACTTGTGGTGAC 
TCCCCTGCTGCAGCATCCCAGATACTGGGATGTGCAGGTGTGCACCACCATGTGTGACATTGCTGGGTTTTATTTTGATCCTTTGTTCATCCCTACA 
AACTAGAACCCATAACAGTGCTTGGTTGTTTTGGGGTTGGGAATGGAGGGAGCGAACCTACAGGTAGGAACTTGGGAAGCCCAAGGGTAGCCAGGAC
TTTCATGGCTGACCCAGGAAGCTCAGTATCCTGAGACCAAGATCTCTTTGACACAGCGTGGCTAGAGCGTCCTGTGCTGTAGCAGTGTCCTAGAATG
GAGGGCTCACAGGGCGTGGCTTACTGCCGTGGTCCTCAGCCTCCTGCACAGGATGGTGTGCGCTGGGAGATGCTTGCACACAGAACATTTGAATCAC 
TTTTCCCTGGGCTTAGAGCGCCAGGAGGTCAGATAGCAGGGCCTGAGGATGCTAGAGGCCCACTTACCATTCTCAGGCATACACCACTTGGCCTTGA
AGTCGGGGAAGCTCCTCAGACCTCATTAACCTCCTATTGTAAGGAGCCTCA^ 

AGTTCTAGATCTGTCTTCCCTGGCCTCCCAACAGAAGAGATAGATCGTGGGGCACTGCAGACAGGCTGAGGTGAACGTCTGATGTGACACACGGATG 

CCTGGGGACGTTCACTGGGGAGCAGAGAAACCTGGGCATTTATTTCATCCCAGGTAACGAGCTGTAGTCCCCTCCACAAGGGAGAACAGAGCAGGGG

TGGTTCAGAGGGGTGGAGGCACGTAGGTGAGATTGCCAATTGGGTGTCCAAGAGGCAAAGATGTCATTGAAATGACAGAGCCAGAGAATCTAGTAGA 

GGGAGACTGAGATCTTTGCGGCAGAGGAAGGAAGACATTCAAAGCAATTCACAGACCACTGGTCTTCTGAGAGCTCGGGTAGTGTGAGAGGGGAAGG

GGAGATGGAGGTGGTGGTAGGCTTTATCCTTTCTGCAGCGGTACAAGCTAGAACGCGTGGTAACAGGGTGGTATGAATTTCCCCAGAGGTGTGGAAG 

GCTCCTGATTGGTCTGATTCCTCAGCCTCCTCGTGAGACACTCC^TCAGGTTGGACCTGAAAGTACTCAGATCCCTGCCAACCGCCATCTTTCAGAC 

TCACAGACAGTCTCCACATTGCTGCCTGAGGGATCTGAACTCAGTAGCTGACTGTCACCCCTGTCTCTGTCCACCTTGACCCTGTGTGTTGGTGTCC 

TACTTACATCCCTATGTCCAAGGCTCCTTCCTTTTGAGAGCAGAGAAATGTACACAGAGCATCCCCTTTTACCTCCTTCCTTCTTGGTCCAGGTCCC 

ACCCACTGGGAATACCCTGGATATACCTCTTTTGCCCCCAGTGTGTGTAATACATCAAGGGTTACACTCGGTTGGAAGGAAAAAATGATTTTAAAGC 

AATAACAGCAACAACAATGCCAAGTGTGGTGGTATCTACCTGTCATCCCAGCATCTCGGAAACTAAGGCAGGGTAATTTCCATAAATTAGCATAGCC 

TGGTCTGCATAGTGAGTTCCAGGCCAGCCAGAGCTACACAGTGAGAACCTATCTTAAGGTAATATAATTTAATATAATATACCACTCCCCTATGTGT

TTTGCAACCCATCCTGCCATGTGGCCCCTTTCTGTGGGTTTTCTCAGCTCTGCTTGGAAGTTGCCTGTGTCCTTGCTATTGTCCTTTAGTCCCTGTG 

TGCATAGCCCTGTTATTAGATTATACAGTCCTCCGGGTGAGGGGCCATGACAGCGTTGAGTTAGTGTGGGACACATTTTGGGTGCAGAATAAGTGAA 

TTCTGCCAGAATGAGTCATTAGCGGAGAGTGGATGGAGGCCGTTAGACACAATGGCAGCAGCATAGGAGGCGAGATGAAGCCATTCAGACCCCTGTA 

CCTTCTGAGTCTGCCTTCTCGTTGGTGCAGTGCATTAGCCTGAACCCCTTGGAGCTTTTGAAAGAATAAATCTCAATGAGCAGGTTCTGGAAAACAG 

CTTTGCAGTCTCAAAGGTGGCCAGCTCCCCTGAGAAGCCCACATCCACACTGGTTCCGAGAACCCTCACCTTCTGGGCTTGCTATCTTGGAAGAGGC 

CCAGTCACCTAGAATTCTCTGTGGTTATTTGAACCCATGTGTCAACCCTTCTGATTCTGGGACAGAGCTCTTTTGTCTCTTGAGGATGCCTGGGACA 

TGGCCCTGCTTTGATACAGAGACATAAACATCAAGGGTAGGAAATCCACATCAACATTCAAGAGAGAAAATACCAGTAAGAAGGAAGGCTCCAGTGT

GAGTCCAGCACTCAGCCTGAACAACAAGACAGGCTGGGTGTCTCAGCTCTCCACTGACACACTGTAGAGTCTCCCACCCCAAGAAGTCCCTTCTTGC 

TTTCTTAGCAGCCTCAGTATCTCCATCACCAGCTGTATGATCCCAAACATATTACCCGGCCTCTAGAACCTGTGTCATCACCTGTCAAGTGGAGATA 

ATAATGTCGTTGGGTTCCTTGGCAAATTAAAGGATGTATTTCATGTAACGTGCTTTATATGATACCGGGCACAGAGTCAGGGCTCAATAAATGCTGG 

CCTTCACAAATAATAACCATTGTGAGTGCCCTGGAGATGGTTTCACCCCCTCCACTCCCTTACACCAGAGAACAGGAAATCATTTCCTCACGGCCTT 

CTTAACAACTGTCTCCCCAAATAAGTGCACCCTATATTATCTTATTCACCCTGAGTGTGCACCTGTAAAGCGGCCCCTTTATTCCATTTATAGCTGT 

TTCCACAAAGAGGCCCCCATTACCTCATCTCCCTCTGTTTTGTGGGCTCCTCCTGTCTGTGGCCACTCAGGGCCAGTGTGGGTTCTGAGGGAGCCTT 

GGCCACAGGTCTGTGTTGGCATCTGCCCTCACGCGCACACACACACCTTTGTAGAAAAGTCTATAATTTCTTCAAGCTTTGTGGCACATGCCTGTAT 

ATCTGTCAATTCAAAAGACTGAGACAAAAGGATAGAAAGTTTAAAATGAACTATATGTCTTTAATCCCATCACTGAAGAGGCAGAGGCAGGTGGATT 

TCTGTGAATTCAAGGCCAGCCTGGTCTATATTGTAAGTTTTAGATCAGCCCCGACTACATACATTCTCACCAAAGGGGGTAAGAAAATGAAAGCTCA 

GAGTGGGTCGGAGAGCTTAGTAGGACTATCTATCTCTATCTCTATATCCATATCTATATAATTGGAAAGGGCTGAGGAGGAATAGCCTTGCAGTTCA

GTACTTGCCTAGCAGTGTGAAGTCTTGGGTTTAATCTCCAGTACTGAAAGAAAAAAAAAATCTATTGGAATTTCTTTATTTAAATGTGTGTGTGTGT 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGACTTTTGTATAC

AAAGCTTGTTCTTCTCCAGAAGAAATACAACAAGACATCAGACTATAGAAGCAGAAATGCCTGAGGTATCAGCTGGGCCCTGCAGCCTCCTGGGGTG 

ACAAGATTGGGCACAGTGAGAGGACATGGGTGAACAGGGAGGGCTCAGCCTGCAGAATGTATCCTTTCCCCGCTGGGTTATGCACAGGAACTCAGAA 

AGATGTGGGTCCCTGTAAGAGGCACATCTGGAGACCTTGAACCCAAAACTTCTGTGGACAGCCAGCATGTCTGGGAGCTCCTGTACTCACCAACATG 

TAAACCCCACAGCAAGACTCAGCAATCTGGGTGTAGAAATTAGGAGAAGGGAAAACTCTCGGAGGAAGAGAGATGCAAGGAGGAATGTGGGCAAGGG 

GGTGGGGTCCCCCAACAGTGCCAAAGACAAGGTTTCTTTTGAAAGTTTTTTCATGGTGACCTTCCTTTTCGAATGGATGTTAGTTAGCGAGCGGTTA 

GC^GTGAGGCTCTTAAACTCCAAGCTTGGAGCCGCCTGTGATTTATACCTGGCCTATGTAGATGAGGGACATGCCCAAACACTTTTCGCTCATACTG 

CTGATCAAAAACAGTTTGGCTGCTGCGGCTGTCAAGATGGAGGCTTGAGAAACCTGGAGAGCTAGAATGAGGAAAACACCAGGCTTGGGAAGAAAGG 

AGGCTGATTCTGGGTACCAGGATGTACGGCGTAGGGCGGAGCAGATAGGGCCCAGCGCTGGAGCCCCATGAGCGGTCCTAGTCTCCTTTGGCTCTTG 

GCTTGCTCCTTGAAATGAAGCGAAGACCCACTGAGCACCCAGACTTCTCCATCCTATGGTGCAGACATCAGACTGTGAATTCTGTGTGTAGCCAGCA 

TTGAGGCTGAGCTGCCCTTCTCTGAGTGCTAGAATACCTAGCGGGAGTCCCGCTAGCAGCTAGGAGTGTGGAGTGGGCACTCACAAACCCAGAGAAG 

AAAGGCTAGGCTTGGTGGAGAAACTGGAAGCAGGAGCCCAAGACTGGTGGATGTGACCGTGTGGGTGCGGGGCAGGGGGTGCGCAGGAAGGTGTTGA 

GGCTCGCTCCCGGGAATGGGCCCGGGAGGGAGGAGCAGGCAGCGCTTTGAACTTGCGCTCAGCCTCCAGAAGGAGGGCGGGGGAAGTCCAGGTGCCG 

GCCGGGCTGCTTTCACCTGCGGATCCTTGGCGCTTTGGGGGACGAGGCGCAAGGAAGACCTCCTGGGAATGCAGTGGAAATACCCACCTTGAAAGGC 

AAGGAGCGGGAACATAGCCCTGAGCGCCGGGTGTGGCGCTGCTCCCGCCCCCATCCCTACCCGCCTCACGCTGCAGCGCGCACCTGCTGCTGGTGCT 

GGTACTGGTGCCAGTGCCCATTCCAGGGCCTCCCCGACCTGTTGGTGCTCCCAGCATCTGAGCCATCAAGGTTCCTGCCCACAGGTTTCATCTAGAA 

GTCCCCACAAATGAGTTGGGAGTTTTCTATTGTTCTTCAAACTTTTATTATGGCAATCATTAATGAAGGCTGTGGATCTTGCTTGGAGATTTAGGGC 

TGGCAGAGCATGCTACAAGGGCCTGATGTTCGTTCATCATCACAAAGCAGCCGGGGAAATGAACTAGGAAAGTCGAAAGGGGGGCAGGAGATGTCGC 

TCAGTTGGTGGAGTGCTTACCTCAGCATACCTGAGGCCATGGTTTTGATAAACCCCATCCTACTTGGATGGGGAGTCAGAAAGGTTATAGCTAGTCC 

TGGGTCAGTCAGCTTATTCTACATAGCAAGTTCCAGGCCAGCCAGAGCTACATAACGAAAAGTAGAGGAAAATAATATAAGGAGCAAGAGTTTACAC

ACCTTATAGTGCCACCATTTAAAAATGTTTTAAACGTGCACGTTTGTGTCAAACAATTACAAAGAAAATTCTATTCCGCATGAGCGCTGAGATTCCG 

ATATGTAAGGCTGCCCCCACCCCATCAAACCGCCATGCTGGTTAACAAGACGTCTAAGAATTCCCGATGATTCAGGTTGCCTAGGTGTCCTCCTCTC 

ACACTGGCTCTTGAGAGAGCTATGGTTTGCAGTCTTTGAT<2ATGTGACCTGTT^ 

TCTGTCTCTCGTCTTCCTGACAACTGAGGGTTACTTAGGCGAACCGTCTTTGGCTGGTGAGCTTTATGGAGGAGGTTGGGTACCCCACATTGCGCCA 
TCTCCAGCCATCCAGCTCTCCTGCTTCTTTGGAGGTGTTTAATAGGGTTTCTGAGTCTCTTTCCCAGATGGGCTGGTCCAGCGGGTTAGGACTGGAG 
TCCATGTGTCTGGGTTTTGCAAATCTGCCCCTTTGCTGGCTGTTTAAGTGTCTCACCTGCCTTCCTGGCCTCAGCAGTCAGCTGGGCAGTGCTCATC 
CTGGGCCCTCCAAAGGGCATGGGGAGGGAGGTGTGGGAAAAGGTGAGTTCTCATGGCTCTGGCTCAGCAGATCTGAATGTCTGGATCTGGGAGTTGG 
TAGTTCTTATAGATTATAACCACGCCTATAAACATCTGTATGCTCTC^C^TCTAGCACACAGCTGGCCCAGTGTAATTGTCCCTGAAATGAAGATC 
TCCACCAGGCTGGTGGCCCTCAATGAGCCCCTAGAAAGATGGTTTCTGGTGTGAGCTTCCAGCCAATAAATAGCTCAGCTTTCTGCATTTGAAGACT 
CAGAGAAAGACTTCCCCAGGGTGTGCAGCAGTTGTGAGAAAAGTCCAGAGAGCCAGCAGGGTGTCTTCTGCTCTCAGCTTGTCATGGCAGTAGCCTG
AGTGGGGAGGAAAGACTGGGGAGGTCAGTGGGAGCCTCTGGGTCCAGTCCAAGGGGGTCCTGGGCCCATTAGAAAAGCAGACTCTCCACTTAAGTAG 
TCA.CCGGTGAGGGAGGGAAGTTGTCCTCTGAAAAACCTCAACTTTTAGTGGAGAGAACGGCTCTGTAGACAGCTCAGTAGAGAATGTTCAAGATCTA 
TCTTCTTCTGACAAGCTGAAGAGATGACCAGGAGGAACAGTTTCAATCCAGAGGGGATTTTTCTAACCTCAGATCTTGAGGGGTGACTAGGAGCTCA 



CCGTGAGATGAATGGCCAGGGAAGCTGTTTTGTCTTTCACCAGAACTTGGCTTGAAGGGAAAGCTCTATGAGAACTGGGAATGGTTCTGACCTAGGA 
ACCAGCTGTCTGCTTTCCCAGAACCATTAGCCAGGCCCTGCCAAGCCCTCACTTCACTGCACGTGGGAGACTGCCCTGCCGGAGAGGGTCAGCAGGG 
AGGAAGTGGTACGCCGCTCAAAGGAACCAATGAAGCACACTGGGCTTAAGGAACGTTTATAGTTCTGATTGGCAGGTCAGTAGGCAAAGAAATAAAT 
GGCAGAGCCCGGAGAGAAAAGGATGTGACCTAATCAAGCTGAGCTTGAATTTCAAGCCGAAAGCACAGGCCAGCCAGCTCTCAGCCTCTCTCCCCTA 
GGCACTCCTGGCTTTTAGAATTCCTACCCCAAACGGGACTCCTCACACTTGGTCAACTACAGATTGCCTCTTCTGCGTAGAATGGGAGGCCCCATCC 
AATGTATACAGAGCAGGAGCTGTGCTGGTAACAGAAGGCGTTCCAGTGTGCTGACGGAGGCTCGGTGAGGCACAGAGGAGTTCTGTGAGGTACGAGG 
AGATGGGGCTCTGGCTCGAGGTTCAAAGGCCTGGACCATCCTCCTTTCACCTTGAACCAGTGAAGGACATTGGATGAGTTACATCACTTAAGGGGTG
TGTGTGTGTGTGTGTGTGTGTGTGTTGAGAAAAAGAAGTTAATTGTCCCACCAACCTTTGGCTT^ 

CCTCTCAGGCAGCTGGCTATGGGACCATTTACAGGAGGCTTCGAGTGGCTTTCCAGGGCCTCTTACATTCTGTCTCTCCTCCAGCAAGACATAGGCC 
AGCCAGGGACACTTTTCAGGCAGAAACTTCTGCTCAGCAAGCCTTTATTATTGATCAACCTTCACGCAGAAACCCCCGTTCTAACTAACCATCGCTT 

CAGGGAGGGTGGACATGCCCCCACTAGAAAAGACTAAATGCAGTCTAGCACAGTCATTTGCAACTGCAAGCAACCCAAGCAGGCCTTTGATCCTTGA
TCATCTTCCCTGTGAAACAGAGCCTGCCTAACTGTGGCCTCTGGGGCCACAAAGGCMGGATCAGACCTGCT<^GTGGGGGTGGGGAAGGGAAGAAC 
ATGGGGACTCACTTGGGAACTGCTTGGTTAGAGGAAATGACTCAGTCCTTGCCCTAACCTCCTGGAA 

AGAGAGACCTTCTTGGGGTCTGAGGTGAGAGGCCCCTGACCTGGACCTGCTATCTGGATGTTCTGGGCTGAGTTTGCTTCTGGTAAGTCTTTCTAGA 

GAAGGTAGACTAGGCCAGGCTGCAGAAGCCCCTGTCTCCTTCTTTCTTGCCTTCCACTCTTTCCCCTTCCAATTCTCTGGATCTCTCCCTCTCTTCG 

TACAGTTAACTAATTCCCTTACATCCTCTTTAGGACCCATCAGCCTGCTTCATGCTCCAACCCCTAATTTAGCCAGGAGGGACAGTCCTTCCTCCGT 

TCTCCACTGTGATCAACTCCCCCTTCCTATGAAATGGGGTGGGGGTGGTGGCTCATG 

TTTCAACATTTTTCAGGCGGACTTGTCACACATCTTATCAGATGAGAGCCT^ 

GTCACGCTGGGCTCCCTTTGGTGGAAAGGGTTCACCCAGCTTCTTAGATGGAGGCAGGATTGATACCACTTCCCTCTGATAGGGTCACCCCTAAGAG 
TGACCCCCGACTGCATGTTTCTTTCCATATCATCTTCTTCGTCCCATAGATGCCCAGAATCCCGTCTCTTTGGGGTCCCCAGCAAGTGCACTGCTTC 
CTATAAAGGTTCCACTGGGCCCTGAAAGGAGCTTCTAGAAGTATCGGGTGAAGATGAGTCTCTGGGGTTACTTCCAGAGTGTAGACAGTCTTTTGGC 
CAGCAGAGCCAGTAAAGTTCTGTTGTTTGAGCTCCACCGGGACCTCAGGAAGAGAGGAAGAAGGTCACCCTGCCCTGCTCAATGGCTGCTTCCTTCC 
AGAAGTAATGGCTTTGACATTGGATTTCCCTCTTCACGCCTGAGAGTTTTGCCCCATGATGTGGGTTGCCAGCCTGGTATCACCCTTCCTTCATGTT 
CTCCAAAAGACTCTGCTTCCCTGCTCCTGCCATGATAATGATAGCCTTGGCGACGTGGCGAAGACAGCCAAAGCCCAGAGGCTCAGAGTTCAGTCTG 
AGAAGCACACCCGGAGGCTGGTCCCCAGGCCACAGAGAGCGGCTGTAGCCACTCTCACTGGGTGGGAGGCCACCCTGGGGGCAAACTCCACTCTGTT 
TGTCAGAAAGGAGTCTCCACGGGAGGACCTGGCTGTGATGGCAAGAGCTCTCACCTAGGGAGGAAACAGACAAGGAACAGAGCAGAGGGAGAGACAT
TGTGATGGCTGACCGCACGTGCAGGCTGTTTGCCTCTGTAAGTGCTGATGAAACGGGAGCCAGGAACACAGGCTTCAGCCCTAGCCGTGTGAGGTTT 
GGGCACACCATGCTGTTCCGGCGTCACTCCTCCCTGCCTGGTATATGATATCAACAGTCCCAACCTGAGATTTCCAGCTAGCTCAGCTTTCCCGGGG 
ACCTTTTGGAATCTTCAGGTCTAGAGTGCCTTTCCACTGTACCTCCTGAGCAGCTATCCTCAAGAGGAAATGCCTGCCTTCTGGGTTCCTCTTTGGG 
CTCTGCACTTGGCCATGGTCTTCAGCTGTCACTGTCCCCAAGGGTTCCTTTTCCCATTGTTTCTAGCCTCTTACAGATGCCATCTCCATGGTCACCC 
TCGTTCAGACCTTCTACATCCCTTCTCTTTGTCCTTATCCACAGCCTTCTCTCTGTTCCCTGAAGCCTTCATCTTGCAATGGCCAGTGGGCCCATCC 
ACCGCCATAGCCTGTTAGCCTCTCTGGCTCACCTGAGGCCAGCGACACCAAGGCCTCTGGTGGTATCGATGTGTTGCCAGGTCCTCGAATCTCCTTA 
GTGCCAACTGTCACACCAGCCTTCCATCTGTTTCCTAGCTTTTGATCTCTAGTGCTGCCCTTGTCCCATTGTAGGTGGGAGATAGATCAACAAACCA 
AAGTCCAGCTCTGGATGCTTCGAGCTCTGCCAGCTTGCCCAGAGTGGTATAAAACCTGGCAGGTGGGGCCCCTTGCCAGCCCCCCTCCTCCTGAATA 
GTCCAAGACGTCAGGTGAGCGATAAATGGTAACATGGGTACACCAGCTGTTGTCCCCAGCCTCCTAGTTCGAGGCTGCTCTCTGTTCCAGGTGCTTA 
AGTGAGCCCAGCTTTAATCACCAGGTGTGTAGCTGCTCCTGATCACAGCAGTGGGGACTTACTGGCCCCACCCTGGGAGACTCAGTGTTCTTCCCTA 
TGCTATGGTAACGATGACTGAGGACATAAGTGACAAGTAGCCAAAGATCATCTCCTGCTCACGTGTGGCATCCACAGTACCCTGATCTGGTAGGTGC 
TTGAAGAGGGCTCGTTGGCCTGCCGTTGACAGATGTCTCCTGGCCTGGGTGCTCCTCCGTGGGGCTTCATTCATCTCTGTGTTCCTAGCACCTGGAG 
CTCTGCCAGGTTTGGAGTCATCTCTCAGTAAGCGATGTTGAGTGAGCAGTCGCTGAGTGACTGTGTTTACCAAGGAAGGACTTGAGCCCAGCTGCCG 
AGACCAAACCCTCCAGTTGCTTATCCTCGCAGGCCCAGACTCAGGCAAACTGGAAACTCAGCCCTGCCCTGGGCTCCACCCTTTTCCCTTCAGCCCC 
TGTAATGTGAGCTAAAGTTATCTCTGTTCCCGATGGATCCAGAAAGGCAGATAGATATAAGGGATGGAGAGAGGTCCGAGGACTTCCCATGACTGTG 
AAGGGCAGCATCTAGGTTCCCTTTGTGAACTGACAGCCATGGTGTCACTCGGAAACAGCCCCTTCTTGGTCATACGGTGCTGGGAAAGGTGTAGTTA 
TGTGCAGAGGGCCCTGTGTGTTCCAGATCAGCATGGATTCTTAGGAAAGGTGGCGGCAGATGCATCATGGGATTTCTCCCTCCTTGCTCTTTATTGA 
AGCTTCCTGCCATGGTGCCAGGGCTCACACAGGGCCCGTAGAGACTCTAGAAGCTGAGAGAACCCTGGCCCACCAGACTCAGCTCCTTCCCAGTCCT 
GAGCTTTGTCCTCTCCCTTGTAATAATCAGCAGCTCTAATTTCCCTAAGTGGAACGGCGATACTTGGGTTCTTTTTGTCCATTTCTCATGATTTTCT 
TCTGGCTTCTAGAAGTTTCCATATTGCCCAGGAGGTAGAGTTCTACTGGACCAAAAGTGGGTTGGGCTTCTGTCTAGCATTGCCTAGAAGTCAGCAG 
ACAGAATACCTGGTTCTGTGCCCTTATGGGGTTTTGGGAAACCCTCTACCTTCGTCTCCCTCCCAATGCCTGGGCATTTTATTTCATGAGAGCCTAG 
GCTTTTGGATCAGAATCTTGTGGTTATCTCTGAGATCAGGTGGGTGGCTACTTATCACTGTAAGAAAGAGAAGGGCAAAGAATCCGGTGCTCCCAAG 
AAGAGTAAGACTCATCCAAACTCATCCCAAGACTAATAGATGAGCCAGAGACCTTCTCCCTGTGCTGTGGCCTGGAAATACACTCTGTCCACTCTGC 
CAAGTCCCAGAGAAGCTTCTGAAAATGTGTCTTTATCACACCCTGGGATTCACAGGAAGGGACAGAGAAAGAGCTTGTTGGCATCGAGGCAGGGGTC 
GTGGCCATGGCTGCTGACCTTTCCAGGCAACTGAGCTTCCCCGTGACCCTAGCTCAGACCAGTGCCTTTCTGAAAGTTCTCTGAAATTAAGGGATTC 
AGAGTTGAAGAGATGGCTGAGCGGTTGGGGACTGAOTGCTTTTAAAGAGGACCCA 

CTCCAAGACCTGACACTCTCACGTAGACATAGATGCAGGCAAAACACCAATGCACATAAAATAAAGGTAAATAAATAAAAACTTTTATAGAAAAAAA 
ATTGAGGGATTCAGGTAGACTCGGTCCTGGTCATTTCAGCTATTACTAATTGTATAAATCAGTGCATTAGGTAAGTTCCAATGTCAGGCAACTACCA 
CTATTATTTCTAGAACATTCTCTTCATCCTAAACAGTCATTACTGTGGAGTATGAGCCTACGCCGAGACTGCTGGATCCTGTGGTATATAACAAATT
CTCTTTTTAAATGTTGTATATAGAGGAAAAGGAGGGAGTTTGTGCTTGCATCCGTTTCCAAACAAAAATAGTTCACCATTAAGGCAGCCTGAACTTG 
GTGACACCCCCACCAGCATTCACCATCTCCCTAAGCTCTAAGAGCCTGTTGGGAAGCTCAAAACTTAAAAGCAAACAAAACAAGCTAGGCTGGGATA 
CAGAGAAGGCTCCCTGGTAGGATGAGCCTGGTGTAAGCAGACAGAGCCAGCCTGATCTCAGCAGGGACTCAGTCATGCAGAGGGATGGCAGAGTCTG
TGGGTCTCTAAGGCTCCAGGACACCTCTTTGTAAGTCCCTGCAGCAGCTAAGCTGCCCTGAGTTCTCTGACTAGAGATGCTGGGGCTAGAGATGCTG 
GTAGACTGCCTTCTCAGCAGGCACAAAGGCCTGAGTTTGTTCCCCAGTACTGATTTAAATACCATAAAATCGTGAGAAATGGACAAAAAGAACCCAA
GTATTGCCTTTCCACTTAGGGAAGTTGGTACTAAACTGGATGTGGTGGCCCGTGCCAACAA 

TCAAGATCATCCTTGGCTACGTAGGGAGTTCAAAGCCAGCCTGGGATACATGAAATCCTGTTTCTAAAACAAAAACAAAAC 

TCATGAAAAGTCTTATATATGTATATATACTTCTACATTTGAATGAAGTACTAACAAGCCACGTATGCATTCTTTCGGAACTGCTGCAGATCTTTTC 
TCCATTTTCCTTCTAATTAATTACATCTGTAAGTAAAAAAATATGAATCAACCCATTGGAATGGTTATGTCGAGGTTGGATGAGCACATCTGCAGTC
CTGGTACTCAGGTGTCTGAGACAGGAGGATTGCAAAGCCAAGGCCCGCATG 

TAGTAGCTCAGCAGTAGAGCCCTTGCCTTAGCATGGGTAAGGTCCCAGGCTCCCTCTGCAGCCAGGAAAACTGTCTGATCTTGTCACCATGCACACC 
CACCCACAGCCCTTTGCAGAACAGGTGACAAAGAACTGGGTGTCAGTGGGAAGCTGACAGGGCCATGGGGATGTTTACAGTCTGGCCTAATTGGACA 
GTGGCATTGGTCGCTATCATCAGCCTAAGCCTTCCTGACATCTATGGCTCATCCACCTGGCTGCTTGCGTCTGCCTGGGCCTGGATGGTAGGTGGTT 
GGGATGAAGACGAGCTGTCCCTGGGGCTGAAGACAACAGAACCTTGGGTAGAGCAGGGCAAGTCGGGGACAGAAGCTGGGTATAAATAGTCCCTGCC 
CAGCTGGGCAGGCCTGGCAAGCCCTCCGTCCTCCCCCTCACCAGCTGCCAGTGCCAGCCATGCCCCGACCTCCTTTGGGAGCCTGGTCACCATCTCA 
GATCCTCACTGGAGGAAGTCCCTCAGCGGAACTGAGGAGTCTTGTCCTGTGTGTGGGCGAGACCTCCGGCTGCCCGAGTGGGGGCATCAGGCAGGCT 
CTTCGGGGTGTCCTCTATTTTCTTAACTGGAAGGCCAAGTGTGCTGGTGTATGTGTGTGAAAAATCAATAGCTATTCCTCTTTGTGTGTGTGTGTGT
CATGCTGCAAACAAATGAGCCAGCCTTCTGTGGTGAGAAGGGGAGAAGAGGGTACGGGGACACTAGCAAAGGTTACTTCTAGCTTGGGAGCTCACAG 
AAGTCTTTAGAAGTGATTCAGGGAAAGTTTGACTTGGGTATGGGCTGCAGAGAGGCACTCAAGGAGAGGGCTCAGCATGCGTGAAGGGTGTGAAGCT
GTTTCAGCTGGCCTCAAGGCAAATCTAATCAAAGCCTCTCACAGTGGGGAGATGGAGAAGATCAGGAGCCTCTCCTGTTGACTCTCCCAGGGGCACA
AGCTCTTCTGTAACGGGGACAAGAAGTGGGTTACACTGATATGTCTGCAGCAATGGAATGTCAGCCATGGTGTGTGGGCACGGCTCAGGGCCGGCAG 
GGCATGTCAGGGGGGACAGCTGTCCTGCAGAAGTCAATCCCAAGGCCATAGCCTCAAGGCTGTCCATAGATCTGAGCCCTGTCCCCAGAGGCCCTAG 
CTAATTACCACAGGATATCCAGATAGAGCTTGTTGAGACCTCTGACCGGAGCATCCTGGAAACCCTCCCATGCCCCTTGGTCAGTTTGAACATGCAT 
CCAAGTCACACATGGCTTTGCTACGCTCAGCCTGTC^GTCACAGAGGGATTTGCTTGGCCAGCCCTGTCAGGACTGCTGGTGAAGAGACTCTGCCGT 
GTCTTGGAGCCTTCCTCCTCGTGCTTGATGGGAAGTGGCTTTAGAAATTCTGATGTGCTGTATCAAGTCAGGGAACTGAGGCTCCGGGAAATGAGGT 
TGGCGACCCGGCAGGGACCAAGTCCTGAGAGCATGGAACCTGAGGTGGCTGCCATTGCTTGAGGGATCCTGATGTGCATGGCACTGAGCTGTACATG
TTTACTCTTCCCGCAAGCTCTGTCTCACAGTGGAAGAAAGTGTGGTGTGGAGAAGTTAAGGAAATCGCTTGGATTGGCAGCTTGAGGCAGGCAGAGC 
CCGATGTGAGCTGGGGTGAAGAGACAGGCTTGGAAGGAGGAGGGAACCTAACTCTGGGTGACAAGTGTCTACACTGAAGCCCTTGCTGGGGTTTCCT 
TTCTAGTGTCAGTAGGGCTTAGGGATGAGTGGCAGGCCAGGGCTAAGTGCTGGCTTAGTGTGTGAGAATGAGCTTGATCAACAACACTGCAGAGAAA
GGGCCTGGTTGGGGGCAGGGGTCTGTAGGGAAGGAAGAGAAAAGACAAAAGACIATGCCAAGAGGGAGGCTCAA 

TCCAGGCAGCTTACTTAGCTCCCTTGCCTCTGTTTCCCCGTCTGTGAAATGGGATGATATTGATGATGACACTCTCTTTACCTCATAGACTGGTTGC 
GAGGATTAAATCAGTTAAAGACGTAAACATTTCAAACTGTGCCCAGCTCAGGTCAGATATACATCCATGTTAGGTGTTTTTGCGGGGCTCCCCTATG 
CCTTTATGTGCTGTGTGGCTTCAGGCGCTAGGCTCACCATTGTTTCTCACTCAAACGTCAGCCCACCAGCGGGTAGGCAGGCCAGGCCTGAAGCCAC 
CCAGCTCCCTAGGGACCCAGACAAGCATTGCCCACGAAGCAGAGGGAAGGCTTGCCTGGGAGCCAGGTAGTCTAGTTTCTAACAGCAGATAGAAACT 
CTATAATCTGGGCAGCTAGTAAAGCCCCCACTGTGTGCTGGGTTGTTATCAAGCAACTCAAGGAAGTTAGTGAAGAGTTGGTAGTGAGCCGTGATGG 
TGTGACACTAATGCTTCAGTGGTCACCTCATTCTTTTTCTCTCACTCTTGGCCAGTCCCAACAGGAGACAGTCTCTTGTAGCCCAGGCTAGCTTGCA 
CCTCCTGACCCTCTTGCTTCTGGCTCCCAAGTGTTCAGGGATTACTACCCACCTTCCTAGCTTTCCCTTGTCAAAAGGGCCCAGGGCTCTGGTTGGG 
GAGAGCTCAGTGGGAGATCAGTTGTCTGGCAAGCTTGAGGCTTTGAGAGGTGGGGTTGGGGACTCTAAGCTCAGGACAAGGTTCTTCAAGGCCACCA 
ATTCAAACCCCCAGTTCTGCCTACTCGCACTCTCTCTTCCCAGTGGGGAAACCTGGACTTCATCCCCTGACAGCTGAGGCCAGAGGCTGCTGGAAGT 
GGTGCTGCTGTGTGCCATGTTTTGCTGGAGCCTCTGAGCAGAGCMCCGAGGCTCTGGGTGTTGAGTGTGGGGGAAGGGTGTTGGGAGGCAGCTCAC 
AGCATACTCTGATTAGACTCTGATGAGCCATATACCTGTTCCCAAGAAGCCTAGGGCAAGCTAGTTTGTTCTGGAAACCCAGATTCCTCATCAGGGG 
ACAGAGATGATCAAAAGCTGATCAAGTCTGTTACATCCAAAGTTCTAAGCAACTACAGAGTCCCTCAATGGCCGCGTTCGGTTCACCCATGCCAACG 
AAGCTTTTGTGTCTCTTCTGACCTACCCCAGAGTTTGCCCCTGGCCCTCTTTCCAAGGCTCCTGCATCCAGGACTTGACCTTAGCTAGTGCCTAGGT 
CTCCTTCTTCCAGGACACTCTGCCCTTTATTATGAGCCTGGAAGAGGGAAGGAAAAGGTAGAGTTTCTGGTAGTGCCTTGCTTTTTCTACAGTAGTT 
AACCCTTTCCCCCCCGAGTGTACTTACCGGCGCTGTAAAATCCCAAGGGTTGAAATGTAAGAGGTACAATAATGGGGTGGCTTGACTGAAGTGAGAA 
GGTTGGGGGGATGCCTGCTGAGAGTTTCCTTTTGCTCTTGAGTTAAACTAGTCATCAAACGGAGCCTCTGTCGCTAGTGAGAGGGGAGGTGGCACTC 
TAGAAACTTCTAGGGAACTATAAAGATGGTGGCGGAGGCCCTTGTAGACAGAGCACTTGATGCCATCAGAGACAAACAGGCTCTTGGTGTCCTCTGT 
ACTCTTGGGGAAGAGGCTAAGAGGGCTTTGTGGGGAGAGAGAGAGAGAGGGGGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGA 
ACACTGTGCTAAGACTTATGCTCCCCCCCCCCCAAGATGGAGCTGTGATCACAATGGCCTCTGCTTCCTTTGTCCTTTATATTTGGATTCCTTCCCA 
TGAATGGCTGTTAGTCTCCACATGACCCACGAATCCTGGAGTCCCTGGGAGATTGCTCTGCAGGAGCCCTGCCATCTCTACCAGGGCACCGGGGCCC 
AGGCACTGGGCAGAACTTGATTCCTGCTCCAATTCTTCATTTCCAACCCAGGGTTATTTTTGTGGCACAGTACAGACCTACACAGTGCCCCCTCCCC 
CAGAGGCTCTGCTGCTCCACATGCCCATGTCAGAGGGACACTGCTTGAACCGGTGCCGTGCCCTTTGTGGGACACCGTGTCCCCAAGGAAAGTCAGA 
AAGCAGCACACAAGGTGCCAGGCTTCTCAGAGCCACGGATCATCCCTCCCCCACCCCAGCCCATCCTCTGGGAAGAAGTGACATCCATACTAGGCCA 
CGTGTTCTGTGTCTACAGTGCCATCTCCATTGTCTGCCTGCAGGTCCCTGGCCCTGGGCCAGCAGTACACATCTCTGGCCTCCCAGCCTCTGCTCTG 
CGGCTCCATCCCAGGCCTGGTCCCCAAGCAACTGCGCTTCTGCCGCAATTACATCGAGATCATGCCCAGCGTAGCAGAAGGTGTGAAGCTGGGCATC 
CAGGAGTGCCAGCATCAGTTCCGGGGCCGCCGGTGGAACTGTACCACCATAGATGACAGCCTGGCCATCTTTGGGCCTGTCTTGGACAAAGGTACTT 
GCCTGAGACTCGGTGGGAGGTGGCTGGGAGACTGGGGGTGGTGGGAAGGGTTCTGGGGACCCTGTCCCCTTCCTCCCACTGTGGGAAACCCTGAGTC 
CAAGTTCTCCCTAGGAGGGTAATAAAGCTGGTGGAAAGTGGGTGGGGTCCATCTCTGGACAGGACAGGCTGTGTCCACATCGGCTACATCTTTTCTA 
GTGCTACAGGGGGGTGGGTAGCCCTGAGGTGCCCCAGTGTCCTGAGATTGGAGGGCAGGCACCAAGGAGCCCGGAGCCTTCCTTCCCGGATGACACA 
GTGGGGGCCCAGTCTTTGATGTAGACATGTGTGACATTTCCCTTATGCTGTGTTAAGTGCTATGCAAATACAGCTTGGGGGTGGGGGTGGGATGGTG 
CCAGGCCTAGGTGTTGTGCCATGGAAAAAGTCTCTATGCTCTGTCAAAGCTGTCTGGTGCTGGTAGCACTACATGACCCCCGAGGGTCCCCTGAGAG 
TTCCTGTTGGTTCCCATAGCCTGAAAGCAAGACGCGTCCATTAACCCTGATGTCCTGTGGATCACGCTCGGTGATCTGGAAGTCTGACCAGCTGTGA 
CCCTAGTGGATGAAAGCCGGCTTTTGACTTTGCACATGGGGCATAGGAGAGGGTTGAATGGGGTAACTGCTCCTAAGAAGGGGACTTTGGCTCTAGA 
ACCTCTCCTGTCCAGGAATTTAACGGCCTTAGCTGGGCTGGCCTCGACTGGAGTTTTGCCAGCCGATGGGGGCGAGCCTGCTCTGAGAAGGTTCGTA 
AAAGCTGCATTCACAGCCAAAAGGGAGGAGTCCGAGAGCTAGCGGGAGGGTGGGCGCAGAGTTTGGGAGAGGGGAGCTATCCCCCCTCCTTTGGAGA 
AAGCCCTCTCCCCCTTTACCTGTGCTTCAAGTGCAAAGTGTCTGTACAACCTACAGGCAGAAAACGCAGGCTATTGTCGTGGGACAGGAGGGCAATT 
AGCAAATCAATGGATGATGATTCTGTTTACGCTTCTAATACCAAAGCTAATTATTCCTTGTTTACATAAAAGGTTTGGCCTCTAATTGGGGCCAAGT 
GGCGGCGGCGGCTGGCTTTTCAAAACTGATCTCGCCCGCCTGTTGGTCTCACCGCCTTCCGGCGCCCCACCTCAGTCCAGGTAGCGTTTAATCAAAC 
TGCCAGGCGTCCCGACGCCTCCCCCAAAGCAACACTGGCCTTCTGATCACTCCTCCCACCCAGGATAGTGTCCATCGAGACCTTTCAGCATGATACC 
CGCTCGCAGTGGGTCTGATGTTTAGATTCGCAGCAGTGGGTTGTGCCACATGCCTCCGTCCCACAGCAAGAGGGCAGTTTGTGTCCCAGTGTGCAAG 
GAAATATGGGGCTGAGACAGTGGGATTTGATGAAGGCAGTGGGGTGAGGTTCCAGGAAGATGTACCCGCGTCCCGGTACCCTGTGGGCAGGCGGTGG 
GTCCTGGCCACTTTTGCGCACGTCCTAGCAGGTGCGGGGTGGGCGGCCGGAAGGCGCAAGGCCGCGGTCCTCCGCGGCTCCGGAGGGTTCGAGCGCC 
GCCCCCCGCCGCGCGCCCGCTGCGGTCGCCGCGCAATTCACACGCCCTCTGGCCGCCGCCTCCCGGGGATCGCGCTGCCCGGCGCCGGGGCTGGGGG 
CGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCAGGCTAATCCCCGCCCCGCCCCCGCCACCCCCACTGCCCCGGCGCTGGGA^ 

TGGCGGGGCCGGGTCGCCGGAAGCCAGGGGCGCCCTCCCCTGGGGCGAGAGGGCCGGGAGGCCTGGATCACCGCCGTGCGCAGTCCCCCCCGGCTTG 

GCTGGACTGCTGAGCCAGCTCTCGCTCCGCGCTCCCCGCCGGGCTGCGGGAAAAGTGGGACGCGCCCCTGCTGTGCAGGTGGCGCACGAGGGCTTCG 

TGTCTGCAAAGGCTGCTGTGGCTACTGAGTATCTGGTTCCCTGCATATTCAGCACGGGCCACGACCTAGGAAAGGAGATCCGGCTCTGACCACCTGT 

GCAGTATAGCACGACCCGCTCTGGGAACCTATAAAAAGAGATGGCCCGGGCCGTTGGCCTCGCCCGGCCCTGTCTGTAAAATAGGGTTTAGAACATT 

GTAGGGAATAAAGCACAGGAAAGACAGATCTGCGCGTATGCAGTACTCAGGGAGTGGCTCTCATTGTAACAAGGCTGGTGATGCCCGCTGAGACTCT 

CATCTCTGAACCCTGGGCCTGAGGCCGAGAAACTGCCTTGTCTCGACCCTCGTGGCTTCGTAGGAGGAGGGCAAGAAATGGGCAGCCATGGCTGAAG 

ATGCTGCACCTACTTTACAGAGTGTACTGTGAGGCACAGATGAGAAATCTGCCTACGGGGAACTGAGGTTCGAGTGGAACGAGTGGATTAGTAGGTT

TTAGGCTGCAGGTTCCAGTCTCCAGAATGGAGGACTCAGAACTGTTAGGAGGAGTGTTTCTGAATGTGCAAGGTTGGAGGCCAAGAGCCCAAGGACA

GGGCGTGGCCAACCTAACTCTTCAGAGAGCATTCCGGTCCCACCATAGACCCCTCCATCTTAGTAGCCCTGCAGCTCTGAACTCAGCTTGCCCATCC 

ACAAAGAGAATACTAACTAGGGTCTGAACTCTTTAGAGCAACAGGAGGGCACCTGGCTCTCTTCCCCCAATGACTGGTGAGCCTTTGCGCTCTTCCC 

CAGCCCTCAGGGCAAGGCTGTGTATGTACCCTAAGGCTCCAGGCTGAACATCTTCTCACACCATCCCTCTGCAGCCACCCGTGAATCGGCCTTCGTG 

CATGCCATCGCCTCGGCTGGTGTCGCCTTCGCAGTCACACGCTCCTGCGCTGAGGGAACCTCCACCATCTGCGGCTGTGACTCACATCATAAGGGGC 
CACCTGGAGAAGGCTGGAAGTGGGGCGGCTGCAGCGAGGACGCCGACTTCGGGGTGCTGGTGTCCCGGGAATTTGCGGATGCGCGGGAGAACAGGCC
AGATGCCCGCTCAGCTATGAACAAGCACAACAATGAAGCAGGCCGAACGGTGAGACAGCGACCCACAACTCGGGGGCGGGAGGCACTCCCTGGACCC
CTGCCTTCACGGCCAGGTTCCCATCCACACCCCATAGTAACC^ 

TATATTTACTCATTCTCAGTGTGTGTATTCCATGTTGTGTATGTGGAAGTCAGAAGGCAACCTGTAGTGGTCAGTTCTTCCCACCATGTAGGTTTCC 

AAACTTGAATCATGCTTGATGCTGTTCCTGGATCTTTTGAGAAATCTGCCAAGGAGTGCAGTATGGCTGCCCCATAGGAACAACACCCATAGGCTAC
ACTGGTTGGCAGGCTGTGCCTGTGTACGTCTGTGATGTGGTTGTGGGTTGGGGACTGACTCTGAATCCAAAGTAGGGTTTCCATAGCATAGCCCTTC 
CTATGCTCGTGGAACACTTAGCTCTCGGCCTCAGTTTTTCCATCTGTTCAATGGGAATAACAGTCCCTAGTTTGCACCACTTCTATGGATGAGGTGA 
ATGTGAAGCCTGCTGGGGCCGGGGGTGTGACTAGGGACACTGGTTTCCACCCATTCTTCCTC 

GGGGAAGCTTTCTCCCGGTGTGTCAGTCTGAACTAGCTGCTGCCCCTCTTTCCCTAGACC^TCCTGGACCACATGCACCTAAAGTGTAAATGCCA 

GGTTGTCCGGCAGCTGCGAGGTGAAGACCTGCTGGTGGGCCCAGCCCGACTTCCGTGCCATTGGCGACTTCCTCAAGGACAAGTACGACAGTGCCTC 

CGAGATGGTGGTGGAGAAACACCGTGAGTCCCGAGGCTGGGTGGAGACCCTGCGGGCTAAGTACGCGCTCTTCAAGCCACCCACCGAGAGGGACCTG 

GTCTACTACGAGAACTCCCCCAACTTTTGTGAGCCCAACCCAGAGACGGGCTCCTTTGGTACCAGGGACCGGACTTGCAATGTCACCTCCCACGGCA 

TCGATGGCTGCGATCTGCTGTGCTGTGGCCGGGGCCACAACACGAGGACGGAGAAACGGAAGGAGAAAT 

TGTCAGCTGCCAAGAGTGTATTCGCATCTACGATGTGCACACCTGCAAGTAGTGAGCCAGGTAGGGCGCGGGGCGGGACGTGGGACTGCCCTCTCCT 

CCCATCCGCTAGGCCTGCTCTTCTGATTCTCGGCTTCCCAAGACGAACAAGTTCCCAAAGCTGAAGTCAGAAACTAGGCTTCCCACGCTTCCACGGG 

AATGGTCTTGGGGAGCGACTTGACGTATTTACACGAGGCTCAGATAATGACTGAGGGTGTGCACGTCAGCCACAAGCAATTGATTAAAATTCTTACA 

CAAATTAGCAGTGATGAGCCAGACCAGATTTGTTCATGGCTGCTGGCTGTCATTAGTGAGCATCTATTCGGCCAGAAGAGAGGTTGAGGTCTGGTCC 

CAGCCATGCCCCTAAAAGTGTCATATCTCAGGGCCTCAGTTTCCCTACTGTCTAAAGGAGAGGTTCAAATGTCAGAAACAGGGGCAGGGTATGGTGG 

GACTGGCCTTTAATCCCAGCACTTAGGAGGCTGAGGCATCTGGATCTGAGTGTGAGGTCTGGCTGGTCTCCATAGAGAGTTCTAGGCCAGGACTACA 

CAGTGAGACCCTGAGACAGTCACCTAGTAAGTCTCTCCTAACCCTGGGGCAGAGTCATTTCACATCACCCATGAACTGAGAGGGCAAGCTTTGACCT 

AACACAAATCCTCCAGAGCTTGGGAAGGTGGGTGGGGGAAGCCAGATCTGAAGCTTGGGTGGAGCGGGAGCACTCTCACCAAGTTTATGGAGGGAGA 

CAGGAAAGAGGAGAGCTCCCAGTTCAGAGCGGCTGGGGTGAGTGACCGTGGGATAAAAGCCTGGTGGATTATGCTTCCAAGCTTCCCAGAGGTGGCC 

TGCATGCGTGGGAGGACCCCAAGAAAGGTCAGCCTGGAAGCAGAGTGATCCTAGAGGCTCTGGCATGGGCCAGTGTGACCCTTGTCTCTGTCCCCTC 

CCCTGAGAGGGCTCTGCTAGTGAGTCTGACTCTCAACTGGCTTTTCTTATTTGTCAACCAACAACTCTGGCTTCAGCATCTGTTACCTTCCCCGAGC 

AGCTGTGAGGAACGTGGGAAACCACATTGGTTCCTGGAAAAGCAGCTTTCTGTCCATCTATGGATGGCCAGCGCAGTGTGTTCCACCCAGGGACCTT 

GGAGCTGGTGACACACCAGAGCTGGTGGCTCATGTCTGCGCTCTGTAAGGGTGGGGGGGTACCTGACATCACAAGCCCATCTAAGAAGTTCCTTGTG 

GGGCTTAGGTTCCATTCCCAGGGTCCCTCTAGAACCATACAGGAGGCGTGGGTTTTCTTTATCAAGGTCAAGGCTGGAAGGAAGACAGCGGCGAGCT 

TTAGCGCTTCACGCGCCCACTGCTGAACCTCGCTCCTCATCTGCGCAGTGATGGCAGTATGGGGATCTTGATGGATTTTTATAAAGCTCAAGTTAGT 

GGGTGTGAATGCTGCTCTAAGTGTCAGGGTTGCCATCAGCGTTAGCACGGGGTTGCTGCAAAGGAAGATCCCATGGGCGTTCTTTCTGCTTTTCAAA 

GGGATGGGAGAAAAGTCAGCAAGTAGGAAGGTTGAGGTGGAAGGAAGCAGACCATGACGGCCGCCTTCTCTGGTATTCACTCTGACAAGAGTTAGCC 

CTGCACCCCCCAATATCAGATAGTACATACATAGCTCTGGCCGTTTTGAGGAGCTCTTGGTGTGGCTTCTTATCACTCAGCTAGAAGGGCCATTTGT 

CCTGACCACCCATGTCACTGAAGGTTAGACTTGAGTTAATGGCCTAGATAGACCCAGGGGTCTGCTTCCAGAAGCTTCCTATGTAGGCTTGCATAGG 

CCTCAGCATTGCCAACCCAAGCTTGATACATGGGGCAGTGTGGGTCCAAGATACAGAGCCCATGACTCCACTCCGTCCTTGCCCATGTGGGCAGCTG 

AGATCTTCCAGGCAGGGAAGGAGGTGCAGGGTGAGCCTTTGTTGTCCAGCAAATCGCTCCACTCCAATCCTTGCCTAAAAATGATCTCCTAAGAAGA 

ATTGGGTGTAGCTGGGTAGTAGAACACTTACACCCCTGAATCAATCTCCAGTACTGGGGAGAGGGGAAAGCAGCCCCCTTTCCTCACATTCCTAAGG 

TAGACACTTGCTGTTTTTCTAGTATCATGCAGACATGTTTTTAAGAAGCTATGCCCACAGAGTTCACAGCCTTAATCCCTTCCCATGGGGTGGGGGG 

TGGAGGGGGGAGGAGGACTGGACTCTCAACCTCACACTTCCTGATAAGAAAGATCCCTCCCACTGGGATGTCCTTTGCCTCTCGGAGCCCAGGTTTC 

CTCCTGTGTGAAGTGATAAGGTCTGTGCCCTGCAGGGCTGCTCAGGAGCCAGACACAGCCATAAAGAGCCTGCTCTGGGCTCCTGGTTTGGAGCCAT 

AGGCCAGGAGATGGTTGAGACCTGGTTCCTTCCTGTCCTTTCACCCCCGAAGGCGCTGCTACTGATGGAAGCCAAGGGACAAATGGGCAGGGAGACA 

GCAGGCATAGAAGGCTCTGGCTTTTTTCTTGCACAAAGGGGCCACAGAGCAGAGCCCAGCAGGCCACATGCCCTTCTCTGAGAAGAGTTCAGCACAG 

TGGTAGCATCCAGTCCTCCCTAAGTGGGGAAGACACCTTCTCCATTGCTTGAGGTTTGTCCACTGTCTCCACTTAGCCACTAGCTTGGTATCCGAGT 

GCTGGTAGTTTACAAATATCTGATTTATTCTTTTTATCCCTATGTCGGAAAGGCTATGAATCTAGTCACCCCCTCTAACTGGCCCTGTGTCCTGGGT 

CTGTTACTAGTCTTTTGTGGGACCTTAGGCATATCCAGGTCGTCTTGCTATGCCTGGTTCAGCTCTGCCAGCAGCACCAGAGATGAGGGTTGGCTGG 

GGGTTGGGGGGTGCCTGGGGCGTGGCTCAGTACCTCAGATCCCACGGGATAGGGCTGTACAGAGTGATCATGCTCAGGAAAGCACCCTGGGAGGGAG 

TTCACCATAACTAGGTAGAGTCCGTTTCTTTTGGACAATTCACGGCAGTTGTGCGTCTGATGGCAGGCACTAGCCCTGCTGTGACTGGTGGCTCAGA 

CTGGCAGGCTTTTCCGTCTCCACGTAGGAAGGGCAATGCTCCCTTGGTTACCTACAGTGGGCTTCCTCAGCTAGGAGGAGGTTTGAGTTGTGTGGAT 

GCAGGCTCTGTTCTTCCTGAGGATGGCATGTCCTCCCTCAACAGCAGGCTCTGGGGCGAGGAGGGAGGCAGGGCTGGCAGCGTAGGCAGCCTGTGAT 

CTTTGCATCCCAGGGGGTGATGTTGACAGCAGTTATGAGGTTTCTGATTCTTCCTGTTTATTTCCCAAAGGGCACTGGGAAGGGGTAGATTGTGCGG 

CTGGATCCATTCATCGAAGTCCCATGAGAAGCAGGATCTAGATCCAGGCCAGCCTTCGGCACTGGCCAGCAAGGAGCATGGACTGTTGCCAGCTGCA 

TGTGATAAACGACCTGGACCCAGCCGGCCTCGGACGGACGGGCGGCTTCTTTCTCAACTAACGTCTCTCCCCCTGCTCTGGATGGTGTACGGCTTTA 

CAGAGGGGCTTTCTTTATGGTTTTACC^GGGTCTGCTGGGGACAGACTCGAGGCTTACCTTTGCACATGTTAAAGAAAATAAAAATGAAAAAAAAAA 

ATCTACCGCAACAGAACAGGCTGGGCTAGTGTGAGCTCTTGGCCTGGTGGGAAGGACAAGACCATGGCGAGATTCTGTGTCCAAGCTGCCTCTACTC 

GTGACATTCCAAGATGCCTCTGAGGTGGGAACTGTGAAGTAGGACAGAGCCCCGCAGTCCCCTCTTGTCCGTCGACTCCCATTTAAATTGGACATAC

CTTGTCGTTCTGAGAAAAGCCATAGATAGGTGTAGCTGGGATGTAGTGATGGGGAGGCCCCTGGCCAACAGTGGGAGCAAGATCTTGAGTTTTGAAG

ACCTCAGAGTTCTGGGCGGCCTGGGAAGCCATCTGCAGAACAGAGTTCCTTGTGGGCTCCTGTTTTCGCTAGCCCTGTTCTGCCCTGGAGCGACAGT 

CAGATCTCCACGCCCCTTTCTGTTGTTCTACAGTGTCCACCTTTACTACGCGTTTTTTTTTTTTTTTTCATGATGACCTTGTAAATAGGTCAGATGT 

GGAGGCAGGTCTCTTCTGGCTCCATCCACC^CACCC^GAAAGAATGGGCTGCTCTGCCCTTCTCAGCCTTGCTAACCAGCAGACACCGAGGAGAGCA 

GCGGGGCACCTTAGAGAGCAATCTAAACATGGTTGGCAGGTGGGGAGGGTAAAGAGTCCCACTTCCTTTGTGTTAGAAGGCAGACTACCCTGCGTCC 

TTTTCTCCCATTGGCTGAAGTAACCAGAAAGACAAGAGATCCTTAACAAGCCCTTCTTCCCACTTGTAAAAGGGATAGCCTATCTCAGTTCCCAAGG 

ATCTGGATTAGATAGATATTCAAAAGAGGCAAGCAGCGAATGGAGGCAGCTCCCAGCTCTGTTCCCGACGCATGATGGTACTGGCTGGGTTTAGTAA 

GGTGGGTGGGGCTGCACGGATCAATCCATCAACTCCGTCTTAAGGAGAATCAGAAA^^ 

TCCTTTCCCGCTTCTGTCTAGGGTCTGCTAATGCTGGCTTGACGAGGGGTCAGCCACTTCTTTCCTGTTGTGCAGTTGGCTTGCCAAGCAGGCTCCA 

GTAGGCCCTTGCCTGCACTCTCTACCATGTGACCATGAGCACTGCTCTAGGGACACCTCCCATCCCTTCCTAGCACCCCAAATG 

CTCCTTCCAGAAGTTGGAAATCAAGTCAACTGGATAACGCTTGTGTGAGGACACTTGAGCAGAACGGATACAACAATTTACAAGTCTCTTCATATCT

ATGTATTCTATATTAAAAGTGATAAAGTCATGTTTCCGGGGCGTATTCAAGTAGCTGACAAGTAATTATTTAATAATAGTACATGAGCGCATTGTAA

TTATCCTCGCCATAGTCAGGTAATAGCATCCAATGGGAGGTCCCTACCAACCTGCTGTATCCAAAGTTTTGTAAAAAGTTGTAGAAGTTGTTGATCT 

TTTTGATTTTATATTCAAAAAGTCTCTTTTTATAAATATTATTTATTATACAATGTATATACCTTTGAGTTAACTAAGATTATATATTATATAAATA

TATATATATTTGGAGAAAATCTATTTCATCATGCAGTTTTTTTCTGTTAAATCATTAAAGAGAAGGTAAACTTCAAATGGACACATTGTACGATGTG 

TGGAGTCTCCAGATTCGTCCTTAGGTCAAGGGTATCTGAATGTTGTGCTAATTAAATGGCCAAATTTACGGTCATTTCCTCTGCTCTCTAAGCAAGA 

TGAGGTATGATGGACAGCAGATGTGACATGAGCCTTAAAGCATTACTTGATATTAAGTCAGTATCAAACAAGGAATTGCCAACTTGGGGAAGAACAA 

ACCTGGTCAGCAATTCATTCATAGACTTGAAAAATTTCAACAACGCATGTTGATGTCTCAGATCTTTCTACAAGGTTCAGCCCAGGTCAGTGGGTCA 
CAGGGCCTGCCATAGTCTTACTCCCTTTCTCACTACATACCCAACTCCAGTTGTTGGAGAAATAGGAATTGCTCAACTGTAAACAAAGCTGGACCAG

AAACAGATAACAGCTGAACTGTGGAAGAGGTTGGTAGGTAGGTTGACCAGATGGTGTTCTGCCTGTGCTCTGGGCCAAGTGACAATGGAATTGTAGG 

TTGGTTTAGATAGGTTGCTGGGCAGAGCTCTGGGAAGGGCACTCTCTCCTTGGGCTGGAGTTTCAGTGACTCACCCGAAGCCTTTGCCTGTCATCTT 

GATGCTGTGCAGTTTTAGGATGATGGGTGGTCCTTCACTTCTCAGACCTCCCTCTAAAATATCCCCTTCAGTGTTCCTTTCTTCTGGTCTACCACTA 

TTAGAGATAACGTATTCACACTGGCCATAGGAAGATGAAGCAGTCTGCTTCTGACGGTGGGGACTGCCAACACCTTCTG^ 

CCGCCTGAGCTTCCATTTGGTGCATCTTTTGCCCCTAGAAAT^ 

TGTATAAGGTTGCCAAGTATTTAGATGGTCCCTATCATACTTACCAAGGCCATTTGTCTCCCATGATCATTGTATTCCATTAAAGAGAGTACAATTT
ATAGCTATGCTAGAACACTAGCTGGAAACACTGGAAAGAACAAGTTTCAAGT 

GCAGAGACAGGTAGATCCCTCTGTTCTTGGCAGCCAGTGCTACACACCAGACTTAGTGGGTCTTAAAAAAAAGGTTTTGAGAATAATTTGAGAGAAG

ATAAAGGGATATGGTGAGGTTTTTTTTTTTTATTGAAAGCAACACTATTCTGAAGGCAGGCTGTGGGTAAATGGATGGGCACAGAGATAGGAAAAGG 

AATCGAGGAGAAAGAC^G<^GGGAAACGAGGCGAGGCTGTTTTGGGTGGAGTGCTGGTCGACCTGTGTAGTGCTGGGCATAACAGCACGTAACATG 

GGTGGTGTGGGTCTACATTAGGAAGTGAGTTCAGCAAAACGTATGGTGTTAAGCCATGACTGGCTTAACAGAACTTGGGAGGCAAACAGCTATC 

TCACTAATCAATCACCAAATGTTCATTATGGAGACTGTGCTGGAGAATCATAAATACATGTCATTACAATTTCTTGTTTGGACATAGTCTCACTTTG

TAGTGCTGGCTATCCTGGAACCATGCAGAATAGGCTGGCCTCAATCTCGTGCTGATCCCCCTTCCTCGCTACCTCTGGGATTAGAGGTGTGCACCCT 

TCCTCCCGATCTTGGGATTCCTGCAGATCACAGGTGTTATCCGTGCTCTGGCATCACGACACACTCACACCCCCTGGGCTGTCACTGGGGCTCAGCC 

TCAGCAGTTGTCAGCAAACGAGTGGCTTTCATCTTGCAATAACTTCTAAAATTTAAATGCTGCTTTTGAACTCTTAGCAAA 

TTATTTGAAGGCGGCAGGCGGCCCAGCTGCCAGGCAAGCAAACTGAAGTCCTGGCCTTGCCTTCTGTTTGGCCAGGTGGATATTTATAGCTGACCCT 
GCTTCAGCAGGCCTGTTGACCAGGCTTTGATTATTAAATGATAACTGAATCTCCTCCATGATTCTGTAACATAAGGTGGGAAATGTAAAGGCTGCTC 
ACTGTAATTTTTAGGCAATAAAAAGACAGCCTTAAAAACACATCTTAGGCTCGATAGATGCCCCAGTGTTGGGGAATCGAGGGCGGGGAG 
TGGGTGGGTGGTGGAGGAACACCCTCATAGAAGCAGGGAGAGGGAGGATGGGATAGGGGGTTTCTGGGAGGGGGAGGAAAGTGGAAAAGGGCTAACA
T TTG AAATGTAAATAAAGAAAAT AT CC!AAGA AAAAAG AAAAAAGT ATT C AAAAT ATATACAT ATAAAAA 

GTTATGCATGTTTTTGGATAGTACATGAAAAAAAAATCCTCAAAAGAAGACAAAAATGTAGTAAGGAAAAAATGAGCTCTTCTTTGGAAGATTTAGT
ATTTCCTGACAGCCGGTGTGTGTGTGTGTGTGTGTGTGTGGGGGGGGGTCATCAGAGCCCTCCACACTGTTCACAGGGCTATGCTGTGGGCTGGAGG 
TAGGAAGGCCTTTGGGGATCCTCCCAGAACTGGAGGTGGGGGCAACTGCTGTCCCCCAAATTGCTGGACATGCCTCTTCTCTACACAGCATCCATTT 
CTGGGCTTCCTGGTGCAAGGACTGGGGCTGTGGCATCTAGTATCTGAGAGATCCAGAAGGTATGAAGAACAGCAGGGTGAGCTCGGGGGTCTCAGGT 
TTGCCTGCTTCAGAGAAGTCATTGTACCGCAAAGCAGCTTCCACCCAGCTCCTGAACAACCGAGCTCATGGCTGCAGTTCAGACTTTTTAAGGTAAA 
ATAGGTATCAAACGTAAATATGAAGATTCTCGTACATTAGCATATACTAATGAACTTTTGAGGGGTTTGTTTTGCTTTTGTTTTTGGAGACAGGGTT 
TCTGTGCAGCTCTGGTTGCCTGGAATTCACCCTGTAGACCAGGTTGGTCTTGAACTCACAGAGACTCACTTGCCTCTACCTCTAGAGTGCCTGGATT 
AAAGGCATGCGCCACCACTGCCCACTGAACTTTTTGTTGTTGTTGTTGTTGTTTGAGAAGAGTCTCTCTGAATGAATGGTTCCCAACCTTGCTAACG 
CTGTTTGCCTTTAATGCCGCTCCTATCATGATGACCCTCAACCACAACCTTGCTTTTGTTGGTATTTCATAACTGTAATGTTGCTCCTGTTGTGAAT 
TGTAAGGTAAATACCTGTGCTTTCTGATGGTCTTAGGTATAACCCGTAAGTTGAGAACTACTGTCCTACAGGGTCTTGGGACTCACTATGTAGAGTA 
GGCTGGCTTCCAACTCAGAGATCTGTCTCCCAAGTGCTGGGATTAAAGGCATGTGCCGTCAGCTTGCATGCGTATCTGTGTGCCATGTGAGTTACAG 
TTGTACACTGCCATGTGAATTCTGGGAATTGAACTGAGTCCTCTGGAAGAGCGGTAAGAGCTCTTACCCACTGAGCCATCTCTCCAGCCTCCTTAAT 
GTTTTTCTAAAGAGATTTAAAAACAAAACAAAAAAAAATCTCCTATGCTGTAACAAAGCAAGGAG 

GCCCCATTAGTGATTGATCCTGTTCCATATAAAATACTGGTCTATACAATATTTCAAGTATATTTACTCTTTATTGCATTCCTTCAATTTGCATTAC 

ACAATATAATATCTTTTTCAATATAATTTTGGACAAAAACACAAGACATTAACTTC^TTGAACAAGAAACATAAATTAATTCATG 

GGGAGGAGAAAGGCAGAACCGCAGACAACAGGGACCACGGCGGTGGACAGGGAACCACGGTACTAGCCACGGGAATCACAGTTTCACAACACAATTC 

ACTCTTTCAAAGCCACAAAATAAGCCAGGGATGAAAACCCCAAACAAGGTATACAGACGGCAAGACTAGGAGATGATTTGCCTTCTACTGTGGGGAC 

AGTGTCAGAGTATCACCCAGGTGCCCCTGCGGCAGCAAGCCTCCCGGGCGGCTTCTGATGGCTGTCTTTGTGTTTGAAGCAGCAGCGGGGAGAGTGG 

CATGCGCCCTAAGACTCATCTCCACGAATGACGTGACGACCTAATTGCATTCCTTCTAACCCACTGATTAGGCAAACCACCCTCCAAAGGGCTCGCT 

GAGTTCCTCTTCGGGAAGAGGTGTGTTGAGTACGCTGGAATGGATATTCGAGGGCTGAGGGAGAGGACTAGGGGGTGGCTTGTTCTGCTGGAGAGTA 

GTGTTTTATTTATGTGCTGCAGACACTACAGATCACTTGAGATACAAGGAGGATGAATTGTAAGTCATCAGGCTGAGGAAAAGGAAGACGATGGCAC

TAATAGGCCTCTAACGTCATCCTAAGGAACTGACCCACTTAGAGCCGGACGCTAATTAGAATTGAGAGGCTTGTAAATTATTTTAATTTTTTTTCTG 

TTTAGGGATGTACAGATTTCCATATGTATCTTTCATAAGGTTTTAACGGCATCTACCAAGTGGCAATGATTAGCATATCTCTTACCACAGAGCAAAT 

TAATTACACTCAGAGCATATCACAGCCACCCCAGGAATGATTTGACTTCCGATC^ 

NNNNNNNNNNNNNNNNNNIWNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNISTNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NI^NNNNNNNNNITONITONNNNNNNNNNN CAAAC CCT ATT C 

ACTAAGAAACACTCGAGGCTTCCTGTTCTCTCTACATTTTCCACCATTTGTTTGCTATTTGATGGGCACTTTCTAAAAGAAAATGGTCACATTCCGG 

GCACTTAAAAGCAGGTTCCAGTGAATCATGGACCATCAAGTTTTCTCTGCTCTGTGACAAGGAAGTCTTGTGAAGCAAGCCAAGGAGA 

CACGACAGCCTGCTTCCTTGCTTGGTCCCTCAGACCCACAGCACAGCGAGGGGCTAGTGCGTGAAAACAAGGCTATACAGGAAGGAAGGAAACCATC 

AGCGGGCTCATATGCGCTGGGGAAGACTTGGGACCTCGCAGATGGGTGAGCACTGCGTGTACTTACGCGCCTTCTTCTCTCATGAGGGCCAAGAATT 

TCCTCACACGGTACTCAGGATCCATCTAAAAAGCATCGGAAGAAACAAATCAGACGCGCGAAGCTGCCTTTTCTCCA(^TGTGCTTCCTCCGCACC 

AGGGAGGCGCGTGGTCATGTGCTTCCTGGGTCGTTCAGAACCATGTCACCATTCCCTAGTTGTTGTTTCGAAGCATAATAAAGTGGGTTATGTTATT 

GAGAAGCACACCAGCCAGGTGTCACCACAACACTAGATCGTGGTCACTCACGGAGGGGTCCAGACATGCCTGGGTGCATTCATGGTGCTATAAATCA

TGAGAGTTTAAATGCTAAAATTCATATATCTTATTGAAGGAGGAGGATAGAACCTGAGAAACCAAACTGAGGTTGATTAAGTGACCAAAATACAGTT

AGAACTTTTTGTTTTGTTTTTCGAGACAGGGTTTCTCTGTATAGCCCTGGCTGTCCTGGAACTCATTCTGTAGACCAGGCTGGCCTCGAACTCAGAA 

ATCTACCTGCCTCTGCTTCCTGAGTGCTGGGATTAGAGGCATGCGCCACCACTGCCTGACTAGTTATGACTTTCACATACTCACTGAACITGGCATTC 

TTAGCTTTAGTGCCCTTCTGGTGACGCTGCCACACCTAAGAGAGCAGAGAGTGCTGTTGGCCTCCCCTCTCCGTACACCTGGCCTCTGGTAGCCCGA 

GAGTCACAGACTCAAGACAAGGTGGTTTGATCCGGGTGATGGTTCAAGTCACCATTACTAATTATCTCTCCTCGCTGTCATTCTGGATCCAAGTTAC

TTCACTTCAAAGTGTATCATAGTCTTATTGGCCTGACTAGGGTGTGTGGGGATCTGGAAAAGAATAATCACAAACAAACATCTCTAATTACTCACTA 

TGAATTTGTACTGTAAGAAACACCTCACTTTTAAAGCATGTTGCAGGGCAGAAGTCTGCTGGCAAGAGGGAACTCCCTGCCCTCCAGGAGT 

GCTCGCAAGGGCTCAGCTCGCGCAGCCGTGGGCAAGAGTGCACTCAGACTCCCCTCCAAGCAGGCTGGAAAAGCGACAGTTCTGTCCCAGCAAAACA 

TGGAACGTAAGTTTCAAATGTATCAAAGTTAAGGGGTCAGACTACAAAGAACTAAGAAAAAGTGTGAACAGCTAACAAAAGAGTCATTTTTTTTCCC

CCTCTATGGGACTAGAATTACACATGAATCAATTGTCAAAGCTTTTTCTAAAAAATGGAACTGGCTGGAGCTCGCTGGAGTCAGTACATGCAGGTCT 

TCGCCCATTAACTCTATTTAATATGTGTGAGGAGCGACTTACTGACTTGCCTGTAGCTATGGAATTAAGAGTCAAGAGAAAGGGGAGGAAAAAAGCA 

ACCTAGATAAATGACTTTATTCCCTCAGGTGACATCCAGTCAGAGCACATCCTTAACACCGAGCAAGTTTTAACAAAATCGAGTCCGGCGAGCAGAG 
CGAGTGAGCATGGCTTCCATTCTGCTGAACTGCATGCCTCTCCACTCGGGCTCTGCCCCTGCTCCTGAGGCAGATCTAAAGAGAAGTCACTCAGCTG
CTCGGAATCCATTTGACGTGAAGGGGTTCAAACTCCAGGCCTATGAAGTGTCAGTCGCTCAAGATTAGGCCGAGCGCCCCACTCTCACTTTCTACCT 
CACACACATAAGTATTAACTTGATCACTGCGCCGCAGGCATCAAAGGGAGAGGAGGCAGGGGTGGGGAGGAAAACTCGCATAGTTTCTCCTATGCAA 
TTATTTTAAAAGATTGAACTGAAACTGGAAAAAGAAGGGCAGAGCTGACTTCTGACAGATTTATCTT^ 

GTATGCCTTTACCAGGAATAAGCCTGAGCAGGGTGCTTAGTCTATGACAATAATTTACTACTCATAATGGAAACTATATAAATTGACTTAAAAACAG
TTGCTGGAAGCTTGGGAAACTTTTGGTTTCTGAAAAACCACAAAAATCTAATTTTAAATTGAGTTTATCAGAGAGAGAGTTAAGATAATTCTCTCCA
TATTTCTTATTTATTGGCCTAAGAGCTTCCTTCAGATGTGAAGCTGGGGAACCCCAATGTGAGCAGTGGGCGCGCAGGGCTCCACAGTGACACACCT 
CCTCAGGTCTGGGGATGGCTCACTCCTCATTCATTCACTTCTGCTCCTTTCATTGTATCTAGTGAAATTTATTGAGCTCTCTAATACGTGCTGAGCT 
GGTCCAGGGGCCAGCGACAGTGTGTGGCAGCCCAGGGGTCTGAACACACGGAACTCAGTCTACACAAGTGTATTATGTACATGTGTGTGTACACGTT
CGGGCACACAGAGGCAAGGCATTCCAGAAGGAGAGAGGGAGGTGCAGTCCACACAAGAAAGTTCCAAACCAAAATCAGTCTCATCTGCCTGTGCCTA
GCAGCAGGCTGGGAGTCAGTTAGTTTGAGATTTCTAGGGCACTGGGCATGGCGGTATGAGCCTTTAATCCTAGCACTTGGGAAGGCAGAGGCAGGGA
TGGTCTCTGAGTTCGAGGCCAGCCTGGTCTGTATAAAGAGCTCCAGGACAGCCAGGGCT^ 

TTTTTTAGGAGTTCAGTGAGGGGAAACGCACAGTCACAGCCCGATCTGCACAAGTCGGCCGCATCCACACTTCCACATCTCCCAGAGCTCTTAGCCA
TCAGTGCTGTCACATGCTCACCCTGAAGGACGGAGCCTCAGTAGTGAGAGAGATCGGGGCAGACTCACTGACAGACTCTTCTCCCATCTGCATGAGA 
CTGGGGACACCTTGCTTCCCTCCTTCCCTTCCAGCCCCCTCTTCATCTTCATTGTCATCCTACAACATCCAGTCTCCCCTCACTGAAGGTAAAGTTT 
AAATTATCACCAGGATTTTGCACATCTCCTTTTATTGCTAAGTGCTGTAGATGTAAGCCCTATCCTAAGCTGTTCAGGGTCTCACTCCTTTTCTCTG 
ATTGTTAATAAGGACTATGAAATTGACTGTACCCAGGTCACCTCTCTGCAGCCTCQ^AGATGTTGAGGAGCCTAGGGCAGGGGTGGGAGTGTCTTTT 
CCTCCTTGTTGAATGTCTTCCATGACGGGGCAGTTGTAGTTTTAAATTCGCTTTGAAGCCCTTTCTGATTTTGAGACCTAATGACTCTCTACATTAA 
GATTAGAAGTCACGCTCTGCTCTGCTCATTTATTTCTGAAGAGAAAGAGGAGAACATGGGTACTCCTGAGGCCTGGGAGAGGAAAGCAGGCCCTCCT 
GGTGTGCTACAGCCATTCCTGTGACTCACGCAGAGACGGGATCGCCTTACCTGCAGGGACATCTCAATCAACATTAGTAACTTCTTGATTCCTATCC 
AGACTTTCTTCCCTTTGACTTGCTGAGCAATTGTGGTGCGTTCCTTATCTTTGAAGTTGCCCAAAAGCTATAAAAGCAATAAACAACAATTAGACAC 
TTTTATGGTGTGCACACCAACATTCACCTTTAAGTGAACAGTGGGAGGAGCTACCTGCCTGGGATGAATTGACCCACCTAGTCCTGTATGACCATAA 
AAGACAAGGTCTTATTCTGTAGCCCAGACTAGCCTCAAACTCATGGTAATCCTCCTGCCTCAGCCTCCGCGGTGCTGAGACTACCTCATCTAGAAGA 
GATAAATCTTATGCTGTTTTTCTAAGGGACCATCACTAAACTGAGAGACGCGTTTCCTAGGGAAAACAGCTCCAATCTATGATAAATACAAGTTAAA 
CGTGTTAGCATTCCTTCCTGGGCGCAGTTCGTATGTGTCAAAGTACGCTGGTCTTTTCTCTTTCCCAACTACAACAGGGTGTTCAGACAGGTGACCC 
AGGCACACTTCTCATGCTGTTGGCGTCTGCTCTGTTGGGAGCCCACGGCAACTCTGAAGTTGTGGAGCCTTTAGGCAGCCTCCCCGACTCGACCACA 
CCGAATAACTTACATCACCTTTTCCATCAGATCCCATCTAGTCATCTGTGCACATTAATATTAATCAACACTCACTCGAAGACTCACTTTTTTTAAT 
TAAAGGCTTTATCATCTTCAAAAAAAAAAAAAAAAAAAAAGGAAACATTTCCACATTATAAAGTTGTGAGGTTTGGGTAGCGTTCTTTTCTTTTTTC 

TTTTTGGTAGTTCCAAAGG

MOUSE SEQUENCE - mRNA

CCTCTTCATGATCGCCGGCAAACTTCCTCCTCGGCGCTGCTTCTAATGGAGCCCCACCTGCTCGGGCTGCTACTCGGCCTCCTGCTCAGTGGCACCA 

GGGTCCTCGCTGGCTACCCAATTTGGTGGTCCCTGGCCCTGGGCCAGCAGTACACATCTCTGGCCTCCCAGCCTCTGCTCTGCGGCTCCATCCCAGG 

CCTGGTCCCCAAGCAACTGCGCTTCTGCCGCAATTACATCGAGATCATGCCCAGCGTAGCAGAAGGTGTGAAGCTGGGCATCCAGGAGTGCCAGCAT 

CAGTTCCGGGGCCGCCGGTGGAACTGTACCACCATAGATGACAGCCTGGCCATCTTTGGGCCTGTCTTGGACAAAGCCACCCGTGAATCGGCCTTCG 

TGCATGCCATCGCCTCGGCTGGTGTCGCCTTCGCAGTCACACGCTCCTGCGCTGAGGGAACCTCCACCATCTGCGGCTGTGACTCACATCATAAGGG 

GCCACCTGGAGAAGGCTGGAAGTGGGGCGGCTGCAGCGAGGACGCCGACTTCGGGGTGCTGGTGTCCCGGGAATTTGCGGATGCGCGGGAGAACAGG 

CCAGATGCCCGCTCAGCTATGAACAAGCACAACAATGAAGCAGGCCGAACGACCATCCTGGACCACATGCACCTAAAGTGTAAATGCCACGGGTTGT 

CCGGCAGCTGCGAGGTGAAGACCTGCTGGTGGGCCCAGCCCGACTTCCGTGCCATTGGCGACTTCCTCAAGGACAAGTACGACAGTGCCTCCGAGAT 

GGTGGTGGAGAAACACCGTGAGTCCCGAGGCTGGGTGGAGACCCTGCGGGCTAAGTACGCGCTCTTCAAGCCACCCACCGAGAGGGACCTGGTCTAC 

TACGAGAACTCCCCCAACTTTTGTGAGCCCAACCCAGAGACGGGCTCCTTTGGTACCAGGGACCGGACTTGCAATGTCACCTCCCACGGCATCGATG 

GCTGCGATCTGCTGTGCTGTGGCCGGGGCCACAACACGAGGACGGAGAAACGGAAGGAGAAATGCCATTGCGTCTTCCACTGGTGCTGCTATGTCAG 

CTGCCAAGAGTGTATTCGCATCTACGATGTGCACACCTGCAAGTAGTGAGCCAGGGCACTGGGAAGGGGTAGATTGTGCGGCTGGATCCATTCATCG 

AAGTCCCATGAGAAGCAGGATCTAGATCCAGGCCAGCCTTCGGCACTGGCCAGCAAGGAGCATGGACTGTTGCCAGCTGCATGTGATAAACGACCTG 

GACCCAGCCGGCCTCGGACGGACGGGCGGCTTCTTTCTCAACTAACGTCTCTCCCCCTGCTCTGGATGGTGTACGGCTTTACAGAGGGGCTTTCTTT 

ATGGTTTTACCAGGGTCTGCTGGGGACAGACTCGAGGCTTACCTTTGCACATGTTAAAGAAAATAAAAATGAAAAAAAAAAATCTACCGCAACAGAA 

CAGGCTGGGCTAGTGTGAGCTCTTGGCCTGGTGGGAAGGACAAGACCATGGCGAGATTCTGTGTCCAAGCTGCCTCTACTCGTGACATTCCAAGATG 

CCTCTGAGGTGGGAACTGTGAAGTAGGACAGAGCCCCGCAGTCCCCTCTTGTCCGTCGACTCCCATTTAAATTGGACATACCTTGTCGTTCTGAGAA 

AAGCCATAGATAGGTGTAGCTGGGATGTAGTGATGGGGAGGCCCCTGGCCAACAGTGGGAGCAAGATCTTGAGTTTTGAAGACCTCAGAGTTCTGGG

CGGCCTGGGAAGCCATCTGCAGAACAGAGTTCCTTGTGGGCTCCTGTTTTCGCTAGCCCTGTTCTGCCCTGGAGCGACAGTCAGATCTCCACGCCCC 

TTTCTGTTGTTCTACAGTGTCCACCTTTACTACGCGTTTTTTTTTTTTTTTTCATGATGACCTTGTAAATAGGTCAGATGTGGAGGC^ 

TGGCTCCATCCACCACACCCAGAAAGAATGGGCTGCTCTGCCCTTCTCAGCCTTGCTAACCAGCAGACACCGAGGAGAGCAGCGGGGCACCTTAGAG 

AGCAATCTAAACATGGTTGGCAGGTGGGGAGGGTAAAGAGTCCCACTTCCTTTGTGTTAGAAGGCAGACTACCCTGCGTCCTTTTCTCCCATTGGCT 

GAAGTAACCAGAAAGACAAGAGATCCTTAACAAGCCCTTCTTCCCACTTGTAAAAGGGATAGCCTATCTCAGTTCCCAAGGATCTGGATTAGATAGA

TATTCAAAAGAGGCAAGCAGCGAATGGAGGCAGCTCCCAGCTCTGTTCCCGACGCATGATGGTACTGGCTGGGTTTAGTAAGGTGGGTGGGGCTGCA 

CGGATCAATCCATCAACTCCGTCTTAAGGAGAATCAGAAAGAGGAGATAAAATGGGGGAATGGGGCAGAACAAAGAATTTGTCCTTTCCCGCTTCTG
TCTAGGGTCTGCTAATGCTGGCTTGACGAGGGGTCAGCCACTTCTTTCCTGTTGTGCAGTTGGCTTGCCAAGCAGGCTCCAGTAGGCCCTTGCCTGC 
ACTCTCTACCATGTGACCATGAGCACTGCTCTAGGGACACCTCCCATCCCTTCCTAGCACCCCAAATGCCCCTTCCCATCTCTCCTTCCAGAAGTTG 
GAAATCAAGTCAACTGGATAACGCTTGTGTGAGACACTTGAGCAGAACGGATACAACAATTTACAAGTCTCTTCATATCTATGTATTCTATATTAAA
AGTGATAAAGTCATGTTTCCGGGGCGTATTCAAGTAGCTGACAAGTAATTATTTAATAATAGTACATGAGCGCATTGTAATTATCCTCGCCATAGTC
AGGTAATAGCATCCAATGGGAGGTCCCTACCAACCTGCTGTATCCAAAGTTTTGTAAAAAGTTGTAGAAGTTGTTGATCTTTTTGATTTTATATTCA 
AAAAGTCTCTTTTTATAAATATTATTTATTATACAATGTATATACCTTTGAGTTAACTAAGATTATATATTATATAAATATATATATATT 

MOUSE SEQUENCE - CODING 

ATGGAGCCCCACCTGCTCGGGCTGCTACTCGGCCTCCTGCTCAGTGGCACCAGGGTCCTCGCTGGCTACCCAATTTGGTGGTCCCTGGCCCTGGGCC 
AGCAGTACAC^TCTCTGGCCTCCCAGCCTCTGCTCTGCGGCTCCATCCCAGGCCTGGTCCCCAAGCAACTGCGCTTCTGCCGCAATTACATCGAGAT 
CATGCCCAGCGTAGCAGAAGGTGTGAAGCTGGGCATCCAGGAGTGCCAGCATCAGTTCCGGGGCCGCCGGTGGAACTGTACCACCATAGATGACAGC 
CTGGCCATCTTTGGGCCTGTCTTGGACAAAGCCACCCGTGAATCGGCCTTCGTGCATGCCATCGCCTCGGCTGGTGTCGCCTTCGCAGTCACACGCT 
CCTGCGCTGAGGGAACCTCCACCATCTGCGGCTGTGACTCACATCATAAGGGGCCACCTGGAGAAGGCTGGAAGTGGGGCGGCTGCAGCGAGGACGC 
CGACTTCGGGGTGCTGGTGTCCCGGGAATTTGCGGATGCGCGGGAGAACAGGCCAGATGCCCGCTCAGCTATGAACAAGCACAACAATGAAGCAGGC 
CGAACGACCATCCTGGACCACATGCACCTAAAGTGTAAATGCCACGGGTTGTCCGGCAGCTGCGAGGTGAAGACCTGCTGGTGGGCCCAGCCCGACT
TCCGTGCCATTGGCGACTTCCTCAAGGACAAGTACGACAGTGCCTCCGAGATGGTGGTGGAGAAACACCGTGAGTCCCGAGGCTGGGTGGAGACCCT
GCGGGCTAAGTACGCGCTCTTCAAGCCACCCACCGAGAGGGACCTGGTCTACTACGAGAACTCCCCCAACTTTTGTGAGCCCAACCCAGAGACGGGC 
TCCTTTGGTACCAGGGACCGGACTTGCAATGTCACCTCCCACGGCATCGATGGCTGCGATCTGCTGTGCTGTGGCCGGGGCCACAACACGAGGACGG 
AGAAACGGAAGGAGAAATGCCATTGCGTCTTCCACTGGTGCTGCTATGTCAGCTGCCAAGAGTGTATTCGCATCTACGATGTGCACACCTGCAAGTA 
G 

HUMAN SEQUENCE - GENOMIC 

TGCCTGCCTTGGCCTCCCIAAAGTGCTGGAAATACAGGCATGAGCCACCGTGCCCGACCTGGGCAGAACTACATTCTGGATGCATCTTAAACCTCACC 
TCAACCCTAC^CCCTCCCTTTTGTTGTTTCCTTTCTCCCCACAAGTCATCTCAGAATGAGGCTCCCAGCTCCAGGCTCCAGTTGCAACCTCACCCCC 
ACCTAGACAAACCCTCAAATCAAGAAAAAGAAGCCCTGGCAGGGCGTGGTGACTCATGCCTGTAATCCCAGCACTGTGGGAGGCTGAGGTGGATGGA 
TCGCTTGAGCTCAGGAGTTCAAGACCACCCTGGCTGGGCAACATGATGGAACCCCATCTCTACCAAAAATTTTAAAAAATTAGCTTGTTGTGGTGGT
GTATGCCTGTAGTCCCAGCTCCTTGGGAAGCTGAGGTGAGAGGATCATTAGATCCTGGGAGATGGAGGTCGCACTGAGCCAAAATTGTGCTGCTGCA
CTCCAGTCTGAGTGACAGAGTGAGACGCTGTCTCAAAAAAAGAGAAAAGAAAAAGAAATCCAAAGGAAAAAAAGAG 

GAGAAAAGAAAAGAAATTTCTGTTGGCCAGGCCCGGTGGCTCATACCTGTAATTCCAGCACTTTGGGATAGGCCGAGGTGGGCGGATCATGAGGTCG
GCGGATCATGAGGTCAGCGGATCGAGACCATCCTGGCTAACATGGTGAAACCCCGTCTCT^^ 

GGTGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGGGAATGGTGTGAACCTGGGGCGCGGAGCTTGCAGTGAGCCGAGATTGCACCACTGCAC 
TCCAGCCTGGGAGACAGAGCAAGACTCTGCCTCAAAAAAAAAAAAAAGAAAGAAATTTCTGTCTCAAAAAAAGAGAAAAGAAAAGAAAAAGAAATCC
TAACACTCATCCATCCTTAAGTAAAAGCAACTCACCTTGTAAGTCAGTCTCCACATTTAATCCAACTACCCTACACTAGACATGATGGGTGAGGAGG 
ATACAGATAGTTAGGAGGCCTTTGGCTACAAGTAACAGCCTCAAGTGGGCTAACGAACAAGGAAAATGTATTATTTCATATCACAAGAAGTTCCAAA 
GCAGGACTCCTCTGAAGGCAGTTAGTGTCTCAGCAATGGTGTCAGAGATGAAGTTTGTTTCTCTCTCTTGGCTCTGTCTTCCTCTGCCGGTCAGCTT 
CATCTTCAGGCAGCTTTCCTCATAGTCACAGAATGGCTGCTGCAGCTCCAGGTATCCCAACCCAAAACAACAATGTCCAAAAGCAGGGAAAAGATAT
TTCTTACTGTGTGTCATTTTAAGAGCCCCAAAAACTCTTCCCAGA 

AGGCCAATCCCTGGCAAGCAGAATGGGTCCCCGTGATTGACTTAGATCAGTCAGGATTTACCTCTGAGCAAGGTAGGGGAGGAGTAAAGACCAGAAC
AAACTGGGGGCCTCTGCCATCCTGGAGGAAGGGACCACCAGCAGTGTGAGCTA 

GCAGCTGAGGCTGGGCATGGTGGCTCATGCCTGTAATCCCAACACTCTGGGAAGCTGAGGTGGGTGGATCATTTGAGGTCAGGAGTTCAAGACCAGC 
CTGGCCAACATGGTGAAATTCTATCTCTATTAAAACTACAAAAATTATCCAGGCGTGGTAGCTTGCATCTGTAGTCCCAGCTAGGAGGACTGAGGCA 
GCAGAATCATTTGAACCCAGGAGGCAGAGGTTGCAGGTTGCAGTGAGACAAGATCACTCCAGCCTGGGTGATAGAGCGAGACTTCATCTCAAAAAAA 
AAAAAAAAAAAAGAGCAGCTGATTGCCACAGAGGGAGGGAGTGGATGCT<^CCGATCATTTTAC^TGTCCCAGGTGCCATTTCAAACA(^AGCAGCTA 
TCACTCTGCTGAATAGGCAATTGTCTTTTTGAGACAAGTGTTATTATACCCATTTTGCAGTTGATGAAGCAGGTACAGAGAGGTTAGATAATATGCT
CAGTGTCACATAGTAAGAGGCAGAGCTGGGATTCACTGGGGTTGTGGTAGACCTGGGCTCTAACGCAACCACTCAGAGAGGATTTCCCAGTCAGACA 
TGTGAATCCCCATTCAGGCCTTATCTAGAAGCCAAGACAAAGGTGCATAAGAGGTGGGTCCTGCTGGGATGTGGGAAATGCCAGGCAGATGTCCCTG 
CTTAGCCCAAGGGCCCTCCCACGTCTGTGCCAGCTGTGCCTAGGCCCTTGCCAGTGAGGCAGCTGTGACCCCATTCCCAGCCCACTGGGCCTCCATC 
CATGTGATTAGGCTCCCAGGACTCAGGGCCCATAGCTGGCCACCCTGAGGTCCCAGCAATGACAGCTACTGGCCATACTGGTCTGTGGAGAAGGTGG 
TTAGTTGGCAGAGTTCTGGAAGGTGCAAACCAGGACTTCACTAGAAATTTCTTCACCATCTCCAAGGATATTATCAGGCTTGTTCTATTCTATCTCC 
CTAATGTGCCCCCACAAAAGTCTTCCCCGCAAGACCCCTGAGATGGAGAGAACTTCCACACCTCTGTTTTGTTTCGTTTTGTTTTGTTTTCTTGAGA 
CAGTTTAGCTCTTGTTGCCGAGGCTGGAGTGCAGTGGCACGGTCTTGGCTCACCGCAACCTCCGCCTCCTGGATTCAAGTGATTCTCCTGCCTCAGC 
CTCCCAAGTAGCTGGGATTACAGGCATGCGCCACCATGCCCAGCTAATTTTGTATTTTTAGTAGAGACAGGGTTTTGCCACATTGGCCAGGCTGGTT 
TCAAACCCCTGACCTCAGGTGATCCGCCCACCTCGGCCTCCCAAAGTGCTGGGATTATAGGCATGAGCCACCATGCCTGGCCTGGTTTTGTTTTTTA 
AGGAAACATTTACGTATTTGGTCCTCTGATGAATGACTGTCCCCAGCAGATGGCAAGTTCCTTGCAGGCCAAAACCTGGGCTGCTTTTGCTCATCAG 
TGCTCAGCTGCCAAGGGGCAGCrGCACACAGAGTTGCTCCATGAAGATGCGTTGAAGAAATGGACCAGGTGAAATTCACAGAGCACCTACTGTGTGC 
ATCAACCGGAGCTGGGTTATAAGGAAGAAGGCAGACTCCTTGAGGGTAGTCCCATTCCTCGTGGAGCCTTGTCCACAGCAGGCATGCAGTCAATGTG 
TGCGATGAGAAATACAGACACTTTCCGCCCTCAGAGCCCAGGCGGAAGGTGGGGGCCCATCGTGACTAAGGTTGGAGACAGAAGAGACCAAAGATGG
CTCCAAGCCCCTCCTGGGTTGCAACCAGGGCTCCCTCCCCTTCCCTGCCTGCTTATGTCCCATACACAGGGGAGTGGAAGGAAAACTCAGGAGCTCT 
AGTTCCAGGTGACTGAGAGAAACCGTGGCAGCCACACACAGAGGTCTGGCTCTAACTCCTCAGGTAGGATCACAGCCATGCTAGGAGCTGGCCTTGG
GTTAGAGCCCCAAAAGCCTTGGGCAGGCCAGCCATGTCCAGGGATGGGGCTCAGGGTTATGAGGCAATGGCAGAATAAAGGGGCACAGAGGGAGGCT 
GAAGAGAAAGAAAGGGAAGGAGAAAGAGGTGTGGCACATGGCTGGAGAGGCAGCACGAGGGGCCACCAAGCCCCAGATGCCCCCATGTGGCAGTGAC 
TTCCCAACAGGCTGCCCACCCTTAATCTCATCCCACTGGAGGCAGCACAGGGTCCTATGAAAGGGACACAGGAACCAAGCGACAGAGCAGGGTAGCT 
GGGGTGGGTTGGAGTTGTAGGGGCCAGGGCCACTCAAGATAGGGGCACTGAGGACCCTGAGGGGCCACAGGGAGCCCAGACCTGAGTCAGTGGCCCT 
TCTATCTGAGACCTCGGCCCTGCCTGCTCCCTCTCCTCTGGGCCCATCATCAACAGTGTTTTCGAGGGGCTGCTAAGTGCTGGATCCCTCACCACTG 
GGGGTTCAGCAGTCTTGTGGAGGGAGTTTCTTTGTGGAGATTTCACAGTTGGAGACGGACAACATATGCAAGAAACTGAGCAGATATGTCTGCTATG 
TAGTATGTATAATCAGCCGGGCATGGGGGCTTACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCGTACCACCTGAGGTCAGGAGTTCAAG 
ACCAGCCTGGCCAACATGGTGAAACACTGTCTCTACCAAAAATACAAAAATTAGCCAGGCATGGTGGCGAGCGCCTGTAATCTCAGCTACTTGGGAG 
GCTGAGACAAGAGAATCACTTGAACCCAGGAGGTGGATGCTGCAGTGAGCCAAGATCACACCACTGCATGCCAGCCTGAGCGAC^GAGCAAGACACC 
GTCTCAAAAAAATAATAATATGTGTAATCACATATTAATCACATCAGTGTCAGGGCTATGCTGGAAAATAAAGCTGAATAAGAGGCCTGACAGGACA 
GGGCTAGGGAGTGGCAAGGGGTCTGGAGGGGCCTCTCTGAAGTGGATATTTTGAAGTGGGTCTTGGAAGGGGACATCAGACAACATTATCCCGGATA 
AATCCTTCTGGAGAAGGCTCCTGGGCGCCTGAAGGGGGACTGCTTAGCAGACACTGACTGGGGCAGGAAGAGAGACCTGAGAGAGAGGCTTACTCCT
TGCCTCAAGAGAAGAAGGAAACCCAATATTTTGTATTCACCATCCGCAGTCTGGCATTCCGTCAGCACCATTCTCATTTAACGCTGACAGCTCCCTA
CAAGGTGGAAAGTGTTATCTCCACTCCACAACCGAAGCC^GGGACTCAGAGATG^ 

TTCTAAAGGACTTTCACCGCAAATCGCTTTCCAAAATGCTCTGGGGAGGCCAGGCATGGTGGCTTGCACCTGTAATCCCAGCACTTTGGGAGACC^ 
GATGGGTGGGTCATTTGAGGTCAGGAGTTCAAGACCAGCTTGGCTAACATGGCGAAACCCTGTCTCTACCAAAAAAACACAAATTAGCTGGGTATGG 
TGGTGCATGCTTGTAATCCXTAGCTACTTGAGCGGCGGAGGCAGGAGAACC^ 

CTGTACTCCAGCCTGGGCGACACAGTGAGACTCTGTCTTAAAAATATAGGTCAGGAGTTCGAGACCAGCCTGGCCAACACGGTGAAACCCTGTCTCT 
ACTAAAAATACAAAAAATTAGTCAGGCGTGGTGGCGGGCACATGTAATCCCA^ 

CAGAGGTTGCAGTGAGTCGAGACCGCGCCATTGCACTACAGCCTGGGCAACAAGAATGAAATTCCACCTTTCTATATATATAATATATAATATATTA
TATATAAATATATATAATATATTATATATAAATATATAATATATAATATATAAATATATATAATATAATATATATAATATATAATATATAATATATG 
TGTTTATTATATAATACATATTTATTATAAATACATATTTATTATAAATACATATTTATTATAAATACATATTTATTATAAATACATATATATATAA
ATACATAAAACTGAATGCCCTGGGGAGCTAGAGCTAAGACAGGAGGAACTACTGCCACAGGGAGAACCTCTCAGGTGCCCACAGCGGCCCCTGTGAT 
CTCGGGATCCAGGAAGAACTGGTGACATCCGCCTGCCGCTGGCAGCCTCCCATATGTCAGCGGAAGAGGGAGGCATCCGCAGGGTGGATCTGGCGGT 
GCCCCAGTGCGCCCTGGACCGGCATCACCTGGGCCCGCTCGCCCTCCGCCTGTCTGGAGAGGAGAGGGGAGCCTGAAGCCATGCACCCCTTATCCCC 
AGCTAAGCTCCGGGATTGCGAGGTAGATTAAGGGCATTCACTCAGGAGGGCACCTGCTTAACCAAGATCCTCCGAAATCCGGGCTGCTCACCAGCAA 
CAATTGAGGATGCCGGGCCCGCAGGGAGCGGAGTGGGACCAGGCGCGACCCCTTCCTCGCCTGCTGACCTCCCGCCTTGCTATCAGCACTGCCGCCA 



GGGGGCGCGCCCGCGCCGCTGGGCTCTGAGCCGACCTCCGCGTGCACCTGCGCTCCTTTGTGTGTTAGATTAACATCTGGTGCTTACCTACTATGCG 
CTTACTCGGTGCAGGTCAGGGGCTTCCTGCATTTCCAGGATGCAGCCTCTATGCC^GCAAACTAGGCTGGGCAGTGGAAAGAAATCCGATTTACTCC 
TGCGCCCACC^CCCACGCTAGGGCCTTCCTGCTTCCAGGCTCTGCTGTCCA 

TCCTACAGCTCCTAGGAGAAAGTTAAACGCCCTCCTCTCCAGGTCAGAGGCCTTGGAGCCCCCTATGTGCTCCCTTTGCACTGTTCCTCCATAAGCC 
CTGTGGTGTTAGGCCTTCGGCCCTTCGCACTTCCCCACCTGCTTCCCCGCTGCAACGGCTTCGCCTCTCCCTTTACCCCCCCAGGTTGGTCCGGCTC 
CCGACTCCCGCCTGGCAGCCCCAAGACCTCTTTC 

GGTAGCCCCATGAGGGCGGGCGGGGGCCTGGCCACCGGATTCTTTGTCTACACGGTGCCTCCGCAGGGGGTCCAGTGGAGCCTCACTGAATACACAG 

CGGCAGACTCTGAGCCCCGGGGGAGTGCACATCAAAAGGAACCTCCGAGCCAGCTAGAGAGAGGGGTTGGG^ 

CTAAGGTCCCACACATTCACACACACAACAAAAAGTGCTTCCCACGTGCCACC^ 

TCCAAGAGACCACCCTTCCTTTCAGGGCGATGGCCTCCTGGTCCACCGGCTGATGCCTCCACCCCGCCCTCCCTTCCTGGTCTTGGAGTGAATTTAA 

CCCTTTCTCCACCAAGTGAAACCACCCTGGGGCTTGGGGTACCTGTTAGAGAGAGGGTGTTTCTATCTGTGATGTGTGGTCAGGGGCGCCCGCCCTC

CCCTGAGAGAAACCATTCCTGCGTTTCCACCCGCGGCAGCGCAAGGGGTGCAGGAGGAAGGAAATTAAACCTGTGTTGAAGGAAGGACACCTGGGGG 

AAAGGAAGGGACTCAGGCTCCCCTTGTCTGAGGAGCTTTCAAGTGGCGGAAATCCCCACACACACCTCCATTCACTAGGGGCCTGGCAGGGCCGCTG 

GTAGGTTCCAGTCCTCTCCTCTGTCACTTCCCAGGCGCACATGGCCAGCTCCTAGATTCCACCCTTCAACCCACTTGTTTCCTGCCTGATGCACATG 

ACCTGCGTCTGAGTTCTCCAGGCTGCCTGGAGGAGGCATTCAGGTACACATGACCCCTCAGGGAATGATGTTCTTGCCCACAGCACCCAGGATGCTG 

GCTGGGCATATCTCAGCCAACACACACTCCCACCGACAGCCTCGGATCTTTTGCTCCCACCTCACAGGGTTGGCACCTGTGGAGTCTTGAGGCCCAG 

TAATGACCCCAAGAAGCCATCAAGTCCAGGCCCTGTCGCCCCACCATACTGCATCCACAGTGGTGTCTGAGTGGCCAGGGAAGGTAGCCCTCCCCAG 

CTCTCAGCACCACTGTCATGGGGCTCTGGAACTCCCGAGCCCTTGGGCAGGACTCCAGCAGGCAGCTCTGATCAGAAAGCCACCTGGGAGCCCTGTG 

GGTGGGACAGGACTCCAGGTAGGACCTAGCAGGCCTCCATCCAAGAGGGAGGTATGTGGAAAGCTCTCCACTGCCAGAGCCTTGGTCCTGAGGCTCT 

GCATTCATGGCAATGGTAACCCTCTTTCCAGGTGAATCAGAAACAGGGCACCCCAGAATCAGCCCAGCCATGAGGAGCATCTCTAGGGATCCCCATC 

TCTGCAGCCCCCACCAGGTTCACCACTCCTGGTCAGGTTCACGATTTTTCTTCGCCTTGCAGCTCCTTGGTGCTCTCCGCCTCCCCCCTGCCCCTTT 

CTTCCTCACACCGGTGGCTCAGCCTCCCCCCGACAGTTTTCTTGGTTTCTGTCTCAGCGTCTCTCTCATCTCTGAGCCCCTCTCTCTCTCTCCTCCT 

CTCGGATTCTCCTTTTCCTCCCCTTCCTTCCTCTCTTTCAACCTCCCCTCCCACCCCCTCCAGTCCAGCAATGGGCTGGGAACGCAGCAGGAGCCAT 

GACAAGCCCAGGCGGCTCTCCCGACCCTTGGTGCCCCCGAGGCCATTTCCCCGCGCTCCCTGTGCCGGCAGCAGCCGCGTGCGGAGAGGGCTCGCCG 

ACCAGAAGGGGCAGCAGGTTGGAGCTGTGGGTACGGGGGGCTCAGATCAAAGGCCAGGAATGTCTGGCCCCCTGGCGTAAGACTTCCTTCAGCCATG 

GCCCTCACCTCGTGGAACGCCCGCCGTTCCAGGCCTCTGTGGGACTCGGGTCGGCTCATTCTCTGCCCTTTCTGCCACAGTTCCCTACACAGCGGTC 

CCTGCTCCCCACCGGCIAGTGCTTCCTTCACCCC^GACCGGGGCTGCGCAGAGTCCTGGTGCCTCAGGTGGCCTGGGAGGGGGCCTAGGGAAAGGCAC 

CCAGGTGACCGGGCGTGAGTCCCCAAAACCTGACCATGATGTCAATACCACCCAGGGCTGAGACCTAATTCCTCGCTCTGGAGAGGGAGAGGAGAGG 

AAGCAAGCCCGAAGGGGGCCTGGAAGGACTCCTTTCCTGTGGGGAGGGGTGGGTAACTGGAGGCATTTGGTGTCCGCACAGGGCAAGGGCCTTTATC 

GCCAAACCCCAGGCTGGAGCTCCCTATCAGTAGAACCGGGGCCTCAGGCGGGACGGTCGGGCAGTGAGGGGCGCTCCGGAGGGCAGCAAGGTCTTCC 

AAGGACCTGGGCCGCCCTCGGCCAGAGCTGCGCGTTTCACCGGTCCCTGATGGTCCAACCGAGAATCCCACTGAGGCTGTGGGTCACGCCTCCGACC 

CGGGCCAGAGGTCGGGCCTGAACCCCTCAAGGAGGAGAAAGTGACCCGACTCCGAGAGCAGGAAATGCGACTGGCCGCTGGTGTCAGTTGCAGGAAA 

TGCAAAGGCAGCAGGAGGTCCCGATACCGATCTGATCCCTGACAAAAGACTCAAATGACACTCGGCTCCGTCCCCCGGCGCCGCCCGCAGCCAGACC 

TTCGAGCCTCCGCGGACTTTTCCGCACGTTCCTCGGACACCCGGGAAGTGACCTCCCGCGGCCAGGACGAGAGGCCAGAGAGCGGGCGGACTCGGGA 

GGCTGGAAGCTGGGAGGGGGTGCGTTCCCCCACCCTCTGATCCTGACCTGCCCGAGCGGGGCTTCTGGAAACATCCCTGTGCTTTCTACCCTCGATT 

CCTCGCGGGCCCACGTCCCAAaSGGTCACGGTCCCCGCCAGCAAAGGAGCCACCCGTTCATGCCCACGCCAAGCCCTCGCCATGTGAACGCCCGGGT 

CAAAGCCCCCGCGCCCTAACCCTGGCCAGCGGGGGACCGCGGCGGTAGCAGAGGGTCCGGCCCACCGCGTCCCCCCGAAGCCCCGCGGCGTCCGGGG 

CGCGGGCTTTGCCCAAGTTTGCTGGGCGATGCCCTTCCCGGAGCCCGAGTTGTGGGCCGGGCGGGAGGGGCGCGTGATTGACAGGCTGAACTACAGA 

CTCATCTCTTACCTTAGGCCGCGGGCGCTGATTGGCTGCTCGCTGACATCCTCAAACCCGGCTGCTCCGCGCTGGGCTCGGGAGGGGGGCGGCTGCG 

GGTGGAGGTGCGCTTCTGACAAGCCCGAAAGTCATTTCCAATCTCAAGTGGACTTTGTTCCAACTATTGGGGGCGTCGCTCCCCCTCTTCATGGTCG 

CGGGCAAACTTCCTCCTCGGCGCCTCTTCTAATGGAGCCCCACCTGCTCGGGCTGCTCCTCGGCCTCCTGCTCGGTGGCACCAGGGTCCTCGCTGGC 

TACCCAATTTGGTGGTAAGACTCGCCTCTTGTCTGCCCGCGGCCCGGTTTCTCCGCCGCGTCCGGGGAAGGTGGGGGCGCCGGCTGGGGGTGGGCCG 

CTGCAGGGCTGGGGCTCCTCTCCTCGCCTCCCTCCAGCCCCCCACCCGCCCACCCCTCCTGGATTTTCTGCTGGTGTCGCCTTCAGAATCCAACTCC 

TGGCTCCTCTGATAATCACCATTTTCCCTGCTCCCTOCTGCAGGGGCTGGAGGACTTTTCCCATCTCAGAAGGTTTCTTATTCTTAGATCCTAACCC 

TCTGCCCGTCCCAATCTAGAGGCCCCCGAAGACCCCCCATTTGCTTGCAGTTGGGTTTCTGGAAATCCAGCACTGTCCCTGGGGCCCAGCCACTCCT 

CTTCTCCATTTTGGGGTCTTCATGGGCAAAACCATCCTGTTCCTTCACTGCCACCCCACCTCTGCCGGCGCCTCCATCAAGAGATTCCCCATAATAA 

CCCCCCTTTGCCTTCCTATTTCAGACCCCAAAGGCTTCAGGGCTCCTCCCGGGGCAGGGGAGTGGGTGGGGGGATTTCATCAGATTCTCCCCACCGC 

TCTAGATTCATCTTGAGCCTCTGAAGTGAATATGGTATTTGCTGATCCGGGATGCCATTGAATTCGGATTTTATTCATTTCTTGCTTGTCCACCCTG 

GCAAAAGGGGGGTTCCTTGAAAATGTGTTGTCTGGGAGGGGTAGGCTTGTTGTTCTTTGACCAAAAATCCAATTCTCCATGTCACATTTCTTCCTTT 

TGGAAAACCGAGGACCGAGGAGAAAGGGAGAGGGAAGGGAGAATTCCGTCCCTGAGAAATGTGGATACAAAAGACATTTTAGTAAATAAGACTGGGA

AGCCGGGAGCAGTGCTTGGGGGGGGCTGTGCGTGTCTGGGGGGCCTGGGTGAGGAATAAACATCGCTTTGGGAGCCCCAGCTCCAGGAGTCCCCTTG 

TTTTCCGAGCGGCCCCATCGCCGCTCTCCCACCCTCCGCAGGTCCCAGGGGCCAGAGGGGAAGACACTGAAACCAGTTCTCTAAATTACACTCCGCC 

TCTCCCCAGTCTCCAGCCCCCTACATTCCGGAAAACTTTAATTAAAGAATCTTCCCCTCCTGGACATAGGAGAGCCTTGGCCTGGGCGGGGGGAGGT 

GAGGAGGAAGAATTTGGGATCTGTGAGTGAGAAGTGGGCTGTGGCCCCCGGGGTGAAAGTCAGAAAGAAGTGCCACCACGGTGCGGTACCGCCTGCA 

GGTCCCCAGACTCCCTGCCTTTCGCCAACCCAGAGTCCAGGGTAGGAGGAGCAGAGACAGCGAGACTTGGGGGCCACACCCCGAGTCTTCCAGTTGT 

TTGGGGATGGCAGCATTTGTGTCTGGGTGTTTGGGGCTCCTGGGAGAGAGGGTGGGGACATGGCCAAGGGTCTCTGTTTAGAGGGTGCTAGGAGTTG

CTCCCAGGACAGCCAGCCAAGGGGAAGTCTTTCTTCTGCCCAACCCACACTTTTCCGGGGGGCTGGGCTAACCTGAGGCTTCCAGCACATAAAGAAG 

TCCATGGGGTGGCCAGGGCAGCCAGGGACTTCAGGAAGGAGGTTCGGCCAGTCTGGGGAAAGGCTGCTGCCAACACCAGTCCAGCCTCACCCGGCCA

TTCCTGACCCTCTTCCCATTCCCGGGCCAAGGCCATGCCGTTTTTCGCCTTGCTTTCGCAGAGACTCTGGTGGGTTGGGGGAGGGTAGGAAGGAAAG 

GTTTGGCGGCCCATAGGTGGCAGGGACTGAGGAAGAGAGACCAGTCTCTCCGGCCCTATGAACTAGAAGAGGGCACTCTTCCAGTTGGTGTAGATTC

TGGTGTGAAATGGTATCTTGGGGACCATCTAGATAGGGATTGCCCAATAGGAAGAAACGCTGGGGAAACCTGGAAGGTCAAGACATTGGCCCrAACA 

TAATGCTAGACACATGGCTTGTTTGGGAATTTGGTGGGGGACAGGAATGCrGCTTTGTCTAAGCCAAGGAGAGAGGTGTGTGTGTGTGTGTG 

GTGTGTGTGTGTGTGTGTGTGTGCGTGTGTTCTGGGTCTATGACCCTAAGAGGTGTTTTCTATACACCAGTGTGTCTCTGTGTGCAAAAGTGTTTGT

GCCTGCAACTGGGCTACTATGGATGTGAAATCGTGTGGGCGTGTTGGGGGATCCATGCCCACGACAGGTTGTGCTGTGTGTGTCTGCACACGAGTGT 

GAGCAAGTGGCTGGTGCTGTGATGTGTGTATGTACATGTGTAAGAGACAGAGAGAAGTCTAGAAGTATCTTTGCCCATGTAAAGGTCAGAGCCTGTG 

TCTGCCTGTGACGCTGGGCTTGACTGTGTTCATGACTGGGCATGTGAGTGTGTGTGTGCGTGCGTGCGTGTGTGTGAGTGTGTGTGTGCGTGCGTGT 

GTGTGTGTGTGTGTGTACGTGCGCGCGCATGCCTTCATGAC^GTGTGGGGAAAGCCCC^CAGATGTCTTTGTGCATGAATATCAGTGCTCATGGCCA 

TGACCTGATTGTGCTTGTAAGGAGGAGAGTGAGCCTCAAGTCCATAGCTGTGTGAGGGCTGTGTCTGAGTCCGCCTGTGATCCTGTCCCCAGGTGCT

GGGCCCAGGGCTGTGTGGCTGTGAATATCTTTGCAGACATCTACCTGTGGGTATGAATGACACCTGAGCTTGGGTGTTTGGTGGGCAGCCATGGGCC 

TGGAGGGAAGGTCCAACAGGAAGCAAAAGTCTCACAAGGTGGCTGCAGGGGGCACAGCTGGCCAGGGATGGCAAGCTAGAGAGGCAGGGGCACAGGC

TTCCTTGACACCAGCTTGTCCTGTCTGTCCAGGCACTAGAGTCACTGGCCCTTCTCTCTGAGGTTGTTTTTCTTTGGTCTTGGAGCTGGGGTGGAGG
GTGAGTCACCTTCTCAGGGTTAGGGCTGGGGCTGTAGGGTAGATGGGCCTCTGATTCCTGATCCCATTTCTGCCCCCACTCCCCAACCCAGCCCCCC
GCCTGGCAGTCAGATTCCAGTACTTTGAGCATCTAAGCCCATCTTGGCCTGCCCTCTTTGGCCCACCCAGCTCATGCATTCCCCAGGAAGGATTTAC 
AGGGGAAGTCATGTAGCTGGGATTTCTGTCCACGCGGACCCTCAAGCATGACACCCTGAGTCCACACTGCTCAGACCCCAGTCCTGACAGAAGGTTG
ACCAGAAGGGACAGCTCTAAGGGGCCTCAGACTGACACCCCTGGAAGGAAGAGATGTTCTCCAGGTAGTGACAGAGCCGGGACAGGAAACAACAGCA 
CTGATTTCTCGAATTCCCCTTGTCAGTCCCAGTTAGCTCCTGGTGGAGAAACTTG 

CCTATGTTATTAGGGCCTAGCCCAGCACCTGCTACATTCCAGCGATGGACGTGCAGATGGAGGGGGCCTGCTGGAGATCAGGAGCTCACCCTGCCGC 
TGTGTGACTGAGGACCAGTGGCCACCCATCTGGGTCTGCTTCCTTACACTTO 

TGTTTTGCAGAGGACAAGGGCAGTGAGCTGGGAAATAGCACACCTGAGTTCTCGCCATCCCAGGACCCATCCTCCCACTCGAGCACCTGCCTAAGAC
CTACCTCAGTTTCCCCATCTGTCAGATGTCCTGCTGCCTCTCCCACCCACTCCCACCTCCACCCACCTTTCAGAGGCTTGTGAACATGTTCAGTCAA 
ACCCCGTGGGGGTCAGTGGGCTGGGCCCAGGGAGGCACCTGGAGACTGCGGGGGAGGAAGGCCCCAGGCCAGGTGTTAACACTTGTAACAGGCAGGA 
ATTACAGCTCGCTGGGGCTGGGCTGGCTGGGGCTGGTGAGGAGCTGAGCTAGCTGGAGGCTGTTGATCCCAC^ 

GGCAGATACCAGATCGGATGGCCAGGCTTTAACCGGGTGGTCTCCCTTC<^GGGCAGACCTTTCCCTGGGGGGGATATGGCGGGGAAATTAGTCCAG 

GTCTCTTGGAAGTTAACAGCCCTCCCGCTCACCCCGCCCCCATTATGGGTGGGGCrTCGGAGTCCAGAGGGAGCAAAATTGGTGGCCGAT 

TATTCTTCCGATAGGATCACATTTTGCTTAAAAAGAAGAAAACAGGTCAAACGTTATAATAATACCAAGAGCCTGAGCTGAGTTATGTCCTTAGAAG 

AACCTGCTGTAATATGTATGTGGATGGCATGGGGGAAGGGGACGCTCTCAGGGAAAGTTCTAGACTCTCAGAGCATTTTGTCAACAGCTCCGGGGCC 

AAGAGGTGAGAGATGGGTGGAGTCTGAGTATCACCCCCACCTCCACCCCAGGGATCCTGAGGGCTTGGGTCTGTGATGTCTGCCCTGCAATTAATCC 

TGGCCTGGTGGGAAGACCTTTGGCGATGAGGTCAAAAGAGGCAGGAGCTGGACTTTGGTGCCAAGTCCTTTAGGAGGTGCTTGCTGCAGCTTCACCA 

GTGGCAAACCCAGCCAGGCCCTACCAGGGCCCACACTCTGGCATCTGGATACATGCTCAGTCACTGACAAAGGTGGAGAAACTGAGGCAGACAGCGA

GGGCTCTGGGTGGAAATCTCCAAAGAATGCACTCAGCCTAGCAAGCATCTCTCATCCTCCTCACTACAGAACTTAATGCTAATAAACATTAATGGTA 

CTTGGGCAGATGTAGAATTGGAAGAAGTCTCTCCTTGGTCTGGGCAGGAGACAAGGAGCCAGGGGATAAGGAGCCAGGGGATGGCACAGGAAAGGTG 

ACTTATCACAGAGGAGGATGCAGCTAACAAACCCTTCCAGGGAAAAGGAAGTGGGAAAAAGGAAGATTTTTCTGGTCTCAGAATGTGATGAGCCAGG 

AGTTGAGC&GAACCTTTTCCACCTACCCTGATGTGAGCCACAGAAACCTTCTCC^ 

CTCAGGGAGGAGGTATGCCTAGCCCCACCCCCACCCCCCGCCAAAGGTTAGAGCCATCCTGATATTACATCTGTACATCCATGTTATCTTGTATGCT
GGAAAATGGTGAGAGTGGGGAGATGGAGAGACCTATGTCTGAATTCCAAGTTTAATACCAATGGACCCTGTGACCTAGGATGAGTCCCTTCCCTTCT 
CTGGGCTTCATTCAAGTCCCTGCCTCCTTCTGGGCTTCATTTTCCTTAACCACATGGAGGGGTCCCTTTTGCCTCTCAGAGTCCCAGGGCTATGACA 
GGTTCTGGGCTCGTTGCTGTGAAGCATGACACTCTACTGCACACATTCCGGGCCTGTATTAGACAGTCCACCCGTTTCTTTCTGTGGCTCCATCTGA 
CCTAAATCGCAAAGAGCTGGCTGCCCCACTCCTGCCAAGCCAACCTGCTTGCACTTCCAAACCTCGCCCC^AACACTCCCACTCTTCTGGAGCCAGG 
CCATCTGCATTCACATCTGGCTTTTTCATGTACTACCTGGGTAACTAGGGCAAGTTAACCTCTGCAAGCCTCAGTTTCCTCATCTGTAAAATGAGTG 
TAATGATTGCCTCTATTTC^TAGGGTTTTGAAAATGATTGAGTGAATACTTCTTAGAATGATACCTTGCATACAGGAAGCATATCGATGTTAGCTAT 
TTTCACCTCCTATCTGCAAGCTATTGGCTCACTGAAGCCCTGGAGCCAAACTTGCCTCTTCCAAGAAGCCCTCCCAGATTTGCCCCACCTGGCTCTG 
TTTTCCCCTTTTCTTCTTTTTGCCTGTTTGCCCCACAGTGCTTTAGCTTTATTCTTAAGTTGAGGGTGCTGGGCCAGGCTAATCTCACTTCAGCTCT 
TTACCTGCTCAGGAAGGGGAGGGCAGGGTAGAGAAGGTTGCTGGAGTGTCCCCTAATCATTCCCCGACCCCCAGAGTCTGCCAGACTCTTGCTGGGA 
CCTAGAGCCAGGGGGCTGGGCACAGAAGGGGTGGGGGAGGCAGGCTTCCCAGGGGGAGTGGCACCAGGGCCGCCCAGAGCAGCCTGAGCAGAAGTTG 
CTTCTCTAGGAGCTGAGAGATGATTCAGAAGGTGATTTATAGTCGGAAGGCAGAAAGCTTGGCTCCTGGCAGAAGCTGCTGCTTAATTACATTCTTG 
GGGCGGGGGTAGGGGGGTGGAGGTATGTGTGGGGAGGCAGCAGGGGCCAGACTTCTGCCTCCTCCCCATCCAGGCCCTCCCTGTCCCTTTGGCTGTC 
CCGGCTCTAAGCAGGGTCCTTCTTTATCCCGCTGCCAGAGAGGGAATTCCCTGCTCTCAGTGAGAGGCTAGGCCAGGATTGCAGGAATAGCTTCTGG 
GAGTGGAGAAGGTGGGAAAAGAGGAAGGAAGCTGGATCCCATCTCCCGGGACTTCTCCAAGGGAGACTGGATGGCGACGGGCTGGAAAACGGGCCTT 
CTGGGAAGGAGCACTGGGTCTCCTCCCTGCTTCrGGGCAGCGGAATAGCTAAGGCACCTGGGCAGAGTTGAGTTTCTTTCTTTCTTTCTTTTTTTCT 
TGAAACAGAGCCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCACAATCTTGGCTCACTGCAACTTCTGCCTCCCGAGTTCAAGCAATTCTCCTGC 
CTCAGCCTCCCGAGTAGCTGGGATTACAGGTGCGCACCAGGATGCCTGGCTAATATTTGAATTTTTTAGTAGAGATGGGGGTTTCACCATGTTGGCC 
AGGCTGGTCTCGAACTCCCGACCTCAGGTAATCCACCCGCCTCGGCCTCCCAAAGTGCTGGGATTATAGGCGTGAGC<^CCGCACCCAGCCAGAGTT 
TCTAGAATCTTCTGAGAGGCAACTCCACGCTCTCCTTCGGGCACCCTGTCCTGCTGCTGCCTGGGAGGGCCTGCCTGCAGGTTTGCTCCTTTTCTGT 
TGCTCAGGTCTCAGTGGGCCATGGGGGCTGGAGAGAAGGTCCTGCGCTCAGAGGCTGAAACCCGGTTCTTCAGCGTTAGGCAGCCTGTCCCTCGTGA 
GCACAGTGGCTACCCCAGGTGCACAACGGTTCTGCCAGCAGGCACGGCCCCATACCCTCATGTCAAACTAATATCTCAGAGAGTCACCCCAATGCCG 
CTGGGTGCAGCTACAATCACTTCCGCGCCATTGACAGGAGCCCCCATTCTGTTTCCATACCTGCTGAGTTGACATCAGCATTTCCCATATTAACACT 
GACCCATGCAACAACCCGCCCGATGACACCTAAGGCCGCTTAGTTCCCTGCATCGCAGCCGTGAATCCCAAATCCCTCACTGCTAGTTCTTAAGGCT 
GGAAAGGTGTAGGGCGGGCAGGGGTAGAGTTGCTGAAGCCAGTAGTGTCGATTTATCTGTTTGCAGCCTGGCCTGTACAGCCGGGAGGAGTTGGCAG 
GGGTCAGGATGGGGACACTAGACAGCTCTCTGGTTCCCCCTGAGGCGGGCCTTCTCAAAATGGCCTCATTATGTGTGTGGAGTGAATTCCAGAAATG 
TCAGGCACACCCATACATCTAGGGAATGAAGGAGATGCCAGAGCGGGCTCTGGGGCGAGAGGGGCCAGTGTGAGGAGGTGGGCGGCTTCCGGGCAGG 
GAGTTAGGGCACCTGAGACCTGGCTCTGCCACCATCTTGATATTAGGCAACTTCCTGAACTGTTCTGAGTCTCAGTAACTAGGAGAGTTGGATAAGC
TGCTGGCCAAAGACTTCCAGCTCCGAGATTGCAGTCATGGAGGCTGTATTCCGCGTTCAGGTCACCCAGCTATTTGGACCATCTGTGCCTGGTGGAG 
GGGGTGGTCACTGAAAGCTGGTCTCAGGGAAGGGTGGGGGGTGCAGAGCTCTAAACTCTACACACAAAGAGCTGTGAGACAGTGGATGAGTCTCCTC 
CCTCCCTCAGTTTTCTCATCTGCAAATTGGATGATGATAATAACACGACCACCTGCCCTGCCTGCTTGCAGGGCCGTGAAGAGCTTCAGGTGGAATA 
ATCTTTGTGGGCATGCCCCGTGACCAGTTGGCCTCAGACCTGAGGCCTCTGTGCCCTCTGCCAGGGGCTCTCCATTGAGCATCAGGACCCCTACAGG 
AAGGAGCTGAGGACTCCTTCCTGAGGACACTTGGGAAGTGTCAGGTTCTTTCCAGAGAAGTTAACTATCACTGTGAGTCCTAGAAAAGAGGAATAAG 
AAGGCTGGGCATGGGGCTTACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACCTGAGGTCAGGAGTTTGAGGCCTGCTTGGCCA 
ACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAAATTAGCTGGGCGTGGT^ 

GAATCGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCCGAGATTGCACCACTGCCTTCCAGCCTGGGCAACAAGACCAACACTCTGTCTCAAAAAA 
AAAAAAAAAAAAAAGAAAGAAAGAAAAGAGGAGAGGAATAAGAGCTAGTCCCTGCTCAGACAGTGCTAAGAATGGACCTAAGAGCCCCCCTGAGCCT 
CTGAGCTTGAGGAGTTGATATTTCCAGCAGAGGAAGAAGTGGGGCTGGACAGGGGAGTCCTCAGCTCCTTCCTGTAGGGGTCCTGATGCTCACTGGA 
GAGCCCCTGGGAGAGGGCACAGAGGCCTCAGGTCTGAAGCCAACTGGTCACGGGGCATACCCAGGTTCACAGCTCAGGGCATGGGGCTTGGTTATCC 
TCCTGGCCTAATGCAAAAACCACTGGCCAGACCCAGAGTGAGATTTCCAACCAGGGACACATGGGAAGATGTAGGTGGAGGTGAGGGGGGCGAGGGC 
AAAGGGCTGCCAGCTGGAAGGGGCTCTGCTGAGGCCTGTAGTCATCAGGGAATCCTAATGGCCTGGGTAAGGCCTAGCAAACTAGGAGACAGGCACA 
AGCCATTCCCTCCAGCTCGGCTGGAAAAACTGCGAGGACTGAGTGTGTGTACGTGTGTGTTGGGGAGGGGGATGGGAGCAACAGGACCAAGCACCCA 
GGTCTGTCTGGAAGACCTTTCTGGGAAACCTGCCCCTGGGCACAGAGTCCCACTTATCTGCCAGAGCCAGCTGGGCCGGCAAGGAAAACCAATCTCG 
CAGGGCCCATTAGGAATGCJAAATAGGGCTTGTGCTGTGAGTGAGCAGGATCCCGCCTCCTCTCCAGCTCCCACCAATCATCTTCACAAGGTGGATTC 
CTGAGGGTGCCACTCCCCTGCTCTGACACCTCCCATGGCTCCC^ 

AGTTGTAACTTTCACAAACCCCCTCGATCCCCCCAATAACCCACTT<2AACTGGAATACTCACCTCTTTGCTTGTTCATTGAAATACTTCCACCTCCT 
CTCTCTGACTTTAAAAGGCAGCCCAACCTGGGGCCTGTTGACCTGGGGTCAGTCTGCTCTGCGCTGCTCAGCTCGGACCCCACTTGGCTCAGGTCAT 
GTAGCTAACAGCCCTGAACTATGGAGGGTCCTTAGAGGGTCTCTCAAGGTGGATGGGCCCCTCGACCTGAGCAGGCTGCTGTCCCAGGAGTGGAGAT 
GAGCCTTTTCTCCACAGTGTCAATTCCCCTGGGTTTCTTTTTCTTTTTTTTTAATCCCAGTTCCACTTGTAAACGTTTCCTATTTGTGTACTATTTC
CCCCTCCATGTTCaCCAGTTCACACGATGCTTAAGTGTCTAAGTCATTAATTTTAAGATTGCAATTGGCAGAAGTTGTTTCTCTATCTAGGAACAGG

TTAATACGTTTGTTGACCATGTGCTATGTGCTGATCCCCATGCAGACATCATGATTTATCTGAATCTAATTCTTGCCAGTTTATAGATAGGGAGTCC

CACAGTGCAGCCTGGGCATGGCATGGGAAGCCCTGAGCCCACACCCTGTTGGACCTAAAGCCTrTTTCCTTCTACCACCATTCAGGCTCAGTTGTGG 

ACTTGTTATTGGGGTAAAAATTTGCTTGTTCATATTCTACTTTGTTCCCCAAAGGATGGTTTTGAGACAAGTTTACACTTGACCTTCCACTCAAAGG 

CTGATGCTTTGGGCTTATAGATACTATTTGTGAACGTCTGGGAAGTCTCTTATTTGGGGATGTGTCCTCTTCACTGGCCACCTTTCCACAAGCCATC 

AGGCTGTCCTCAAGCTGGGATTGCACAGTTGCTGACAGCAGTGGCCATGTGAGGATTrTGTTGCCCCTGGGATGGGTAAGGGGCGACCTTGATCAAA 

TCACTGCCTCCCGGGCCTCAGTTTCCTCATCTGTGAAACAGAAAGGGTAGCGTAAATGACCTCTCAGATTCCTTCCAGCACTAAGCTCTTTCCTTCT 

ACTTAGTCCTTTTTGCTGCTTTGCATCTGGTCTCCATCATATACGTGCTAGAAAGCAGCTGGGCAGAATCTTCTTTCTTGCCCT^ 

TGTTGGTATCTCTGAAACAGCCAATTGTTC^TTTCACTTTTACAATTC^ 

TGATAAAGCATGCTTCCCTTTATGTCCTGGCTATGAGAAAACTCAGAGGTCCACCTGAAGCCACATCCTTGCACCCGTGTGGACCTTCCCTGATCCT 
GCTCTTCCGTTTTTATTTTGTTATTGAATCTGATGTGTTTTCCGTCATCAGCCTCTTGAGGGC^GACATGTTGGAGTCTTTGTACCTGGCCTGGGCA 
TAATGGATCCTCAATAAAGACTTGTCTTTGGGATACACATTTGCTGTCAACCTCTTCAAATCCTTTCTGGAAAGAAGCAAGGCACATGCGAATTAAA

TATTTATTTACTTAGAGATGGAGTTTTGCTCTTGTTGCCCAGGCTGGAGTGCAGTAGCATGATCTCGGCTCACTGCAACCTCCGCCTCCCGGGTTCA 
AGCAATTCTCCTGCCTCAGCCTCCCAAGTACCTGGGATTATAGGCATGCGCCACCACACCCAGCTAATTTTGTATTTTCAGTAGAGATAGGATTTCA
CCATGTTGGTCAGGCTGGTATCAAACTCCTGACATCAAGTGATCCACCT 

CGGCCTTCTTTCTGTTTCTTTACTGTGAGCTTTTCCCCAAGTCTACAAAGGTCTTGTCTTCAGAGAAGTCGCCTCTTTTTTTTCCTCCATGATCTTC 
TCCACTCTAAAGCTGAGGTTGGTTCCTAATCTTCCTCTAGGGACTGTCTCTCGTTCCTGGCCAATGACTATGTCCACCCCTACCCCTGGCTCCCTTC 
ACAACTGGTTCCTCCC^CCCTCCCTTGCCTTGGAAGCCGACGGTG 

CTCCAACAGAGCTGCTCCCGTCCCCAGCAGCCATCGTGAGGGCCAAGCCCCACTCTGCAGATGGAGCCCCAAGGAGGAGCAGAGGGGAGCGGCGGCC 

AGGGATCTCCCCATTCACCCAGCAAGGTGTCCTCACAACTCTAACCGGCTCTCCCCTCCCCTCCGCTTCTGCCCTTGGGCAACACATGTGCAGGTCA 

CCCCACCCAGTCCCTGACTTCGGAGGAGAGAATGTCTAGCCTCAGGCTGGGTGGTTCTCCATCTTAACTGAAGCTTCCATAGGGGATTCTGGAGCCT 

TTCTCTGCTACCAGTAAGGATGGAGTCTGTGACCTCTGCACTGTCCTGGAAGCTAAAGCAGCCTCGAGAAAGCAGCCTCTTCATCCCCATGTTACCT 

GTGAGAAAACTGAGGCTCAGCTGGGCGCGGTGGCTCACGCCTGTAACCCCGGCACTTTGGGGGACAAAGGCAGGCAGATCACCTGAGGTTGGGAGTT 

TGAGGCCAGCCTGACCAACATGGATAAACCCCAACrCTACTAAAAATACAAAATTAGCCGGGCGTGGTGGCACGTGCCTGTAATCCCAGCTACTGGG 

GAGGCTGAGAAAGGAGAATCGCTTCAACCGAGGGCTGAAGTTGCGGTGAGCTGAGACCTGGCCATTGCACTCCAGCCTGGGCAATAAAAGCGAAACT 

CTGTCTCAAAAAAAAAAAAAAAAAAAAGGAAAAGAAAAAGAAAAAAGAAAAAAAGAAAACCGAGGATCACAGGACCCAACCAGGAGCTGTAAGGAGC 

TCGGAAGCAAGTGGGCGGCTCCAGTGGCAGCTAGTAATTCCTCGCTTATGATGTCCCGGGTATCACATACATTGAGGAAACAAAGCACACACATTAG 

CTCATCCTCCCGACAACCCCAGGAGGCAGAGACTCTTGTTATCCCCATTCTACAGATGAGAAACTGAGGCCCTGAGGCACCATTCCATTGCCGAAGG 

CTCCATGGTTGGTAAACCCCAGAGCTCTAAACAAGAGACAAACTGCCTGAAGCCAGGCCCTTCAGTGACACCACACCACACCACAGCCAGGAAGTTC 

CTCCTTAGATCTGACTTCAGTCTTTCCTTAATCCAGGCCCAGGTTTTCTTCCTGGGGATGTGAATATCACTGTCAAGTTTCAGTTTGGTTTGGTTTG 

AGACAGGATCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGTACAATCACAGCCCACTGTAGCCTCAATCTCCTGGGGTGAAGCTATCCTCCTGGCT 

TAGATTGCCAAATCACTAAGACTACAGGCAAGCACCATATGTGGCTAATTTTTTTAATTTTTATTTTTATTTATTTTTGAGACAGAGTTTTCACTCT 

TGTCACCCAGGCTGCAGTGCAATTGCAAGATCTTGGCTCACTGCAACCTCCACCTCCCCGGTTCAAGCGATTCTCCTGCCTCAGCCTCTCACGTAGC 

TGGGATTATAGGCGCCCGCCACCACACCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGATTTCACCATGTTGGCCAGGCTGGTCTCAAACTCCTG 

ACCTCAGGTGACCCGCCCACCTCAACCTCCCAAAATGCTGGGATTACAGGCGTGAGCCA.CTGTGCCCAGCCTTTTTTCTTTTTTTTTGTAGAGA 

CATCTCCCTACATTACTCTTGCTTGAACTGGACTCAAACAATCCTCCTGCCTCAGCCTCTCAAAGTGCTGGGATTCCAGGGGTAGGGTTCCAGTTTG 

CACTTGAGGACCAGGGTTCGTCTCATCCCTGAATCTTCCTGTGTCTGGAGTTGGTTCCTTCCAGTGGGTTCGTGGTCTCACTGACTTCAAGAATGAA 

GCCACGTACCTTTGCGGTGAGTGTTACAGGTCTTAAGGACCCCAAGAGTGAGCAACAGCAAGAGTTATCCTGAAGAGCAAAAGAACAAAGTTTCCAC

AGCATGGAAGGCGACCCAAGGGGGTTGGTGCTGCTGGCTTGGGGGTGGGGGGGTGGGOSGGGAGGC^GCTTTTATTCCCTCATTTGTCCCAGCCCAT 

GTCCTGCTGATTGGTCCATTTTACAGAGTGCTGATTGGTCCATTTTACAAACCTCTAGCTAGCTACAGAGCACTGATTGGTGCATTCTTACAGAGCA 

CTGATTGGTGCATTTTACAAACCTCTTGTAAGACAGAAAAGTTCTCCAAGTCCCCAGTTGACCCAGGAAGTCCGGCTGGCCTCATCTCTCATTCCTT 

CTCTAGAACAATTCCTCCCAGGTGACCCATCCTCTACTCACCTTTAACTACTTGTTCTTCCTCTTCTGATCTCTTACCAGCTTTTCTAC^CTTCCT 

GCAAGTCCTATAGCCACAGAATTGCCTTTAAATCAGGGAAAGAGGGGGCCTGCCATCAGCTCTCTGATTTAGAGGCTTCCGTCTGGTGTCTTACAGC 

ACTTCCCTTCCCCATAGGGCAGGGAGTCTGGAATCCAGCATGCCTGCCCTCAAGAGGCTTATGACGGAAAACACATCAGATTAAATAACAAAATAAA 

AACAGGCTAGTTGTGGTGGCTTGCGCCTGTAGTCCCAGCACTTTGGGAGGCCGAGGCAGGCAGATCACTTGAGGTCAGGAGTTCGAACCAGCCTGGC 

CAACATGCTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCGGGTGTGGTGGCACGCACCTGTAGTCCCAGCTGCTGGGGAGGCTGAGGCAGG 

AGAATCGCTTGAACCCAGGAGGTGGTGGTTGCAGTGAGCCAAGATCACACCACTGCACTCCAGCCTGGGCAACAGAGTGAGACTCTGTCTCAAAAAA 

TAAAAATAAATAACAAAAACAGAAGAGCAGGATCAAGGTCAAGGAGAAGACAAATATGCAAGTCAAGGTCAATGCaGGTGCAAGGATGGAGCTTGCC 

CTGAGTTTCCTCACAGCCAGGACATAAGGGAAAACACGCTTTACCTGAAAGGAGAAGGCATGCTGGGTCCTCAGGGAGAAATGCCCGCCTACAGCCT 

GTGCCCACCCTCTTCAGCTGCGCCATGAGAATTTAGGCTCCTGGAGTATCTGGCCCAGACAGCCGTGAGCCTGCTTCCGGGCTCTGCAGGCCTCTGC 

CTGGGGCCCATTCTGCCAGGGCAGGCGACACTGCCTGTGGCCAAGGGCTGGTCIAAGGGAAAT 

TGGCTGGGGCCTCCAGCATGGGATGCAGAAGGGATGGGCAGGCTGCAGGCCTGGGGCTAGATCTCCCATGTCACCATTGTCTAGCGTGAACTTTATC 
AGGTAACCTAGCCCTCTAGGTGGCAAGTAAGGTTTTTGCCAAGATAGGAGGATAGAATGATTGTTTAACTTTTTTTTTTCTTTCTTGAGACATAGTC
TTGCACTGTTGCCCAGGTTGGATTGCAGTGACACGATCTCACTATAACCTCTGCCTCCCAGGTTCAAGCGATTCTCCTCCCTGAGCCTTGCGAGTAG 
CTGGGATCATAGGTGTGCACCGCCACACCTGGCTAAGTTTTGTATTTTTAG™ 

GAGCTCAGGTGATCTGCCTGTCTGGGCCTCCGAAAGTGCTGGGATTACAGGTGTGAGCCGCTACGCCTGGCCAAGTGATTAACTTTTTAATGAAAGT 
ACAGGAAAAGACGTAAACTGTCTTTCCTCTGCTTTCACCCTGCAATAATTGGTACAGAAGACTTCTGTGACCAAATGCGTGCATACACC 
TGTCAATTCTGCGGCAGACATCAGCCAGCGTCCTCTAATCCAATTCAATTCTGATGATCTCCACCTGGACATAGCATCT 
CTTAGTTCCC^GGACAGCCTCCCACTTCCAGTAGC^ATCCCAAGCCCCAGGCTC 

TGTGACTTCCTCCTTGGGTTCCATTAATTTGCTAGAGTGGCTCGCAGAACACAGGAACACACGTTTACTGGTTTATTATAAAGGATATTACAAGGAT 
ACAGATGAAGAGGTGCATAGGGCGAGGTATGGGAGAAGCTGCACTGAGCTTCCACGCCCTTCCCAGGCACACCACCTTCCAGGAACCTCCACGTGTT
CCGCTATCCGGAAGCftCTTGAACCCTGTCCTTTTGGGCTTGTATGGGGGCTTC^ 

ACTTAATCTTCAGCCCCCTCTCCCCTTCCCAAAGGATGGGGTTGGGGTTAGGGTTGGGAGTCCAACCCTCTAATCTTGGCCGGGCACGGTGGCTCAC 
ACCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGTGGATCATCTGAGGTCAGGAGTTCGAGACCAGCCTGACCAACATGGTGAAACCCCATCTCT
ACTAAAAATACAAAAAAATTAGCCGAGCATGGTGGCAGGTGCCTGTAATCCCAGTTACTGGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAG
GCGGAGGTTGCAGTGAGCTGAGATCACAAC^CTGCACTCCAGCCAGGGAGACAGAGTCAAACTCCGTCAAAATAAATAAATAAAAGAAAAGAAAGTC 
CAACCCTTTAATCCTGCCTGGGTCTTTTGGGGTAACCAACCTCAATCTTGAAGCTGCCTAG^ 

AAAAGACATCACTTTGGAGTTTCTAAGGATTTTAGGAGTTGTACGCTGGGAAACTAGGTCGAAGACCAAATATATATTTTATAATATCACAAACATG
CGTACAGGCGAGCACACAACTTGTCAGTGTACAGCTTAATGAGCTGGATAGAATATCTTATGTTTAACAGTTTGTGAACCTGGCATGAAACTTTTCA
ATACTGGGCACATGGTAGATGAGGCTTCGTTGAGTGGTCTCAATGCTACAGCCAGGCCTTAGGGCTTGGCGGGGAAACAGTTCCACTAGGGTCGAAC 
TGGCACAGCATCGGGAAGAAGGCTTGCCTCCTCTAATGTGGCTGCATCATCTCCATCCCTCCCTTGGCAGACCCACAGTGGCACTGCCCTCCCTCAC
AGCTGTGGTCACTTACCTCCAAGTCCCTTTTTGTTAAACTTGGACCATGTTTTAGCCTTTCTCAGCCTCAACGGATCTTCTTTTCTGATCAGTTTAG 
TCAGTCTTATTTTAATTTAATTTAATTTAATTCAATTCAATTCAATTTGTTTTTCAAGATA 

CGCAATCACAGCTCACTGCAGCCTCCGCCTCCC^GGTTC^GCTATTCTCCTGCCCCAGCCTCCCAAGTTGCTGGGATTACAGGCGCCTGCCACCAT 
GCTCGGCTAATTTTTCTATTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCT^ 

CCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCCGGCCACTTTGGTCATTTTTAAACGGTCTAATTAGGAGTGCAGCAGCAGCAGATAT 
GACTGTGTTGTTGTATACTCCTCAGCCTGGCCCCTGCCGCCCTGCACCCGCTCCTCTCCCAACCCCAACATGGGTAAGCACATGGGCCTGGGTACAG 
CAGTCCTGGACTCTAGATGGTTCTGCCTTTTTATCTGGGTGGACTTCACCTCTCCAAGCCTTGGTTTTCTCTCTGCAAAATGGGGATAACGAGGGCT 
ATTGTGAGGATGAAAGGATATAACGCATGTGAGACTAGCACAGGGTCAGGTGTGTACTCAGTGAGTTAGACACACATTTTGCTGACATTAATTTCAT 
GTTACAGTCTGCGTTGTTCTCGTGCACACCTCACTTCTCCGTGCCTCTGTGCTACCCCCTGCCTTCTCTTGGGATGACTAO^CTCCTGTTCGGTC 
TTGAAATCATAGTTCAGCCATCACTTCCTCTGC 

AGACTGTAAGTCACCTGTGGGCAGGACCAGTTCTCCATCATCTTTCAACCCATACCACCAAACATAGCACAGGGCACACAGTGGATCCTCAAAGACA
GGTTGTGGGATGAAAGGACACTGTTGAGAGTCACCTGCTGGTCCCAGACTCTTGTCTCCCTGCTCCAGCTGGATTCTCATTGCCTCTGTGATGCTCT 
GCCAGGAAGACCAGCTGGTACTTGTTGTGTATTTTCCTGTATTCTCAGATACCTCTTCATGTGCATCTGTCTTGTCTTCCCAGTGTAAGAGTCAGCC 
CCTGAGGGCAGGGATTCTGCTCTGTAGGCCTCTGCATCAC^ 

TCAGGCCAGGCATGGTGGCTCACATCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGCGGATCACCTGAGGTCAGGAGTTCTAGACCAGCCTGAC
CAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGGTGGTGGGCIAGCTGTAATGCCAGCTACTCGGGAGGCTAAGGCAGG 
AGAATCACTTGAACCTGGGAGGTGGAGGCTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTGGGTGACACAGTGAGACTCCATCTCCAATAA 
ATAAATAAATAAATAGGAATTCTGTTAGCACTTGGCCCAGGAATCAGACrCCTT 

GGGACAGCACTCCCCTCACTCTGTGATCTCCCAGGAAGTCCGGCTCTGGGCATCTAACCTTGGTGTGGGGAGGCAACATAGCACTCCAGAAAGAGGG 
TAGGCTCTGTGACAGAACC^CCTGTGGTGCAAATCCCAGATGTCCACTTCATAGCTTCTAGAACTTTGCTTTGGGTGGTTGCTrAAAAATCTGAGTC 
TCAGTCTCTTCCTCTGTATAATGGGAATAAAAAATATGATCTCAAGGTAGTTGTGAGGTTGAAATGAAATAACAGACAGAAACTGGCTAGGCGCAGT
GGCTCATGCCTGTAATCCCTACIATTTTGGGAGGCCAAGGCGGACAGATCATTTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACTC 
TGTCTCTACTAAAAATACAAAAAGTAGCCAGGCATGGTGGCGCACACCTGTAATCCCAGCTACTCAGGAGGCTGAGATGGGAGAATCGCTTGAACCT
GGGAGGCGGAGGTTGCAGTTAGCTGAGATGACACCACTGCACTCCAGCCTGGGCAACAGAGTGAGACTCCATCTCGAAAAAAAAAAAAAAAAAGTAG 

AAATAATAGAAACTGAGTTGAAGGTGAATCATTAAAGATCTTCTACAGGTCATCTA^

TCTTTAGAGCAGAAAGATCTGAATTCCTTCCAAAACAGAAAAGGCTGGCTTCAGGGAATCCCACACACTTTATGTTGGTTTGTCCCAACCAATAAGC 
CTATCTGAAGAAAGAAGCGGGTGGGGAGGC^TGTGGCAGAAGTCTGTGACTTGGTGGACAAAAGGCAATGCAGACTCTGGCATGACCTTGCGTTCTG 

GTATCCTGTTCAGCCAGAGTACTCACAAAGGCCTCCAGTTCTCAGGAAGGGAAGGGTGAATTAAATGTGGTTAGCTATCAGGGATTTTTTTTTTTTT

TTTTTTTTGAGACGGAGTCTTGCTCTGTCACCAGGCTGGAGTACAGTGGTGTGATCTTGGCTCACTGTAACCTCCGCCTCACAGGCTCAAGTGATTC
TCCTGCCTCAGCCTCCTAAGTAGCTGGGACTACAGGCACGCACCACCATGCCTGGCCAATTTTTGTATTTTTAGTAGAGATGGGGTTTCCCCATGTT 
AGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCCACCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGCGCCCAGCCTATC 

AGGGATTTTTATGGGGGTTCCAAGGAGTGGTCTGTGTAGATTTAATATCACCCTAGTTAATGGCTGAGGC^CTGGCTGTGTAAGCCCTTGCTAAGCA
CAGTGAGTGAGTGTGGCTGGAAGTGCAGGAGGCCTGGGTGGGATGCAGGCTGATCCAGGGCTGGCAGTCACTGTGTGCATCCAACCTTTAGGCGAGA 
GAGGGCTGCTAAACTCTGAGAAGGAATTCAGGGTGAAGAAGAGCTGATGTGCCAATGGTATGGCTGGCACAAGCCTAAGCAGATACCAGCTATTCTC
CTGCAAGCTGATCAAAAATATTTTTATTACATAATACTTGACATATACAAGAGAACAATCATATATATTAGACATGATGACAGAATGAAAACCTGAG 

CCTCACTATGTCCCTGAAGGACCAGACCGTGACTAGGCCGTCGTGTCCATCTGTGGGTCCTCCTCAGCCCATTCCCCATCTCCCTCTAGATGTGACC

ACATCCTGAATTTTACCTATTTATTCCATTACTTAAAAATACTGGTTTTTGGCTGGGCACAGTGGCTCATGCCTGTGATCCCAGCACTTCGGGAGGC

CGAGGCGGGAGGATCACCTGAGGTCAGGGGTTCGAGACCAGCCTGGCCAACATGGTGAAGCCCCATCTCTA

TGTTAGCACACGCCTGTGGTCCTAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAATCCAGGAGGCAGACGTTGCAGAGAGCCGGGATCACGC
CACTGCACTCCAGCCTGGAGACCCTGTCTCAAAAAAACAAAAAACCAACAAACIAAAAAACTGGTTTTCCCAGGTAATAATGC

TATTTTGCCTATTTTTAGCATTATAAAAATGGTGTTATACTGCATGTGGCCCTCTATGTTGAGCTATTCTCTCTCTGCATTGTGTTTCTAAGTTTCA
AACTGTTCCTGGATCTTGCTGTAGTTCTCACATGCTCACAGTTGCATAATAGTCCATCTTTAGAATATACAGTTTATCTATTTTGTCTCCTGGCCAT
GGATGTGCATCTCGCTGTGCACAGCAGGGAAAGGTGTTCATGGAGGTACCTGCCTAGGCGTGGAACTTTGGGTTGCAGGATGTGTGAATGTTGAGCT 
TTACAAGATGATGCCAAACTGTTTTCCAAAGAGGTTGTGCCCATCTGTGCTGTCATCTTTGAGATTGTTTTTCCCATGGCCCAAGGCCCTTCTTCCC 
GCCCGAGGCAAGCCCACTCCTGAGGACTCCACTGCCTTGGGCCTTCACCGTCTTTCTCTCAGGATCCCATTCCCTACTGGAAAACTCTTTCCTTTCC 
CTCCTCCACTTGGCCCTGAACTTCCCAGGATTTTTCAAAACCCAGCCTGGAGCTGCCGTCTTCCTGGGAGTCCTCTCAGCCTCACTTCCACTCTCTC 
CCAGGTTTTTGGAACAGCTTTTCCAGACAGCTCTTGGGCCAGAGTAGGACATCGCATAGTGTGTTTCTTATAACATCAGCCCCACATAATCCTGTTG 
AGAAAGGCTTCTGGAGTCAAATAAACTTAAGAATCGCCACAGGCAACTTGGAGCCTCTTGGTGTTTATTGTTCTCATTTGAATATTAGCAGCTCATT
AGCATAGTTGCAGCACTGAGAGGTCCTGCCAGTGTGACCCGTTTTTATTTGTTTAACTTGTTTTC(^AAAAATATTTAACCTTGGAGCCCTTTTCCCA 
TTTTCCCCCATTTGGAAAATGGGATCGAGAAGATTCTCCTGCCTATTTCAGTCAGTTTGGTTTTGTGGTCTTATTTTCCTCGGTAACCAAGCTGAGA 
GTCCTCTGACTTCTGTTGTAATCATCCCGTGTAAACTAGTTCAGTCCAGACCACCCTTTCTGGAGGGCTTGAGTGGAAGGAATGAATGAATGCACTC 
AGGAATTAATGAATGAAGCAGTCAGCCAATCAGTCAGCCATCCAGCTGAGCCCCCTCTCCTATGCAGC^ 

GCTCATCTTGCACCAGGGGACACCCGGGCTGTCTGGCTGGTGAATCATTCTCCCGTCTTAGGTCCCCTGTAGTTGCATTTTTAGGATCTGTCACCCA 
ATGAGGCTGGGAGCCCCTGCGGGGCAGGAACCAAGTTGCGCTTGCCTCTCTGTGTGCCCCGAGTGGGACATACAGATGGGATCTCTCATCCAACTCC
TTCTGAGAGGGGCAGGAGAACATTGTGGTTAAGTCTTCAGCCTCTGGAGCCAGACCAGATGTGTCCCTAATTACCTGGGTAATGTGGGCCAATCTCT 
GCCTCAGTTTCCCCACTGAGATAGGGGTGATGATAACATACCTTCCTTACAGAATTATTATGGGGTTAATATGCGGCAAGCACACAAACAGTGCCTG 
GCATAGAGGATGTCACTGGTCTTTATTAGCACTATGACAATGGCAAC 

ATCTACATCATTTCAATTCTTCCTCCCCGCAGCCTCTTTGAAGTGGGGGATGTTTTTCTCATTAGGAAGTTGAGTCCTCCCAAGGAGACGTGGTAAT 
GGGTGGTAGCAATAAGATCAGGATTGGGGTCAGCC^GATCACAAAGCCCTGGCCTGCAGGCGTGGAGGGTGCCACCTACCACCTGCGGATGGGACAG 
CTCCTCTGTTTCTTTTTACTGATAATTACCCTGCCGACAACCTGATCAGTTCTTTTTTTTTTAATCCTCTATTCATCTCAACAAAATGGAAATCCAT 
GGTACTGCAAGGGTTAGAATTTAACATGTGTGCACACGTGTGTGTGCGTGCATGCATGTGTGTGTGTGTGAGAGAGAGAGGGAGAGACAGAGACAGA 
GAGAGACAGAGAGCAAGAGTGCACAAGTGAAGGAGAAGGAGTTATCTGAGAAGCCTGGGGGTTACTAGCACCTTCATGACAGACCAAGGAAGACAGA
GATCCCCGGGTGGGGCTGGCTTTGAGACTGCACAGCGGCAGAATCGCTTGTGAGTCAGCAGTG^ 

GGTGTCGTCACCCACTGCCCCTTGCACAGGGCCCTGTGTGCCGGGAAATGTTTGCAGGCTGGACACCTGTGGTTGCTTTTAACCAGGCAAGTGAGCT 
GGGTTAGAGGTAGCCACATGTTCCTGAGGGCTCGCAGCTTTGGATCTGAGAGGGTTAGACAGGGCGTCAGGGGCTGAGGGTCACTGGAGGAGAATTT 
GCTGTCGTCCCTTTGCCCACGGGCCGGCrGGTCTTCTCAGACCCCAGAGGCCAGCTCCTCTGTTCTTCCCCACCTCGGAGGGGAGCCCCAGAAGTCA 
CAGGGTCAGCTGTCTGCTTTCCACACTGCCTAGGGCTTTATTCTTGCACCATCCCCAGTTCTGGCCACACCAGGGGAGTGAGACATGGGGCGCCGGT 
AGAGGGGCTGAATGCGCTGGTGTGGGGTAGGAGGTTTCGGATGCACATTCCAGAGGGTGTTTATGGGGGAGCAGACAAGCCTGGGGATTCATTCCTG 
GTAGGGAACAATTCCCTCAGCACCCGAGGGTGGGAAATGAGGGCCAAAATGGTGGCTCAGAGGGACACAGGTGAGGTAAGCAGGTAAGAATTCCCAC 
CAGGTGTCTCAGAAATGTCCTGGGAAGGGCCTGGCATGGTGGCTCACACCTGTAATCCCAGTATTTTGGGAGGCCGAGGCGGGCAGATCACCTGTGG 
TCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCAAGCATGGTGGTGGGTGCCTATAATTCC
AGCTACTCGGAAGGCTGAGGCACGAGAATCACTTGAACCTGGGAGGTGCAGGTTGCAGTGAGCTGGAATCGTACCTCTGCACTCCAGCCTGAGTGAC 
AGAGCAAGAATCTG TTAAAAAAAAAAAAAAAAAAGAAAAAGAAAGAAAGAAAGA AAAGAAAGT C CTGGGAAG CTGGTGGAAGGAGGTC GAG ATCTTC 
CTGACCTGGGAAAGGAGGGAATTCCCAGGCATCTATAGACCAATAGGACTGTCGAAAGTTTGACTGGTGTGAGAATGGGTTAAGGTGCTGAATTTTA 
GATTTTTATCCTCTCCACGGTGGTACAGTGTGTTCTGCAAACAAAGGAGAGCCAGCCTTGAAGAGCCCTTCTCAGAGCTGACTATGTTCTTTGGCTC
CTCAGCCCAGTGAGATGGCCTTAGCTCTGAGCCCTGTGCAATATGCAAACTGGACCAGATGACCCCCAAGGCTCCTTCTAGCTCTGACAGGTCCTGT 
CGGGGGATCCAGCCTGCATATAGGGCCTGGGCTTCTTTCAGCCCCATAGATGGTGACCCCTCTTCACCGCTAGTAAGATTGAAGCTCATCAGCAGGA 
CCTCAGGCCCTGGCCTCCTTTCTTCTTGGTCCCTTGGGTTTTGGAGTTTTATGTGACATCAATGTCCCAGAAATGCCCATCCTCTTCCTTTTCTCTC
AGGGCACAGTCCCCTGAGGTCTTGTGGTTTATCCACCCCCACCCCTGTTCTATGTCCTCCTTCACTTGCTCTCCCCCACGCCCCTCCCACCAGAGAG 
TTTATAAGCTGGCTGTCCCCTTGCTAACCTCCTGCTGTTCCCTATGAACATGCCTGTTAGATTTCACCGTGTGACACTGTCTTGGTTATGTCCCCGT 
CGTGCCTAGTTCAGTGAAGAACACATAGCAGGTGCCGAATAAATGC^ 

GGCAAAATAGCGATAGCATGGGATTTGGGGTCCAACTCCACCTCCACTTCCTGGCCGATGGCAATCTGTCATTGGTAAACTGGAGCCGGCATCTCTA 
CTCTCAGGGCTGTTGAGAGGATTCATGGAGAAGCTGTGTGAGGAAATGCTTTGCAGGTTGGAAGGTGCTGGGAACGTGTGTCATCTACCCTGGCCTC 
TCTCTCAGGAACTCTTGCCCACATTTGTCAGCATCTGCTGGGTGTGTTGT^^ 

CCCTGAAATTCTTTGTCCTGTCTCACACACAAATGCCTCCAGCCCTCAGGTCGGGGGTCTCCTCTCCTCCTGCAATGCCTTGGGCGTGGGTCCAGCC 
TTGTCTGAGACACAGAGAAATGGACCTCCAGTGCGGAAGATCTGCACCAACATTCGAAAGAAAAAAGACTGATAGGCGGGAGCCCAGCATTTGAATT 
TACAGGGATGTTAACCCCAACAGCGGCCGGAGCAGCAGAGCAGGACCTGAGCACTTCCTTCCCTTTCTCCTTAGCCTTCCATGAACATGCTCTCTAT 

GGCTCGATCTTGGCTC^CTGCAGCCTTGACCTCCTGGGCTC^AGCAATCCTCCCAACTCAGCCTCTCAAGTAGCTGGGACTACAGGTGGTCACCACC 

CCCGCCTCGGCCTTTCAAAGTGCCGGGTTTACAGGTGTGAGCCACCTCCCTCAGCCCTCACTGCTTTTTTTGCCATCTCAGTGCCACCATTTCCTAG 

CTATGTGACTTTAGACGAGTCACTTGGCCTCTCTGAGCCCATGTCATCATCCGTTAAATTGGGTTCTTTGGCAAATTAAAGGATGTATTCTATGTAA 

AGTGCTTTACATGGCAACTGGCACATAGTC^GGCCTCAATAAATGTTAACTATTATTATTAATAATGGCCACTGGCCAGGCGTGATGGCT 

GTAGTCCCAGCGCTTTGGGAGGCAGAGGTGGGCAGATCACGTGGTCAGTAGTTCGAGGCTAGCCTGGTCAACATAGTGAAACTCTGTCTCTACTAAA 

AATACAAAAATTAGCCAGGTGTGGTGGCATGCACCTATAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGGCAGAGGT 

TGCAGTAAACCAACACGGCACCATTGCACTCCAGCCTGGGTGACAGAGCAAGACTCTGTCTCAAAAAAATAATAAAATGGCCACTCTGAGTGCCTCA 

GTGAATAGGCTCCTTACTACCTTACTTAGAGCAGAGTTGAGGAAATTGGTGAA 

CCCCACCTTATCTTATTCATTATGGGTAATCACCTGTAAAAGCGTTCTTTATTTCATTCACAGCCTGAGTGTCCCCAAAAAGGTGCTCCATGACGTC 
ATCCCCCCTCTGGAGGACACCTCCTCCCACCCATGGTGATTTGAGCCTTCrCTTAGGCCAGGGTGATTCCTGGGGGAACCTTAACTGGGGCTAATGT 
GCACTGACATTTTGCCTTTTGTGTCGAACACACATCCCATTTGCAGAGATTTAGAATTTTTTGACTTTAAATGTTCGCTATTTCTATGTATGTGTAA 
GAGAGAGTTGTTTTCTTTTGTTTTGTTAAGAGACAGGATCTCACTCTGTTGCCCAGGCTGGAGTGCAGTGGTGCAATCACAGCTTACTACAGCCTCA 
AACTCCTGGGCTCAAGCGATCCTCCTGCCTCAGTCTCCCCAGTAGCTGGGACTACAAGTGTGCACCACCACACCCAGCTAATTTTTCTGTGTTTTGT 
AGAGACAGCGTCTCGCTGTGTTGCCCAGGCTGATCTCAAATTCCTGGCCTCAGGTGATCCCCCTGCCTCGAAATTCTGAAGTGCTGTGATTATAGGC 
ATGCGCCACCATGCCAGAGAGAGAGATTTTATCTTTTCTACACAACGTGTGTTTCCCCCAGAAGAAATGCCTTCCTACTTCTGTGATGAATGTAAGT 
AGAATGCAGACAGAAATTACAGAAACTCCCTGTGCCCCAGCACCCTTACCAGGGCATGCCTGGCCTGGTGGAGTGACAGGGTGTTGTACACAGTGAG 
GGGAAGGAAGGGAGTGGGAGAGGAGGTGAGGCTCTGCAGAGCAGAGTGGACCATTTCTGGTCAGAAAACAGTGAGGAGGCCGGGCGTGGTGGCTCAC 
GTCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCGGATCACTTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGTTGAAACCCCGTCTCT 
ACTAAAAATACAAAAAAATTTAGCCAGGTATGGTGGTCCJTCGTCTGTAATC^ 

GCAGTGAGCTGAGATCACGCCACTGCACTACAGCCTGGGTGACAGGGCGAGACTCTGTCTCAGAAAAAAAAAAAAGAGTAAGGAAAGCGAAGrCCTG 
GGTTGGGGGGAAATGTAGCTGCAGKSCAAGAACCAAGGAAGGTGTGGA^ 

GAGCATCCTGTACCTTTGCAGACAGAACTTCCACCTACTCACAAACAGATACACCCCACTGCACAGCTAAGAAAGCTAAGGGGAAACCAGGAAGAAG
GGAAAAGAAAAGAGGAAAACCCCAGAGGAAACGAGGAGTCAGGCCTGGGCAGGGGGTTCCAGATGACGCATGAGGCTGCAGGAGGAAAAGGTTTCTT 
TTGAAACACATCTTTTATCCTGTCCTTTTCCGATGTACGTGAGCGTGTGTTTTGGAACGCACATAAAACGCGAGCGCAGTAGCCGGCCTCTGATTTC 
TGCCAAGGTGGACGGAAACGGAGGGGCGTGCCCAAATGCTTCTTCTTCATGGCTGCTGGATCAGGGAAGTTTTGCTTCTGTGGCCGTGAACATGGAG 
ATTTGCTGAGAGGCCAAGGAGAGAGATGATGGGGGAGGGCTACTTCGGGAGAGGAAGTGGACAAGACCAGAGGTTGGAAAACAGGGGACGGGTTGAT 
CCTAACAACTTGGAGGGACTGGTACTGGGAATAGGAGGTGGGGCCCAGGGGGAGGGCAGAGGACCCAGGAGCCCCAGCTGTGGACCCAGGGTGGGGC
TGTGGGCAGGTCCCCAGCCCTGTCTGTGACCTAGGGCAGTGCTTCTCAACCTTGAGGGAGGGAGGGAGGATACCACAGAGGGGAGGGAGCACCTAGA 
CCACTTAAATCCCTTTCTCTGCAGGGGGCCCAGGCAGCCTTGTTTCTAAGCTCAGCAGGGCCACTTACTATGCAGCCAGGGATGAGAACCAATGGCC 
CACCCTGCTGAGTGCTGGAATGTGTAATGAAGTCCTGCGTGGGTCTGTAATGAACCCAGATTCGGCTCCTAACAGGTGCTGAGTGCATGTTTGCAAA 
CCCAGGCGCCTGTTGCAAGGAGAGGACAGAGCTAAAGCAAGGAGCCTGCGGAGGAATGGGTGGCAGGGACTGATGAAGTTGGGGGAAGGAGGTGCAG
AGGAAAATGTTGAGGCTCGCTCTGGACTGCCTCGCCTGGGGATGGAGAATGGGGCTGCAAGGGAGGGGCAGCCAGCGCTTTGAACTTGGGTGGGGCC 
TCCCTAAATAGAGTGGGGGAAGTCCAGGTGCTGGGCAGGGCTGTCTTCACCTGCCCATCCTCAGCGCCGTGGGGGGATGAGGCGCAAAGAAGCCCTC 
CTGGGAATGTGGTGGAAATACCCACCTTGAAAGGTAGAGAGTGGAATCGTTGCCCTGAGCACCTGGGTGTGGTCGCAGCATCCCCCGTGAGGCCTCC 
CAACCTGCAGCTGCCTCCTGCGTGCGCATCTGCCACTGCTGGCTTTAGGGCCAGGGCCAGGCCCA.GCTGGAGGTCCAAGCTGGAGCCCTGGGCAGTG 
TCCAGTGCACTGTGGCCTCAAAGTGCCCAGCCATGGTTTAGTACTCCAGCT 

TCCCCATGGTGTCATTTGGTTTGTTCCTCTGTCCCTTGTGTTCCCTGATAACAAGAGGTTACCTCTCAAAGCTTGGTTAGATCAGATTAAACGTCTT 
TGGCCAAAGAACATCATAGGGAAGGTTGGGGCCTTCCTGCTGCATCATCTCATACCATTCTACTGTCACACTTCTGGTGGAGGCTTTTAATGATCCA
GTTTCCTGGACTCCTCTCCAGAGGCTCTGATCCTGCCACGCTGGTGTTGGAGCCTGGGTGTCTGTATTTTTCAAAACCACCCCAAGTGATCTGCCGT 
AGTGTATAGCCTGCATCCCTGACCCCAGTAGCCAGCTGGTCAGGGACTGTCCTTGGTTTTCTGAAGAGCATAGAATAAGAGAGTGGATGGGACAGTC
CAGGTAGGTGCCAGCCCTGGTGGCCAGTGCTATCAGGTGTGAGGCAGGGAGGGTGGCCTCTGGGCCCCCATAAAGAGGAGAATGGGCTTCCTCAGGG 
CAGCTCCAGGTTCAGGTAAGAGTCCTTGCAAGACCAGGAGTGCCCCACGCACTGTGCTCCOSGCGTCCAGCATAAGGCCTGGCCCAGCGTGAATGTT 
TATTGAATAAAGACACACACCAGGCTGGTGGCTCTCTGAGGAGCCCTGGGGAGCTGCTCTAAGAGTGAGCTCCCAGCCAAGCAGCACAGCTTTGTGT 
GTTCCCCACTGGGCTGCTTCTAAGATTCAAACAAAGGACTCCCAAACTGTGGAGCTAGCAGCTGTGTTAAAAGGCCCAGAGAGCCAGCAGGGCCATG 
GCCATCACCTGCCCTC^GCTCGCCGTGGAGACAGCCTGTGGGACCTGGGAGAGACGAAGGAAAGGACTGGGCACTCAGTGGGGCTGCTGAGGGCAGT 
GAAAGGAGGCAGGGAGGTCCTCGCAGGGAGGTCCTCGCAGGGCCACTGCACCTGCTGAAGCAGGTGCTAGATTCTACATGGAAACGAGTGAAAGAAT 
CCCTCCTCTCAAAAAACTCACCATCAATGGAGTCGGTGGACAATTCACTTGAGTGCATTGAAGGTTACCTGTGACGGATGTATAAGAGCTGGGGAAA 
TGGCCAGAGGGAACAATTTGAATTGGAGAC^TTAAAAGGGGATTTAAACATGGGACCTTGAGGGATGAATAGAAGTCTGGAGTCAGAAGCGAACAAG 
AGGGCTGGGCAGGCTATGTCTTGTCTTTTACCAGAACTGGCTTGAAGGGTATAGCAATGTGGGGACCCAGCAAAATCCTGGTGTGGGAACCAGCAGA
CTTGGCTGCTCAGAACGGCAGCCTAACTCTGTAAGGTTCTCATTTCCCAATCTGTAAAATGGGAGACTGACTTAGGGTTTGGAGGGGTCAGGAAGGG 
GAGAAATGTTACAAGGTTAAAATGAACTAAAGATGCAAACATGGCTTAGGAAAAATTTGAATTAGAATTCCACTTAGTGGAGTGGGAGGCAAAGAAA
CAGATGGCCAAGCTCCAGGAGGAAAAGCTGGCCAAGCTGAGCTCAACTTCAAGGCCCAGCTCGAATGTCACCTCTTTGAAGGGGACTTCTTTGCCTT 
CTCCCACTTCCTTCCCCTCAGCACACCTATTC

CACTGAGGAGGGTGGAGGCTACATCTTTTTTTTrTTTTTTTTTCTGAGGOVGAGTCTCACTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAATCTCG 
GCTCACTGAAACCTCTGCCTCCCAGGTTCAAGCGATTCTCCTGTCT^ 

TTTTTTCTGTTTTTAGTAGAGACAGGCGTTTCGCCATGTTGGCCAGGCTGGTCTCGAACTCCTGACTTCAGTTGATCTGCCTGCCTTGGCCTCCCAA 
AGTGCCGGGAATACAGGCGTGAGCCACCGCGCCCGGCCGAGGCTACATCTCATACATCTTGGGCACATAGTATGCAGGTACTGAATGAGGCCCAGCC 
TTCAGATGATCAGGTCTGGGGCACTTCAAGAACGATCTGCG 

GCACTGCCTCCAACCAGTGGAGGAACAGCAGGGAAGGCACATCGCTTCCCTGGGCCTTAATTTCCCCAAGGAGCTGAACTAGGTGATGTCAAAGGTC 
CCGCTTAGCTTTGACATTTTATGACCTAACACTGTAATAGAATCACTAACATGCGTGGCCATTTACAGAATGCTCCTAAAGCCCTTCCAGGACACTC
CTATTGTTCCCCCGCCCCCGCCCTACTCCAATAGCAAGAAAACTTTCACAGGTGGAGACTTTTCTTCAACAGGTCTTTATTGGCTGGGCACGCTGGC 
TCACACCTGTAATCCCAGCACTTGGGGAGGCTGAGGTGGGAGGATTGCTTGGGCCCAGGAGTTCGAAACCCAGCCTGGGCAATATAGCAGGACTTCA
TCTCTACAAAAAATTTTAAAAACTTAGCAGAGCATAGTGGCATGTGCCTGTAGTCTCAGCTACTCAGGAGGCTGAGGTGGAAGGACTGGTTGAGCCA
AGTAGTTCAAGGCTGCAGTAAGCTGTGATTGTGCCACTGCACTGCAGCCTGTGTGACAGAGTGAGACCCTGTCTCAAAAAAAAAAAAAAGAAAAAAG 
AAAAGAAAAATGGCCTTTACTGAACATCTATCTTTGTACTGATGAGCACCCCCAGAGA 

TTCAGCATCCCCTGAGTTGCCAAGACTAGACATTTAGGGGGAGCTGTATGACAATCACCCAAAACTGATGGGCAGAAATGCTGTGACCCCAAGTAGG
CCTCTGGGCCTTGATGACATCCTCAGTAACAGAGAGCACGGCACCCCTTTGTGTGTCTGAAGGCAGAGGGCTGGGACCCTACCTGAGCACGCACAGG
GCATCTGGGCTTTACCTGGGAGTCCTGAACTGCGCAGCAAAGTGGAGAAACGGACCCAGCTCTCAGC^TGTGTCCTTGGGAGCTGCTCCCCTCTGTG 
GTCACCAGGCAGCACTTGCCCACCTGTGGGCTTGGTGCCTCTCaGGGAGACGCTCTGGGGTCTGAAGAGTGAGGTCCCTGCCTTTTACCCGTCTCTG 
GACACCCCTCTGGGCCTAGGCCCTTCCCTGTGAATCCCCTGCCTGAAGGAAGAGACCCACAGCCCTGGGGTTCCTCTGGGGTGCCAAGTGGGGGACT 
GTGTTTGGTCCTGGGGAGGATGGGCTGGGGCTGCTGAGCAGGCCAGGGGTGCAGGACCCCTCCATCCCCCGCTTCTCCTGTCCTCCTCCTTCTTCCA 
CCCTTGACTCCTCTCACCTCTTGTCCCCTCTTCCCTCCTGTTTCCACTCTCTTTCCTCCTCCCCAGGTCCCACCAGACTGCTCCATGCCCCGGACCA 
CAATTTGGCTGAAGGCAACAATCCCTCTCTCGC^CC 

GGGTAAGAAAGAAGGAAAAAGCACTTCATGCTCACAACACAACAAAAAACAGGTTTA 

ACATCTTATCAGATCAGCCAGAGCCGCAGGCAGGCGCTCCAGGAGCTGCTTTGGAGAACAGAAGCTGCAACATTCCAGCCCAGGCACTGCCCCGCCC 
TCCACACCCCTTCCCTCCCTTCCCTTTTCTCCCCAAAGAAACATCCCACAGGGCTCCCTGGGATGGAGCACAGGGCCCGCAGCCTCTTCCCTC^ 
TGAGGTATAGTCTGTGGCCAAACCCTTGGCTTTGAGCTTCTGCCCGCACCCTTTCCTTCTCCCCACGGCTGCCCAGAACCATGGTCCTCCCGGGCTC 
TCAGGTGGTCCCTGGCAGCTGAGCTGTGCCACCTCCTCGGAAGGCCCCTC^ 

CTAAAGTCGTCCACAGCATGTAGAAGTGGTCATTTGGCCAGCAGGGCTAGTGAGTTTCCGGGGCTCCTGGGGCCTGTCCTCGGGGGCTCCTGTCGGA 
GGGGAAGGAATAGCTTTTTCCTTGGTTATCTTTTCTCCGTGTGCATGGGTTTTTCTCCTATGGCGTGGATCACTGCCTGGGATATCCCCTTCCACTC 
AGTCACCCCCAGACCTCACCTTTCTGTTCCTGCTGCACTCACGGTGACATTAGTGACACAATGCAGGGTGACAGGTCCAGGAAGGGCTGGCCAAAGC 
CCAAAGGCTCAGAGTTAAGCCGAGGCAACGTGAGGGACAGACCCAGAACCTGGATCCCAGGACAACAGGGAAGAGCTGGCTTTGTAGCTTGTAACAG
GAGCATGGTTAGGGCTGGGAGGTCCCACTGTGTGGTCAGAGAGGGGTCCCCAAGGGAAGGCAGCTGTGCACTGAAGGCTTAGGATGCATCATAAGAA
AAGCAAAAAGGGGAGGCCGGGCACGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCACGAGGTCAGGAGATCGAGAC 
CATCCTGGCTAACACAGTGAAACCCCGTCTCTACTAAAAAATACAAAAAAATTAG 

GGCTGAGGCAGGAGAATGGCGTGAACCCGGGAGGTGGAGCTTGCAGTGACCCGAGATCGCGCCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTC 
C AT CTC AAA AA AAAAAA AAAGAAAAGCAAAAAGGGGAAAGCGC AGAGG AGG TAGGGGCACAGTGGCTCCC TCTGTGTAGGTATGT TGGCTGTGC AT C 
TCGTGGGGACCTCGTCCGGAATATGGAAAACACAAGCCAGACCTGGGCGCCTGCTGCCAGTGTTGGCTGGGCAGGGGCTGAGGTTTCGGGAGAGCCC 
ACCTTTCCAGCCTCCTAGTGTGAGGTTCAGGCCCTGAAGCTGGGCTGGCTGCCCCTTGGCCCCCAGCCCCCTCCCTGGTGCCTCCACCCCTTGGCAT 
ATGGCATCAATGCTCCCACCGTGACGCTCAGTGTCCTCCCACCGTGTGCCCACAGCCAAGATGGGGCCTAAAGTATAATTCAACTTTTAGCTCCTTA 
CAAGCATCCCCAGATGGATTTCTAGCTTCCGTCCCTCTCTTGGGCCCAGCTCTGTCTCTACGTGGTGCCTGGAAGTCCCTCTGGCCCCACGCACTTC 
CTCGTCTTGTTCCCAAAGCTGCTTTGCTTCTTACCTTCCCATCTTGGTGGACACCATCTCCATCTTCCTGGTCATCCTGGCTCAAAACTTTGGCCAT 
ACTTTCCTCCTTCCTCCTGTTTCTCATGGGACCAAACGGCTACCAGTCCTTATTCACAGCTTCCTGTCTGTTCCTGAAGCCACTGTCCTGAGATGAC 
CAGCGGGCCCATCCCCTGCCACAGACCCCGGTGACCCAGACACATGGGTCTTGGCCCAGCTGCACAATGAGAGCTGAGCCCTGGGAGGTTGCAATGG 
ATCCCCAGGCCCCAGAGTCTCCTTAGCATCCACGTCAACCCCGCCTCCCAGCCGCTTTCCCAGCACCCACCTCTTGTCTCATTTTGCACATAACTAA 
CAGGATGATATTCCAAAGCCCAGCTAAGGAAGGTGCTAGTTTGTCCGAATAATGTCTGGCACTCAGGGCCCCTTTCTAGCTGCTTCCCCTCCTCTGC 
TTCTCTGTGTGGCTCCTAGACCCTGGCTTGGGGAGGCATGCCACCTTGCCTGGGGCTGTCCCCCTGCCACCTGGCTCCACCTGAGGGCCCCTAGCCA 
TCCTTCTCCAGCTCGGTAGCTGCTCCACATGGAGCCCTCCAGAGGCAGTGAGGACACACTTGCCCCCGTTCCTAGAGATCGCTCGCCCTGACCATGC 
ACCCTCTGGCCCAGAGGTTACTGGGGCTGAAAGTTACAAGTGCCTTGGGACCATGTCTCACTCATTTAAGGCACCCACAGTGCCCAGACCTAGTAGG 
TGCTCAAGAAATGCTTGTTGGTCTGTTACTGACAGCAGCCTCCTGGGCTGGGTGCTCCTCAGGTGTGGCTTCATTAATCTCCACGTTCCCAGAACCT 
GGAACCCTGCCAGGCTGAGAGTCATCACTCAATTCGTGCGTGTTGAGTGAATAGGCAGTGAATGACTTTGTTTACCAAGGCCAGACTTTAACCTGCT 
CTCTGAAGGACAGGCACTTCCACTGACTCCACAAATCACCAGGCCCCAGTCTTGACAGCTGGGGCTAAATCAAGACAGGAGCCCACTGGGCTTAGAG 
CCCGGGCAGCTGCTCCTTCATGGTTTTATCTCAAGACCTTGGGTCCCAGCTCAGGGCCTGCACCCACTCTACCCTCACCCTTCCCTTCCTGTCCTAG 
ACATAGGCAAGGGTGCTCCCACCCCTGCTGGAGATGCAGGCGAGCCAGAGGGCCCCGTTGAGAGAATGAACTAGCCCCGTTGAGAGAATGAACTAGC 
CCCTAATGGCAAGGCCCAGAGACATGTCCCACTTCTTTGATGAAAGGCACCCCCTGAGGCACCTTGGCGCTCTCCCACAGCCACAGGACACAGCCCC 
ACCTGAGTCCCTTTGGGCTCAGAAGAAACCCCCTACCCTTGGCCCTGCACTGAAATGTCAAGAAGGTCCAGACCTGTGCTCAGTACCCGCAGGGTGC 
ACATGTGTTCCAGGAGGGCGAGGAACAGTGGCAGCTGGAATCCTGAGCTAGGTTTCTCTGGTTTTGTCACAAGCTCCCTGCCAAGCAGCCAGGTGAG 
CGGCTCCAGTCCAGGCCTGTGGCCTCCCTCATCCACAGCAGTGATTGCCTCGTCACTATGGGTGTAGTTGGGTGCATCAGAGCCCACCAGGCCCAGC 
CCCACGGTGTGCACACATGCCCCTGAACTGGCTAGGCTTCCTGTCCTGACATCTGCCAATCCTGAACGGCTTCTTGACCTAAGTGGAAAGGTAGAGG 
CCTGAGTTCCAAAGCCAACGCTTGGTTCCTCACTTCCTGCTCTTTCCTCAAACCTTTCCTTTCCCTTCTAGAAGTTTCCGTGGAGGGTGGAGCCCTT 
CTGGGCCCAGGAACAAGTTGGGCCTCTGTCCAGTACTGCCCAGGAGCCAGCAGGTGGAGCACCTGGCCCTGGCCTGTTGTGGGGTTGACAGGAGCAC 
CGTGGCCCCCACCACAGGGAGAGGAAAATGACC^CAGGAGAGGTGATTTCTGACTCCTTCTCCTCACCTTCCCTCCTCCTCTCCCTCTCAGCAGCCA
TGTCAGTCATTTTAAGAGACACTAGATCTTCAGATC^GAGTCTTAAGATGCCTTCCCTTTGAGGCTGGCACAGATGGTTAGATTTCCGCATGGAGAA 
AGGTAAGGGCAAAGGGCCTATTTCTGCAAAGAAGGCATGACTTGGGGAGACTTGGCCCTGGAGAGCACCCTGAGACCGGAGGGAGGAGGGGAGCCTT
TCCCTGGGCCGCAGCCCCTGGGTCCACCTTCTGCAGCCCCACCCATGTGAAGCCAGGTGGGTCCTTTTTGTGCCCAGCCCCAGAGAAGCTTCTGAGA 
ATGTGCATTC^TCACCCCCAGGTTAAAAGCCTGGTGGGTAAAAGAACTGTGAGCCCTGGGTTCTCCTCCCACCTCCTGGGCAGCCAGCTGTGTGACC 
CCGGAAAATGACAAACCTCTCTGGGCCTCAATCTTCCCGTATAGAGGAAGAGGGAATTAGACTAGACCCGGGGCTCTGGAATTTCGTCCTGCACAGG 
GGCTCTAGGTGC^AGTTGAAGGCTGGGGATGCMGCACCATCCCTCCTTTGATTTTTTTAATTTTGCTATTTAACCATTTTAAAGTGTACTATTCA 
GTGGCATTAAGTACATTCCAATGCTGTGCGACCATCACCACCGTCTGTTTTCATCATCCCAAACTCTGTACTTATTAAGCAGTAACTTCCCACTCCC 
TCCCTTCCAGAACCCCTGGCAACCTCTGTTCTACTTTCTGTCCCTGTGAATTTACCTAGACACCTCATGTAAGTGGAATCATACAGTGTTTGTCCTT 
ATGTGTCTGGCTGCTGTCGCTTAGCTTCTTTTCAAGTTTCTTCTATGTTGTAGCTTGCATCAGAATTTCATTCCTTTTTTTTTTTTTGAGACAGAGT 
CTTACTCTGTTGCCTAAGCTGGCGTGTCTTTACCTAGAACTTCATGTAAGTGAAATCATACAGTATTTGTTCTTCCGTGTCTGGCTTCTGTCGCTTA 
GTCGCCCAAGCTGGAGTGCAGTGGCGCAACTGGCTCACTGTAACCTCTGCCTCTCAGGTTCAAGCAATACTCCTGTCTCAGCCTCCTGAGTAGCTGG
GACTACAGGCACGCACCACCAGGCCCGGCTAATTTTTGTATTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCT 
GAGGTGATCCACCCGCCTOVGCCTCCCAAAATGCTGGCATTACCGCCA^ 

TATGGGCCACCTGTTGTTTATCCATTCATCTGCTGAGGACTCTTGGGTTGTCTCTACCTTTTGGCTGCTGTGAGTAGCACTGCCGTGAACACTGGTG 
TCACTGCAAGCTCTGCCTCCCAGGTTCATGCCATTCTC^^ 

TTTTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGGTCTCGATCTCCTGACCTCGTGATCCGCCCGCCTCGraCCTCCCAAAGTGCTGGGATTACAA 

GCGTGAGCCACCGCACCTGGGCGGAGTCCCCATTTTCAATTCTTTTGGGCGTATACCAGGTGTGGCATTGCTGGATTATAGGGTGATTCTATGTTTA 

ACTTTTTGAGGGACAGGGCCCTCTTTTCATATGTTCTGTATGTAGGGGAGAGGAGGGAGATTGTGTGTGATTCCATTTGGGAGGCCACCTGGTTGGG 

TGCAGGACTGCCTACCCTAATGGATCACTCAGGGGAGTGGGTGCTGGCAGCACAGTGGGTGTCCCTGGCAGCACAGTCAGACCCCCATCAGC^ 

ATTGGCCGTC^CTGGGTGGACGGCAGCATCTGAGGCCAACAACCCCAGTGTCAGATC^ 

CATATGTTAGGTGCGTAAGTCAATGCAGCATCCAGGCTGACCTGAATTCACTGATTA 

GGCTTACGCCTGTAATCCCAGCACTTTGGGAGGCAGAGGTAGGTGGATCACTTGGTCGGGAGTTCCAGACCAGCCTGGCCAATATTGCGAAACCCTG
TCTCTACTGAAAATACAAAAATTAGCTGGGCATGGTGACGCATGCCTCTTATCCCAGCTACTTGAGAGGCCGAGGCAGGAGAATCACTTGAACTCGG 
GAGGCAGAGGTTGCAGTGAGCCAAGATCACGCTTTGTACTCCAGCCCGGGCAACAGAGCAAGATTCTGTCTCAAAAAAAGAAAAAAAAGGTCCTCTC
TGCCTTGTGCTCACCATGCAGGCCTCCTGCAGCCCTCTGGCCACAGC^GGCGTTATCAGGGTAGAGGCACAGAACTGGGGTGGGGATGGGAGTTAGA 
TAAGGCCATGCACAAGGTTTAGGTCTTGCCTATTTGTGGAGGGATATTTCACTCCAAGGGGTCAACACCAAAGTTTCCACAAGCAGCCCGTTGTGCA 
GGGCACCCGCAGCCCATCCGCCTGGCTGCCCGTGTGTGTCCTTGGCCCAGCTGGGCAGTGGGTGGTCGGAGTGGAGGATGAGCCGGCTTCAGGAGGA 
GGGTGGCAGGGTCCTAGGGAAGGGAAGGGCCGGGCCAGGAGCTGGGTATAAATAGCCCCTGGCCCAGCTGCGAAGGCCTGGCAGGGTCCTCCGCCCT 
CCCCCTCACCAGCTGTCAGTGCCAGGCAGGTCTCTATCCCCCTTGGGAGCCTGGCCATCATCTCAGATCCTAAACGTAGGAACTTCCTGAGCGGAAC 
CAAGGAGTCTTGTCCTGTGTGTAGGCGGAAGCTCCGGATGCCGGGAGGTCTGGCGAGTGAGGGGCATCAGGCTCGTCCTTTGGAAGAGTTCTTCATT 
CCTTTCCTCCAGAATCATGAATACCTAGCATGTGCCCGGCCCTTTGGAAGACCGAGTGGGCTGGGGTGTGTGTGTTGTGTGTGTGTGTGTGTGTGTG 
TGTGTGTGTGTGTGTGTGATGGGGTAGGGACCGGGGAATAACTGGAGAAAACACTACCACTACCTCTCTTCTTGGAGACTGAGACTGGCATAGCCAG 
ATGGAATTGGAAACAAATCAGTTCAGCACTTTGGGATAAGAAGGTTTGAATTAGGTACAGGAAAGATTCAGGCGGAGGAGAGGTGACTTCTGTCTGG 
AGCTCACAGAAGGCTTTAGAATGGTGGTGACCTAGGAATCAGGTTTCGAAGGATGGTGTAGAATGGTTGCCGGGGGCCGGGCAGTGGAGAGGTGCTT 
CGAGCCGAGGGCTCAGCATGTGCAAATGTGCAGAAGTGTGAAACAGCCCCGCTGGAGTCATTCAGCTGGCTAGGCAAATCCGTCAAAGCCACGCACA 
CTTTGGCACAAAGAATCGAGGAG(^TGAGAGTCACTGTGCCGCCTCCCGAAGTGGAACGTGTACTTC^ATAATGGGGAATAGGAGTGGCGAGTTAAA 
GTAATGTGTCTGCGATGGTGGAATTTCAGCAGTGTGCGGGCAGGAGGCTCAGGGTGGGGGTGGCCGGGCATGTTTGGGGAAGGGCAGCCATCCTGCA 
GAAATCAATCCCATGGCCACAGCCTCTTTAGTCTCCGACACTGGCCCAGGAGCTAAGCCCCATCGCCAGAGGCCTTGCTAATTACTCGCAGGATGAC 
CCCATCTGAACTGAAGCCTTTGAATGGAGCATCCTGAAAATGTCCTTTTGCCTGCACTCTTTGAGTGACCTTGAACCCTCATCCAGATCACAACTGT 
CTTTGTTCTGCCACCCCTGTGAGGACTGAGGTCTGATCCAGGCATCCTTTCCAGTGTTGGATAGGAAGGGGCTGAGATTTCTGATGTGTTCTACCAA 
GTCAGGGATGCAAGGATGGGAGAAATGAGGTTGGTGACAAGGCAGGGAACAAAGTGCTGGGGACAGCTCCACACACACCGTAACCATAGCAACCACC 
ATCTAGCAAGGGCTCCCCACGTGCCATGCACTGGATTGTGTCACTTATTCCTCCTGTCGTGTCTATTTTACCGATAAAGAAACTGAGGTATAGAGGT 
TAAGAAAATTGCTGAGATCAGCAGCTAGGAAGTGGTAGAGCTGGATGGGACCCTGGCACCATGGCTCCCAGGTCCACAGTTGGTGGGACCTGGGCAA 
CAGGAAGGAGACAGGGTTTGGTGGAAAGAGGGAAAACAGCTCATCATGACTGAGGCTCAGGAATTGTTCTCAGAGGACATCACACTGGGAGTCAGAC
AAACTCAGGGCTTGAATTCTGGCTCAGCCACTTCCTTGCATAAGTCGCCCAACTACTCCATGCCTCAGTTTCCTCATCTGCAAAATGGGGATAACAA 
TGCCCACCTCCCAGAGTTGTTGTGAGGATTCAGTGAGATTATGTATTTGAAGTGACTGGCACAGAGCAAGTGCTACATAAATAGTTGTCATTAGTAT
TGCCAGTATCCGGGTAACTGTGGGGACAGAAGGAAATCCCTGTGGGGCAGAG 

TCCTGGAGATGTCTGGGAAGCACTCCGACCTTTGAGGCCAATCAAGCCATGCAGGCAGTGTGCCAGAGCTCAGGTGCTCTGCTGGGCTGCGAAGACC 
CACCTGGGACCCTGAGGGAGGTAACTGGTGAAACCAGGAAGCAGGGAGCCCCTTGGAGAGCTGGGTGAGGAGCCGCCTTGCCAGCATGGGCCCCCCG 
ACTCCTCAGTCCCCCAAGCCAGCCGGCCTGGCTTTCTCTGTGGATATGGCTGTGTGACCTCGCACAAGTTGGTCCTCTTTCACGGCCTCTGCTTTTT 
CTTTTTCCTTTTTTGTTTTGAGACGAAGTCTCGCTCTGTCGCCCAAGCTGGAGTGCAGCGGCATGATCTTGGCTCACAACAACCTCCACCTCCCAGG 
TTCAAGCGATTCTCCTGCCTCAGCCTTCTGGGTAGCTGGGATTACAGGCGTGAGCCACCTGGCATGGTCGGCCTCTGTTTTTCATCTGTATAATGGG 
CTTTGTGCTCCCTTAGGGCTCCTCCTGTGGGTTCTGCTGCTGGACCAGGGCTCAGCCTCCAGTTTAGCTTGAGGGTGCCCAGGCCTGCAGAAGGTGT 
CAAGCAAGGGCTGGGAGAGGGGGAGGAAGGAGGAGGCAGGAAGGCAACGGGAGGGGAAGGAGGAGGGCAGAGGAAGGGTCTCCGAGTGCCTCTCACT 
CCCGCCTCAGCTCTCCGCTCCAGTGGTCCCATATAGGGCTCTGGGTCCCAAACACCAGAGGAAGGAGCGGGGACAGCCCAGGCTTCTTCCCTGGGTC 
TTGTTCCTGAGCTGCCCATCTAGGATGGTGCAAGGCCGAGTTGTCTGCACTGAACTCTCCCTCAGTTTCCCCTCCAGTGATTGCAGGGCTGTGGGGG 
CAGCTGGCACAGCAGGGCTGGGCTGGAGATGCTGCTTAAGAGCACGGGAGGGGCAGCCTGAGCCCAGAGGGAGGCTCAACGGGGACCTTTGCAGCCT 
GGCAGACAGGGCAGGGTACTTAGCTCCCTGGCCTCAGTTTCCTTATCTTGGAAATGGGACAATGGCGATGATAATGAAGTTTTTATTACAGACCTGC 
CGTGAGGATTACATAAGGTAACAGACACAAGCACTCAGAACTGTGTCCAGCACATGATAGGCCCATAATAAACCACAGGGATTATATGATCACTCTA
ACCTGCTTCTAACAGGGCAGGGGCAGTGCTGTGTGCGCCGTGTGCCCATCAGGGCTCTGGGCTCACTGACTCTACCCACTCAGGCATGTGCTCAGTA 
TGGAGGCCCAGGGTGCAGATGGGCCAGGCCTGAAGCCACCTTCACCCCATGAAGACCCAGGCAGGCACTGCCCAGCAGGCAGGGTCACATAGTGATG 
ACCCCCATGGGCTCTGGAGTCTGACACCCTGGGTTCCAACC^GCAATTCCATAGGAGCTCTGTGGTCTGGGGTGCTTCATCAACCTGCAAGGCTAAG 
CCATCTGTAAAATGGGGCTAATAATGACCCTACTTTGTGCTGGGGCTATTTTGAGGAT^ 

CAGGCACCGGGGAAATCAGTAATTAGCCATTACCACCGATAAATCAGAACTTCTGGGCCGGGCGTGGTGGCTCGCGCCTGTAATCCCAGCACTTTGG
GAGGCCGAGGTGTGTGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCG 
GGCGTGGTGGTGGGCACCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGTGTGAACCCAGGAGGCGGAGCTTGCAGTGAGCCGAGATC 
ACGTCACTGCATTCCAGCCTGGGTGACACAGCGAGACTCTGTCTCAAAAAAAAAAAAAAAAAAAAAAGAACTTCTGGCTGCTGCTTCTTTCTCTTTA 
GAAAAAGCTCAGTGCTTAAGACCTGGAATGCCTCACAGATGTTACCATCCAAACTCCCAGATCTGGCTCCTCAGTTCCTCCACCCAAAATAAACAAA 
TGGAAAAAGGAGAACCTGCCTCTCCCCCTGACAACTGAGCCTGGAGACAACCAAGGTGGATGGGTAGCATGCCCCTGCACGGTGCCCAGGGAGCTCT 
GTGCCTTGGAGTCTCCACGAATTGGCCAAGGCCMCATCAAGCTCAAGra^ 

CAGACCAAAAAGCCGTAGGCAAGCTAATTCACTCTCTAAACCTCAGTTTCCTAGTCCAGGAGATAGTGATGTTCTTCATGTTTGGGAATTGATCAGA
AATGTTGTATATAAAGTGC^TGGCAAGCAGTAACAGTTGCCAAATGGCAGATGTTTGGCTCGGCCCTGTGTCCAGCCC^GAAGCTTTTGTGTACTAA 
AAAGCACACAAAAAACCTCACCCTGCCCCATGGGGTCTTCTCCCGGCCCCT 

GGCCAATGCTGGACCAAAGCCTTGGTCAGTGCTGGACACAGAGGGAGAGGAGATGGGCCCCTTCTTCCAGCTCACACTCCCCCTGGTCATAGAGCAG 
ATTGCAATTGTCCTTGAAGGCCACTGACACTCACGGATCATTTTCTCA^ 

AACGGAAAAGTCGTAATTCCTACCTTGGTTTTCTAGTATTTTAAACTCTTCTGCCCCACCCCCATAAGGTCATTCTTTCTGTTGTTGACTATAAAAC 
CTGAGAGAGTTGATGGGATATTGATTACAAAGCGAGGGGCAGAGGAATGGAGTGAATTGACCAGTCACCTGGTGTTGGGTGTGGGCAGTGAGATCTG 
AGATCTCCCTTCTGACTCTGTGCTAAACTTGTTTTGACACAAAGCCAAGCTCCAGTCTCTCATGAGAAAAGAGGTGGAGCTCTAGAAACTTCTAGGA 
AACTGGAAAGATGGTCATGGAGGCCCTGGTAGACAAAGGACTTGGTGCAAGCCAGCAATTCCTTGGGGAGATACTGGGAGGCTCTGGGCTGGGCTGG 
CACCATCGCAGGGAAGAGGCTGAGGGCATTGCCAAGGGTAGTGTGTGTGTGTGCGCGCGCGCGCGTGTCCCATGAGTCACTGGAAGTGTCCATGTGG 
CCCCAGTATCCTGGACTGCCTTGGT^AATTGCCCAGCCTGATCTCTTCAATGCCCATCAGGGAAGCTGGACCCAGAACCTGGAAAGGACTGGATCTGG
GCCCCAAATACCATCTTTCTAGTCCAAACCTCTT^ 

AAGCTGGATGCAGCCAGAGGGCTGACTTGCGTGGGTCAGAGCTCTGCX3CTTTGAGGGGCGGCATGCTCCCGAGGTCAAAGGTAGCACATTCAGGTGC 
CAGGGCACAGGCCCTCTGCCTCTCCACCTACCCCGGAGGGTTCACAAAGCGGCCATGGTGCCCAGGATGGGCTCTGTGCTTTGTGCCCACAGTGTTA 
CTACCCTCTGTCTGCCTGC^GGTCCCTGGCCCTGGGCCAGCAGTACACATCTCTGGGCTC^CAGCCCCTGCTCTGCGGCTCCATCCC^GGCCTGGTC 
CCCAAGCAACTGCGCTTCTGCCGCAATTAC^TOTAG 

GGGGCCGCCGCTGGAACTGCACCACCATAGATGACAGCCTGGCCATCTTTGGGCCCGTCCTCGACAAAGGTACTGCCTTGGGGCTGAGGGGGGGGGA 
GGGGCTGGGTGGGGGGAGGGATCAGGACTGTATGCCCTTCCACCCCAGTCAGGGTGCCTCTTCTCTGAAGCCAGGGTCCCTCTGGGAGGGTGATAAA 
TTTGGTTGCATGACTGCCACACCTTCCCAGGCGCTGCTG^ 

AGGCCACAGACTTCCTTCCTGGCTTTTGCACAGTGGGCGCCTCAGGCTCCTGCTGTGGACATTTGTGACATTTCCCCAATGCTGTGTTAAGTGCTAT 

GCAAATACCGCTTTGGGGTGGGGGTGAGAGCATTGCCACCCCTAGGTCCCTCCCATCTAGAAAGTCCTCAGTAGGTGAGGAAGCTGCCTGGTGCTGT 

GGTAGCATTGCGTTGCCCCCAAGAGCCCCTCTGGGTTTCTGTTCATCCTCATGCCCTGTCAGCTGGATATGCCCAGGAATCATGGTGGGGCTAGAGG 

GTGCACTGGCCTCTGGGTAACTTTCCGGCCGTCTTGCAGCCTGGGAGTCTAACTGGCTGTGCCTCAAGTGGACAGGGAACCACATGATTTGACTTTG 

AACCTGCAGGCCCTGGGAGGGAGGCTAAGTGGGACAACTCCTCCCAAGAAGGGGACTTGGGGGCTCATGGACTGGGTGGAAGATGGGCAGCTCTTTG 

TGGAACGGCAGTGTCCAGGAATTTAACGATCCTGGCCGGGCTGGCCACAGCC^GAGTTGCTCTGCCACCCAAACCraCTGAGCCTGGCCCTCCCTCCT 

ATTAGAAGCTAGGGAATGAATGAGAGCTGCCTATACAGCCAGAAAGAAAAGGGATCTGACGTTCAGGTGAAGGGGTGGGGTGCTGGGTGGGGTTGGG 

GGGGAAGCTGACAACCCCCATTTTAGAGTGCAGTTCCCCTTTACCGGTGCCTCAGGTGCTCAGGGTCTGTCCCACCTACAGGCA 

TGTCATGGGACAGGAGGGTAATTAGCAAATCAATGGATGATGATTCTGTTTACGCTTCTATTGCCAAAACTAATTATTCCTTGTTGACATAAAAGAT 

TTGGCCTCTTAATTGGGGCCAAGTGGCGGCGGTTTGCATTTCAAACTCGTTCTTGTCGGTGTGTTGGTCTCAGCGCTTTCCGGTGCCCCTCCTCGGG 

CCAGGTAGCGCTGAATCAAACAACCCTGCGTCTAGACGCGCTCCCCAAAAGOSAGTCACCTCTGATCGCTCTTCCCGTCTCCGGCC^GAGCCAGGG 

GAAAGCGGCTCTGGGAGCGCTACCCATTCACTCCGCGTTGAACGTTTTAAGTCACCACCGTAGGTTCTGGCACACGCCCCGGTCCCTAGATGGCCCG 

AGGACAGGTTTGGGTTGCGGGGGGAGGGGCGATGTGTGCCCGAGAGTTTGGGAATTTGCTGGCGGCGGCGAGGAGAGCATCCCGGAAGACGCCGCGA 

GCGCATACGCGCCCCGGTCGGTGCCCAGAGCCCGGACCTTGAACCCAGGTTCGCGCCCAGCTCCGGCCCCACGTTTGCACCCCGCCCCCGGCGAGTC 

CCCACACCACCGCCGCGTGGGCGGCCAAGGGTTAATCTCTGCCCCGCGCTTTGAGACCCGGGGCGCAGCGGGTCGGGCAGCCCGGGGCGCGCGTCTT 

CCGCGAGTCTGGGGGCGGCGGGGTCGGCGGCCGGAAGGGGCGAGGCGCGGCGGCGGCGCGGCGGGCCCGAGGCGCTTCCCCAGGCGGGGGCGGGGGC 

AGGGGCGGGGGCGGGCGCCGGAGGGGCCTGGGCGGCCTGAAAAGCGCCCGAGCACCGCCCCCCCACCGCGCGGCCGCCGCGCCATTCACACGCCCTC 

TGGCCATTCGGCGCGGCGCGGCGCGGCGCGGGGGCGGGGGGCGCCGCTTCAATGGGGATTTCGCX3GCCCGGCGCCGGGGCCGGGGGCGCGGCCCGGC 

CGCGGGAGCCGCCCGTTGCTACGCGGTGGCGGCCGGCCCGGCGGCCCGGGCCCGGCGGCCGCATTATGCGGGTAATGCGGTGTGACACCGCGCGAAC 

AAAGGCGGATTGAGCGGCCCAGCGGGCCCCGCCGGACGGGGAAGGCACAGCGGGGCTGTCAGCGCCGCTCCCAGGCTAATCCCCGCCCCGCCCCCGC 

CGCCCCCGCGTTCCCGGGGCTGGGAACCCAGCGCGGCCCGGGGCCCGGGCGCCGCTGCCGCCGCGAGAGGCCGCAGGCCGGCCGGAAACGAGGGGCG

CCCGCCCTGTGAACCCGCCCTCGGCCCCTCTTCCCCGAGGCGCCCCCAGGGCTGGGGCGTCTAGGATTGGGGCGAGTGAGCGGCGCGGCTGGGACCC 

CGAGGCCGGGAGCACAACTGGACGCGGGACGTCGGGCCCCCTTGCCGCCGTCCTCGGTCGGCCCTGGCAGCGCAGGCGCGGGGCGCGGGCGGGACCA 

GCTCTGCACTCTTCGCCCTCCCGTCGTTGGAAGTGGGACGGGCCCCTCTGTAGGTGGCTCAGGAGGGCGTGGGGGCCX3CGGAGAGCGGCCGCGGGAG 

CCAAGAGGGGTCCCGGGCTTCCTTCTTGCAGCACTCGGCCAACAGCCCAGGAGGGGAGGCGGGCGCCTGGCTTCTAAATAAATTTGGTCACATTTAT 

GACCAATTTTGCCGACTAGGCCGGCCCGGCGTTTCACAGCCTGTGTCAGGTGGCGAGTCACCTCCGGCTCCCGGGTCCTGCCATGAGGAGAGAGATG 

GTTGGGCTGCAGGTCTTGC^GGCCTTGGCCTCCCTAATC^ 

AGACGATGACCGTTGTGAGGAGGCTGGAGCACACCCTTGAGAGTCTCTCCTTTCCCTGCGCTGGACCCTGGGACAGAACCAGGGGAACTGCCCGGTG 
TCCCTACCCTTCTGGGAGGGAGTTGGCCGTGCCTGGGCGTGAGGGCAGGAGAATGGGGATGGGGAGACTGAGGCACGGGCCACTGCCCTCCTGGTGA 
GGGGCTCTAGGAGCGGAGGGAAGCCGTGTCCATTTTACAAAAAGGGCGACTGAGACCGAGAAAAGGAGAGGGGCCGTCCCTATGCATCCACGTGCGG 
AAGGGGTGGGCCTGGGCAGGTTTAGGCTGTAGATTCCCAGGCCTCCCTCCTTACAGACCTGGATCTCACTCAGGCGTGGGAGGGCTCAGGCCTGTGC 
AGGGTGGTCTCTGGACTTTGGGAAGGTGGAGGGCAGGGGGTGTGGGACCAGGACAGAAGTGGCTGCCTTATCTCTGCAGCAACCTGCCAGAGCTTTG 
GTGTCGGCGCCTGTTTCTAACAGGCACTGCTTTGCCCTGTGGCATTTGAGGTCCTGAGCTAATCAGCAGCCCAGGGCCTCAGTTTGCTCATCGGTGA 
AGTGGGAATACCCAACTGGGTCTAAACCAGCCAAAGAGGGTGGGGAAGCTGCCTCCCTCTTGCCTCAGGGCTGGTGAGCTCTTCAACTCTTCCTGGC 
CTCGTTCCCCTCACGTCCTGGCCTGGGAGGTGGAGGCAGGGCAGGCGCTCCCAGGCCGGGTCCTGAGTGCTGCCTCCCCACGACCTCCCCGCAGCCA 
CCCGCGAGTCGGCCTTCGTTCACGCCATCGCCTCGGCCGGCGTGGCCTTCGCCGTCACCCGCTCCTGCGCCGAGGGCACCTCCACCATTTGCGGCTG 
TGACTCGCATCATAAGGGGCCGCCTGGCGAAGGCTGGAAGTGGGGCGGCTGCAGCGAGGACGCTGACTTCGGCGTGTTAGTGTCCAGGGAGTTCGCG 
GATGCGCGCGAGAACAGGCCGGACGCGCGCTCGGCCATGAACAAGCACAACAACGAGGCGGGCCGCACGGTGAGCCCGGCTACCCTCCCCAAACCCT 
TCAGGGAGCAGCCCCCCGCTTCCCCTCCGGAGCTGCCCTGGCCCCCACCTCACCCCTTCCGGCAGCCCTGTGGGTCGCGGGCTCCTTTTCCCCTTGG 
CCAGGCTTGGCCTCCTCCCCACCCCACAGCCACTT<^TCTGTCCACCCTCCTTCCTCCTTGGGCAGTTCCTGGACTCCACCCCACCCGGGATGAGTC 
GGCCTGGGAGCATGGCCACCCCGTGGGATCAGGTGCCACCCACCACCCATCCTGTGCACCTTGGCACGCGGTGTAGGGCTGTCAAGTCCTGGGAACC 
TGGCTTTGAATTCAGCCTCCACTCCCTTACTATGCTTGTATGATCTAGGGTGAGTCACGTTAAGTTCCCAGCCTCAGTTTCCCCATGTGTTAAATGG 

ATCTTGGGTCCCTGCAACCTCCGCCTCCCGAGTTCAAGCAAT^ 

GCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCGCCATGTTGGCCAGGCTGGTCTCAAACTCCTGACTTCAGGTGATCCTCCCGCCTCAGCCTCC 
CAAGGTGCTGGGATTACCACCGCGCCTGGGCAACAACTCGTATTTCATGCGCTGTGAGAAGTGGACGTGCTGTCTACCCATGCTTGGGGTGTGAAGG
GGGAGGGAAACATGGTCACTATTCTTTAGGTCTTGCCTCAGTC^ 

TGGGCAGTGGCAGAGAGGGAAGGCCTGGATGTGCCCCCTCGTGGGGTCGGTCTGTTGGCCAGCTGCCACTTCTCTCCCCCAGACTATCCTGGACCAC 
ATGCACCTCAAATGCAAGTGCCACGGGCTGTCGGGCAGCTGT^

TCAAGGACAAGTATGACAGCGCCTCGGAGATGGTAGTAGAGAAGCACCGTGAGTCCCGAGGCTGGGTGGAGACCCTCCGGGCCAAGTACTCGCTCTT 
CAAGCCACCCACGGAGAGGGACCTGGTCTACTACGAGAACTCCCCCAACTTTTGTGAGCCCAACCCAGAGACGGGTTCCTTTGGCACAAGGGACCGG
ACTTGCAATGTCACCTCCCACGGCATCGATGGCTGCGATCTGCTCTGCTGTGGCCGGGGCCACAACACGAGGACGGAGAAGCGGAAGGAAAAATGCC 
ACTGCATCTTCCACTGGTGCTGCTACGTCAGCTGCCAGG 

GAAGCAGGGGGGAGGCTGGGAGCCTGGGCGCAGGGAATGGGGTTGTTTGCCCATCT^ 
ACACACAAGTTCCCACAGCCAAAATAGGAAGCTAGGATTTT^ 

AGATAAAAACCCAGTGTTGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGACGGATCACCTGAGGTCAGGAGTT 
CGAGACCGGCCTGGCC^CATGGTGAAACCCGTCTCTACTAAAACTACAAAAATTAGCCGGGCGTTGTGGCAGGTGCCTGTAATCCCAGCTGCTTGG 
GAGGCTGAGGCAGGAGGATTGCTTGAACCTGGGCGACAGAGGTTGCAATGAACTGAGATTGCACCACTGGACTCCATCCAGCCTGGGCGACAGAGTG 
AGACTCCATCTCAAACAAACAAACAAACAAACAAAAAAAAACAAA^ 

TTAGTTATGAGCCAGATCAAATTTTTGCATAGCTGCTAGCTGTCATTCTTGAGCATTATTTGGAGATGGGGGGCTGGAAGCTGGGAGACCTGACGTC 
TGGTCCTAACTTTGTCAGTAACTTCCTGTTCTTTCACTTTTCTGGGCCTCAGCTTTCTTACTATAAAGTGGTGATGGTGGTGAAGGGCAGGGGGCAG 
TTTAAATACCTCAAGCAGCGAGCCAAAATGTCTCCTCTACCCCTC^ 
AGCATAAATCGTACCAACCTCAGGGAGGTGGGTGGGGGGAAGCGGCATATAAGCCCATATCCTGACAGATCTGAAGCTTGGGTGGAGTGGGGAGCAC
TCTCACCGAGTTTATGGATGATGACGGGGAGCAGGAGGGATTCAAGTTCAGAGTGGCTGGGATGAGTAATCGAGGGATCAAAGGCGGGTGGATTTTG
TTCCCAAGCTTCCCAGAGGTGGCCTGCATGGGAGGCGCT^ 

TCCTGGACCCCCGGGGTGGAAGGTTTTAAGGCTGGGGTGCCCTTTCTTCCTTTTCTGTCCCCTGAGAAGGAGCCTAGTGTGTCGAGGCATGAGCATG 
GGTGTTGGAATTAGGCCCTGCCCTGCTCCAGCTCCTACCACTTGCCAACCGTGTGACC^ 

CATCCTCCCCAGAGATGCTGTGAGGGACAAATGAAAGGACACCGAGGACAGGGCTTGGTTAATTGGGAAGCACTTCCCTGCTGGGTCATTGTTGTCT 

GGGGTGCTGCTGGGCGAGAGCAGCCTGTGAAAGGGTAGAGTGCCCCCTCTTCCTGCAGTTCCACAGGGCACCTCTGAGCTACTGGCACATGTCTG 

CCATAGGGAGTAGGGGAGGGAAGTGGACGGGCCTCTCTGAGATATGGGAGGTACTTGACGTCATGGAGAAGAGCTCTGGCTGAGGGGCGACTCACTC 

AGCCCATCTAGGAAGCCCCTCGGTGTAGCTCAAGTTCTGTCCTGGTGGGGACAGTCCAGAAAGGCAGCAGGTGGCACTACCTTTGTCGGTATCTGAA 

CAAAAGCAGTGTGGGGAGTGGGAGAGGACGAAGGTGAATCC^GCGCCTCCACTC 

GTTCAGTGGTGGTGACAATGGGAGCCCAGCAGAACTTTTTTTTTTTTTTTGAGACTGAGTCTCTCTTTGTCGCCCAGGCTGGAGT^ 

CTCGGCTCACTGTAACCTCCGCCACCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTATAGGTGCCCGCCACCACGATCGG 

CTAATTTTTTTATTTTTAGTAGAGACGGGTTTCGCCATGCTGGCCA^ 

AAGTGCTGGGATTACAGGOSTGAGCCACCGCGCCCGGCCAGAATCAGTACTCTT^ 

TTTTTTTTCCTCAATCTTATGTAAAGTGCTACATAGATATGGTTTCAAGAT^^ 

TCTGGAGGCCTCACTCTGTGACCAGTGGAGGACACTCTCCCAACATG^ 

CTGGCTTCTTCATTTGTGAACTGGGGATGAGAACTGTGTTCTGTGTAGGGTTGTCCTGTGGACTGGAGACCCTGCACATGAAGGGCCTGGCCCAGGA 
CCCAGCTGCTCACAGGGCAGCC^CAGAGGCTCCTGGGCCAAGCAGAGCTGACCTCCCAGAGCCACCTCCCCAGGCGAGAGCCGCCTGCTCTGGAGCA 
ATGGGCCCAGGCTCGGCCGAGACTCCGCTAGGCAAGACGGGAGCCTGCGGGATGCACTGGCAGGGATGGCAGGGGACAGAGTCGTTAGCTGTTCTCT 
TGTCTTCCCTGACAGTCTTGGCAGTGCAGGCAGGAGAGAGAAGGGAGTCGCAGAGCAGGCCCCGGGCCCTCCACACGGTGAGGAGAGAGCGCCTGTC 
AAATGGCTTGTGTCCTTCCHTTGTGGGGAAAACACCTTTTCTGTAGCTTTTTGTAGC^ 

TACAAGTGCCAGATTTATTCCTCGTCACTCTGCTGTTGTGAACATGGTGAGCTGGAGCTGGAAAGGTTCTGGCAAGAGTCGAGTGGCCTGGCTGCGC 
CCAAGTCCTGGATCTGTTACTGCCTTGCTGAGAGACCTTGGATGTGTCCCTTCAATCTGGCTGAGCCTGTTTCTGCCTCTGTAAAGTAGTCTTATCT 
TGGCCTTGTCAGCCCCCTAGAAGAGCTGGGGGTTGACTGGGAATGCCTCAAACCCTTTGTGAGATATGACTATGCAAATGAGTAAAATGAACACACT
GCTGCTCCAATGGGAAGGAATTGTTCGTAATTAAGTAGGCTCTCTCCCATTTGGACAGCTCATTATC^AGCTATTTGGTGCGGTACAAAAACACAGT 
GCATGCCTGGCGTGAGCCCAGCAC^GCAGTTGACTTGGAGTTGGGGACTGGGCCCTCGGCCAGGCACCGTTTCTCCCATCCTGAGTGACCTGCTCCT 
GGGCTTCCTGTTGCCCAAGGCCCTCAGCTAGAGX3TGGGGGAGGTAGAATTTCGATCCCATTCTGTGGACTCGAAAGCCTTACTGCTCCTGTGACTCA 
TACGACCTCCCTCAACAGCAGGCCCAAAGCACAGAGGAGAGCGTGGAGAAACCACCCAAGTCCTCATCACCTCTGAGCACC^GGGACACGTTCAGAT 
CCATCAGCAGCCTCTGATCTTTGAACCCCAATGGGTGGTCTCTGGGAATGGAATTATGTTTCTAATTCTTCCTCTTGATTTTCCCCAAGGGCGCTGG 
GAAGGGGTGAAGTGTGTGGCTGGGCGGATTCAGCGAAGTCTCATGGGAAGCAGGACCTAGAGCCGGGCACAGCCCTCAGCGTCAGACAGCAAGGAAC 
TGTCACCAGCCGCACGCGTGGTAAATGACCCAGACCCAACTCGCCTGTGGACGGGGAGGCTCTCCCTCTCTCTCATCTTACATTTCTCACCCTACTC 
TGGATGGTGTGTGGTTTTTAAAGAAGGGGGCTTTCTTTTTAGTTCTCTAGGGTCTGATAGGAACAGACCTGAGGCTTATCTTTGCACATGTTAAAGA 
AAATAAAAATGAAAAAAAATTTGACTCCAACAGAACAGGCTGGGCTAATGTGAGCTCTCAGCCTGGCAGTCAAGACATCAGCATGGGCAAGGTTCTG 
TTTCCAAACTGCTGCTTCTGGTGACATTCCAAGACGCCTGGAGGGTGGGAGTCAGGAAGTAGGACACACCCCTGCAGTCTCCTTTTCTTGGTCCACT 
CCCATTCAAATTTGAGCTAATTTCTCATTCTGATAAAAGCCATAGGTTTAGCTAGGATGAAGTGGTAGGAAGGTCCGTGGCAGTTGTTAGAGTAGGA 
TTTGGAGTTTGGAAGAACTGGCAGCTCAGGGTGGCCTGGTCAGCCGTTTGAAGAGCAGCCATGTGTTCTTCTCAGTCTCATTTTCTCTATAACCCTG 
TTCTGCACGAGGGGCAGTCAGATCTCAAAATCTTTTTCTACCATTCTGCAGTTTCCACCGTCAATGCAGTTTTTTTTTTGTTTTTTTGTTTTTTTTT 
TTTTTTTGGTGGTAGTGGACCTTGTAAATAGGCTATGTAAGGGGGCAAGTCTTCTCTAGCTCAAATGGCTTCCTAAATAAATAAGCGGTATCTTCAG 
AAGGGGCCATTCAGTCCTTCCCAGCCCTGCTCACCTGCAGATTCTCTGTACAAATAACTCCAGGTAGAGCAGTTGGACTCCAGGTCACCTTAGTATA 
AGTTAGACAAAGGGTCCGTGAGGGAGTAGCCATCAATTCCTGAAATTCCAACTTTGTGACTAGCAGATGGGGAGGATGAAAACCATCCCTTTGCTTC 
CTCTCCAATACGGACCCATCTTACTGTGTCCTTTCCTCTCTGGGGCCAATGTGAGTAAACACAGACACAGAGTTCTTTCCCCCAGCTCTTCCTCCCT 
CACCTGCATGCTGAGATAGCTTCCATCCATGCAGTTCCCAAGGATCTGGATTAGAAGTTCAAAGGGGAACCAGCAGTCACCTACTCCCTTAGGTGAA 
GCATCTCACGGCTGAGTTCTCCCTGAGGCATACTGGTCCAGCTGAGCGTCCTAGAGAAAGCTAGCAAAAGGGAGGCACATGK5ATTTCACAGTATGAA 
TTGGTTCAACAACTGTCTTAGGGAGAATCAGAAAGAAGAGATGCAGCAGGGGAATGAGCAGAACAAAGATTTTTCTTTCTCCCCCTTCTCTCTGGGG 
TCTACCTAACCCTGACCTAAAATACCAGGGCAGCGATCTCCCAGCTGGTGCAGGTGGGCTTGCCAAGATGGTCGTCCAGGAGCCCGCCTTCACTTCT 
AAATCTGCTGGCCACAAGCCCTGCTAAAGATACACATCTCACCCCCTCCGCCAAGTCTGAAATGCCCCTCCCCATCTCACCTTAGACTGAAAAGTTT 
TAAATCATGTCAACTGGATAATACTTGCTTTATGTGAGAATACTTCAGCAGAATGGATACGAATTTTCAAAACAATCTTTTCATATCTATGTATTCT
ATATTAAAAGTGATAAAGTCATGTTTCTGGGGCGTATTC^GTAGCTGACAAGTAATTATTTAATAATAGTACATGAGTGCATTGTAATGATTCTCG 
CCGTAGTCAGGTAATAGTATCCAACCGAAATTTCCTACCAACCTGCTGTATCCAAAGTTTTGTAAAAAGTTGTAGAAGTTGTTGATCTTTTTGATTT
TATATTCAAAAAGTCTCTTTTTATAAATATTATTTATTATACAATGTATATACCTTTGAGTTAACTAAGATTATATATTATATAAATATATATATAT
TTGGAGAAAATATATTTCATCATGCAGTTTTTTTCTGTTAAGTCATTAAAGAGAAGGTAAACAAACCTAAAATGGATACATTGTACACTGTGTGGAG
TTTCCAGATTAATGCTCAGGAGGTCAAGGATACGAGATTCTGAACATGGTGGCCTCAGCTAATTAAATGACCAAGTGAAGGTTATTTCCTCTGTTTT
CTAAGCAAGATCATCAGAGATGTCACGGACAGTGGGTCTGATACACCACAGTACATTTTTAAAGGCTTCTTTTGTTCTAAGTTAACTTCTAACAATG
AATTTCCAGTTTTAGAAAACAACAACTTGTGTACTAATTCATTTCACAGACTC^GTATTTTCAATGTATAAATTAATGTCACAGGCACTCCTAAAA 
GATGTGGCCCAAGGCCAGCAGACCGCAGGGGTGGCCACTGCCTTAATCCCCTCCTCACCACGTTTTCAACTCCATTTAATGGGAAAATAGGATTTTC
TCTATTGAAAATGAAACTTGACTGAAAATTGTCTGTCCCCTGCTAGCTCCACTGTGAGAGCTAAGCCGTAGAGGGAGGTAGGCTGGTCTACAAGAAG
AAAGCGTGTGCCCTGGGCTGTGGGCTGTCTGGAACAATTGTCATTTGAAGTTGGGTTTGGGTTTGGGTAGAAAGGGCCTGCCAGTGGGCATGCTCTG 
GGGATGGGCACCCTCTCAGATCGCCACTTCACAGGACCCACTGGAAGCTCACAGG 

ACAACAGACAATCCTTTAGTTTTCAGACCTACCCATAAAATGTCACCTTTGGTTTATTCAGCACATTATTTCAGAGAATAACAAAAATGTTTTTTCT 
TTTTATTTCTCCTGACCTACCTCATTTAGGAACAAATCTTACCTCACTACGAAGAGAAATAATGAGAACAGGCTGGGTGGGGTGGCTCAGACCTGTA 
ATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGATCACTTGAGGTCAGGAGTTCGAGACCAGCTTGGCCAACATGGTGAAAGCCCGTTTTTACTAAAA 
ATACAAAAATTAGCAGCTGGGCATGGTGGTGCATGCCTGTAGTCCCAGCTACTTGAGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGCGGAG
GTTGCAATGAGCTGAGATCGCACCACTGTACTCCAGCCTGGGTGACAAGGCGAGACTCCATCTCAAAAAAAAA 

GAAAATAAAGCCATGAGCCGCCTATGGTGGAGGCGGGCTGCTCTGCGCTGTATTCAGACCTGTGACCTGAGGTTAGGAATAGCCACCAGAAGTTGAT
GCACTCTGTTTTGTCC&TGATGTGTCTTTAGCCCCTGAAAAGCTGAAACCACA^ 

GATACAGATCTCCTCTACATCTTACAGTCTGGTCATTTTTATTTCATTTCGACTTTTTATTTTTCTTTTGCATAAAGCTAATGCAGGTTGACTACAC 
CTTTTCTGAAATGCTTGGGACCAGAAATATTTCAGACTTTGGAAGATTTGCATATATACAATGAGGTATCTTGGGGATGGGACCCAAGTCTAAACAC 
GAAATTCATTGATTTCATATGCACCTTACACACATAGCCTGAAGGTAATTTTATGAAATTTTTAAATAATTTTGTGCATGAAACAGTTTAGACTGCA 
TTTTGAATTTTGATTGCCACCTGTCACACAAGGTCAGGTGTGGAATTTTCCACTTGTGGCATCCTGTCGGTGCTGAAAAGTTTTAGATTTTAGAGCA 
TTTCAGATTTTGGATTTTTAGATTAGGGATGCTCGACTGAACTTATCTATCCCAGAATTCTAGGTACTTTTTCCTTTTTTTTTCTTTTTTTTTTTTT 
TTTTGAGATAGGGTCTTCCTCTGTTGCCCAGGCTGGAGTACAGTGGTGTGATCATGGCTCACTGCAGCCTTGACCTCCTGGGCTCAAGTGACCTCCT 
GCTCAGGCTGGTCTTGAACTCCTAGCCTCAAATGATCTCCCACCTCAGCCTCTCAGCATTGGGACTACAGGCATGAGCCACCCTGCCTGGCCAGTAC
TTTTTCAATCAATAAGATTGCCAAATATTTAAATGGTCCCTATTCATACTAATGAAGACCATTTGTAATCGTATGATTAGTTTTTAATTTTTAAGAG
ACAAGTGTCTATGTCACTCAGGAGTACAGTGGCATGATCATGGCTCAACACAGCCTTGACCCCCTGGGCTCAAATGATCCTCCTCAGCCTCCCAAGT 
AACTAGGACTCAGGCGCACTCTACTATGGTAATTAAAAATAAAATCTTTTGT^ 

GTTCAGGGGGTCCTCCCACTTCAGCCTCCCAAAGTGTTGGGATTACTGGGTGAGCCATCACACCCTGCCCAGTCTTATTTGATGGTTAAGGAGAATA
CAGTTGATAGCTGTGCTTGAACATGTAGTTTCTCCTTATGTATAAGTACTTACAAACGTCTAAACACAAGAGGACGTTGTTTA 
ACGACATAATTAGAAGATATCTTTCAAAGCAATATTAT 

TACAAAGAAAGGTGGAAGGAAAGAAGGTAGGGAAAAAGAAAAGCAAGGAAACATGATGGCGACATGATTCTTGGTGTTACCTAATGCCGTCTCCCCT
GGTGGCCCTGGGCATAGGGACATGTTCTTGTGTGTGTGGCTGGTCTGCACACTCGAGTGAAGTCAGTGCAGCTACGCATTGCTGCTACACTGCCACT 
GTTTCTCCTAGTTTAATAGAGAACCTTCTAGGAGGCAAAAATTTCTTACC^TTAATCTTATACTC 
TACGCTGGGAAAATTTCAAACATGATCCCTCCTTGGATATTTTGTCTTC^^ 

ATGCCATGGTGCCCGTCCAGCACACACCCTTGTGGGCTCGTCTCCAGGACCCAGTCTCAGTACGGCTTGCCAGCTGATGAGAAACTCTACATCATTT 
TGCAATAAATTCTGCAACTTACCTGCTTTCT 

GCCCAGGCTGGAGTGC^GTGGTGTGATCTTGGCTCACTGCAACCTCTGCCTCCC^ 

ATTACAGGCATGTGCCACCATGCCCGGCTAATTGTGTATTTTTAGTAGAGACGGTGTTTCACCACGTTGGTCAGGCTGGCGTCGAACTCCTGACCTC
AGGTGATTGCCCGCCTCAGCCTCCCAAAGCTCTGGGATTACAAGTGTGAGCTACTGCGCCCGGCTGCCCCCAGTCACTTCTTTAAAGAGGCTATGGA 
CCAAGTTGTCAGGCAAGGAAAATGAAATCCTGCTTTCACCTTCTGTGTTTGGGCTG^ 

TTGATCAGGATTTGATTATTGTACACACTTCAATCTACTCCATGATTTTGTGACTTGTTGCTGAAACTTTTAAAGGGGAGGCTGCAATTTTTAGGAT 
ATGAAAAGACAGCCTTAAAGTAAAAACATATTTCTGACTTTAACCACAATTATGCAATTTTTCTGTGTGACATTGAAAAAAAGTGTCCCAGTGAAGA 
TGTACAAATTTAGAAAGGATAAAATAACCTCCTCTTTTGAGGTTTTGATCTCAAAAACATCTTATGACAAAAAAATAGCCGGTTAACAATTCGCCTA 
TAGATGAAGAAACCTAAGGCAGCAGCTAGCTGGAGCCTCCTTTGCTTCTTCACAGGCCGTTCTGTAATCTCATACGCTCTGCTGTACAGCACTGGCT 
TTGAAGAATATTTGGGCAGCTTCAAGAACCCACCTTCTCAAGGCTATGC^^ 

HUMAN SEQUENCE - mRNA 

GCGCTTCTGACAAGCCCGAAAGTCATTTCCAATCTCAAGTGGACTTTGTTCCAACTATTGGGGGCGTCGCTCCCCCTCYTCATGGTCGCGGGCAAAC 
TTCCTCCTCGGCGCCTCTTCTAATGGAGCCCCACCTGCTCGGGCTGCTCCTCGGCCTCCTGCTCGGTGGCACCAGGGTCCTCGCTGGCTACCCAATT
TGGTGGTCCCTGGCCCTGGGCCAGCAGTACACATCTCTGGGCTCACAGCCCCTGCTCTGCGGCTCCATCCCAGGCCTGGTCCCCAAGCAACTGCGCT
TCTGCCGCAATTACATCGAGATCATGCCCAGCGTGGCCGAGGGCGTGAAGCTGGGCATCCAGGAGTGCCAGCACCAGTTCCGGGGCCGCCGCTGGAA
CTGCACCACCATAGATGACAGCCTGGCCATCTTTGGGCCCGTCCTCGACAAAGCCACCCGCGAGTCGGCCTTCGTTCACGCCATCGCCTCGGCCGGC
GTGGCCTTCGCCGTCACCCGCTCCTGCGCCGAGGGCACCTCCACCATTTGCGGCTGTGACTCGCATCATAAGGGGCCGCCTGGCGAAGGCTGGAAGT
GGGGCGGCTGCAGCGAGGACGCTGACTTCGGCGTGTTAGTGTCCAGGGAGTTCGCGGATGCGCGCGAGAACAGGCCGGACGCGCGCTCGGCCATGAA 
CAAGCACAACAACGAGGCGGGCCGCACGACTATCCTGGACCACATGCACCTCAAATGCAAGTGCCACGGGCTGTCGGGCAGCTGTGAGGTGAAGACC
TGCTGGTGGGCGCAGCCTGACTTCCGTGCCATCGGTGACTTCCTCAAGGACAAGTATGACAGCGCCTCGGAGATGGTAGTAGAGAAGCACCGTGAGT
CCCGAGGCTGGGTGGAGACCCTCCGGGCCAAGTACTCGCTCTTCAAGCCACCCACGGAGAGGGACCTGGTCTACTACGAGAACTCCCCCAACTTTTG 
TGAGCCCAACCCAGAGACGGGTTCCTTTGGCACAAGGGACCGGACTTGCAATGTCACCTCCCACGGCATCGATGGCTGCGATCTGCTCTGCTGTGGC
CGGGGCCACAACACGAGGACGGAGAAGCGGAAGGAAAAATGCCACTGCATCTTCCACTGGTGCTGCTACGTCAGCTGCCAGGAGTGTATTCGCATCT
ACGACGTGCACACCTGCAAGTAGGGCACCAG


HUMAN SEQUENCE - CODING



ATGGAGCCCCACCTGCTCGGGCTGCTCCTCGGCCTCCTGCTCGGTGGCACCAGGGTCCTCGCTGGCTACCCAATTTGGTGGTCCCTGGCCCTGGGCC 
AGCAGTACACATCTCTGGGCTCACAGCCCCTGCTCTGCGGCTCCATCCCAGGCCTGGTCCCCAAGCAACTGCGCTTCTGCCGCAATTACATCGAGAT 
CATGCCCAGCGTGGCCGAGGGCGTGAAGCTGGGCATCCAGGAGTGCCAGCACCAGTTCCGGGGCCGCCGCTGGAACTGCACCACCATAGATGACAGC 
CTGGCCATCTTTGGGCCCGTCCTCGACAAAGCCACCCGCGAGTCGGCCTTCGTTCACGCCATCGCCTCGGCCGGCGTGGCCTTCGCCGTCACCCGCT 
CCTGCGCCGAGGGCACCTCCACCATTTGCGGCTGTGACTCGCATCATAAGGGGCCGCCTGGCGAAGGCTGGAAGTGGGGCGGCTGCAGCGAGGACGC 
TGACTTCGGCGTGTTAGTGTCCAGGGAGTTCGCGGATGCGCGCGAGAACAGGCCGGACGCGCGCTCGGCCATGAACAAGCACAACAACGAGGCGGGC 
CGCACGACTATCCTGGACCACATGCACCTCAAATGCAAGTGCCACGGGCTGTCGGGCAGCTGTGAGGTGAAGACCTGCTGGTGGGCGCAGCCTGACT 
TCCGTGCCATCGGTGACTTCCTCAAGGACAAGTATGACAGCGCCTCGGAGATGGTAGTAGAGAAGCACCGTGAGTCCCGAGGCTGGGTGGAGACCCT 
CCGGGCCAAGTACTCGCTCTTCAAGCCACCCACGGAGAGGGACCTGGTCTACTACGAGAACTCCCCCAACTTTTGTGAGCCCAACCCAGAGACGGGT 
TCCTTTGGCACAAGGGACCGGACTTGCAATGTCACCTCCCACGGCATCGATGGCTGCGATCTGCTCTGCTGTGGCCGGGGCCACAACACGAGGACGG
AGAAGCGGAAGGAAAAATGCCACTGCATCTTCCACTGGTGCTGCTACGTCAGCTGCCAGGAGTGTATTCGCATCTACGACGTGCACACCTGCAAGTA 
G 






TABLE VIII 



MOUSE NOMENCLATURE 
ICSGNM Batf 
Celera mCG5742 

HUMAN NOMENCLATURE 
HGNC BATF 
Celera hCG22346 

MOUSE SEQUENCE - GENOMIC 

GCTAACAGGTACCACATGCTTTCTAGGGACTAAGCATGGTGTTAGGCAATTTAAATATGTCCATTTCT 
CCCAGGCATGGTTTTAAGCGATTGGCTGGCATCACATGGTTACAGAGTGGCAAAGCCAGACACX^^ 

ATTTGCCTGTTCTCTTGCTTCCTAGGCCTTTTCTCCCCACCCCAACCCCCATCCCTATCATCATCTTAGGATCCTGTGAAGGCCAGAGGCTGCAAAC 
CAATTTGCTTTACATTACACTCCCAAAGCACCTCACTCCTGGGAGGTCAGACACCTTTTGCAAGGCCCGGTGTAGACGGGATTCACCCTCCCTGCCA 
AGAATAAAGGGCTGGGTGTGTCCCATGATTGCCTGGTTCTAGC!ACAAACATAGCTTTAGA(^TGCTGCTTCACTGTTAATCCATTTTCTTTTTATTA 
TTGTAGCCTCTTTCACTTTTCTCTCTGGTCTGCACATGGATGT^ 

TTCCTCTCAACGTTATTTTTAAAATATATAATAGTTGTTCTTTGAGAATTTCATGCATACATACCATGAGTTTTGGTCATATTGGCTCCTCACTCTT 
CCTCGCAATTCCTTCTAGATCCACCCCTCACCTTCCTAC!ACCCTTA 

TGCCCAAATACACACAATCACTAGAACAAGGTCAACCTACCAGATGCTCCAAGAAAGCTGAGTTTCCTTTTCCTCTGACACTTTTTGTCCCCTTTCC 
TGCAACAGTCCTGAGTCTTGGAGGGTGTATGTGATGTAGGTGCCATTTGTACTGAACACTCCACAGCACTTGTTCTCTGTACACTGACCAGTTCACA
AGTGTTTGCATTAACCACCATCCGCTGTACAAAGAAACATCTCTGATGAGGGCCAAGAGCCTCATTAATCTATGGATACAAGGATGCAAACTGAAAG
AACTGTCCATTTATCAAATAATAGTAGTAGGCTCACTTCTGGAGCCTGTGAGTGCCTCCACCATGGGCTTTGCCCAGACTTACAGTACCAGGCAAGA 
GTTTTATTCTTTGGAGGAAGTCTTAAATCTATTCTGAAAGTAGTTGGGTTAACTCCATAATGGTTGTGCCACTAATACTTAGGTATATTGTGCCATG 
TTGGGCATTACTGTGGCTACCAGAATACACAGCTGGGTAANNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGAGTGCTCTTAACTG 

TGGTTTTGGTTTTTTGAGACAGGTATCTCTTTGTGTTGTGCTGGTTGTCTTGGAACTCTATAGATCAGGCTGGCCTCGAACTCACAGCGATCCACCT 
ACCTCTCTCTTGCCTTTTGAGGGGCCTGGGTCCATGCCTTCTG^ 

AGCTCTGCTGTCAGGTTGAAGACTCTAAAGCGGCTCGGAGCAAGGACGTCATTGGCCTGGCTTTATCTCAAGGAGGAGGAGACAGAGGAGGCCACAG 
GCTGAGACTGACGGATGGAGCAGAAGTTTAGGATCTAGGAGCTTCCTCCCTCAGAGCATCTGGTGATGACCAGAGATGTGTCAGATCTTCTCACCTG
GGGTTCTACTGAAGTCTGGTGAGTGGAGGACCAGGGCCTGGCCTGCTAGGCCAGAGACAGCCCCCACAACAAAGACTCCTGTGGCCAGAAGATCACA 
AAAACTGCCCTTTTCCTTTTCTGCTAGCCTCACGAAGAGCCAGCTAATGATGGTTTCCTGCAGTCTTCATCTAGGCTGAAAAGGCAGGGGACCAGGC 
TACCGGCCACTGCAAGGGACTTTCCTCACC^TCACCTGGCCATCCCCrCTGGCCCCTTCCCTGGAGAGCTCTGCCCTCAGGTAAATGTTTTCTCAAC 
TTGTTTGG^GAACTTCGTAAAAACTCCCCGCTAGCCATCTCATAAGCTGTGAAGGTGTGCCTTAATGTGCTTTTTTCCACTAAAAAGGTTTGTTTAA 
AAAAAAAAAAACTAGAGCAGCCAGGGGATTTTTATTATTTTTTTTGTTCAAATAACTAAATAGCAGCCCATATGAAGACTTCAAAGTTAACACATCT 
CAAGAGTATCCCTCAAAAGGGAGCTCAGAAAAAAAAAGCGGCAGAGTGGGGGGTTTAAATGTCGTGAATCAACTCATGGCTTAGACAGGAAAAGTTG 
GTAACCTG<JGCGGTCTGGTCTGCCTCAGGAAACAGCCAGACAAGGTTGGGCATCAGTGCATGGGAGC^TGGAAGGAGCCTTAAACTTCTCCCCTAGC 
CGGGAACCAGCAGCCTAGCCTCTGGGGAGCCGGGGTCATTTGCATGCGCTCTTAAGAGACTCTTGCTTATAATTCCCTAGTGTCATCTTTAGTAACT 
AAAGGGGTTTCTAGGGAATGGGAGGCCAGTGCCCCGGGTATATGAACTAGCCTAGCCACCTGGTTCAGAAGCACCAGTGAGAGCCAAGTGGGAGTGT 
AGGAGGCCTTGGTGCCTGGAAGTTGCTAGATAACTTGAGGCAGATGGTGAGAATGGATCACTGTCTACTTGAGACCTTTTAGGCAGAAGAGTTTGAG
GCTGTGGGTG<:TGATGGGACCCAGTGAGTCAACTCTTGCATATCCCCAGAGAGTCAAAAGATAA 

ACACTAAGCTGGCCCCTGAGGTCACACACTGTCCCCTACTGGACAATGTGTCCAGAACTTCGCATCCAACAAAGGGCCTCATGAGTGACCAGCATTA 

GGGGTCAAGCCAGCATTTCCTTCCTTCTCCCTGGCTTTGGGAAATGTCCCACCGCCTTN^ 

NNNNNNNNNNNNNNNNGCACCTTCTCCACAGTCAGCCCAGGCCCACA^ 

TTGTCAATCACCAGCTAGAGGGAGCCCAAAGCCTGTGAACTGAGTTGATAGTTGGATGTTGCAGTGGAGCCTGTCTTTATGGAAACAGGGAAGAGCA
AAGATAGGGCACCTGGGCTGACATGAATATACACTGTTACACTTAAAAAGCAAAACCAAAGAGACCCTGTTTCGGATGACCTTAAGAAACCCCTGTG 
CAGGGCTTGGAAAGAGGGCTCAGCAGTAAAGAGCACTCCTTGTTCCTGGTGAGCTCCCGGCATCCATATGATAGCTTACAACTGTCTGTAAATCCAG 
TTTCGGGGGATCCAATGCCCTTTTGTGACTTCTGCAGGCATCAGGCATGCGTGTAGTGCACGTACACACGTGTAGGCAAAATAGCCGTACACATGAA 
AAAATAAATCTAAAAATATTTTTAAAAGACAGAAACCATTTGCGTATGGTATATTCT^ 

CTGCTATGGAAAGACTGAAATACGGGTTGTCAGTTTGACCTTAGCCCTTAGAGTTTCTTCCATTTTCTTCCTACCTTATCCATTCTCCTGAAAATCA 
GCAACCAAATCACTACTGCCATGGCTCAGAGGAGCTGGAGTCTGCTCCGACCACTGAGGGCTCACCATGGGACAAATGGACAGACAAGTAGTGGTCC 
CTGATTGTACATCATCTCTGTTGCTCATGACAACAGCTGCCTGGGGCTCTCTGCTTATCATATTCAGCTGGATCCCTGCACATCGGCTTTTAATTTC 
ACAAGTCTTCATCATGTCCTCCACGTAGCAGAAAATGCTCTAGAGGATTGTACACTATAACTCGTTTGAGTCTTGGAATAGCCTAGAGGAATTTGGG 
ACTTTCCCTTTTGATAATTAGCCAGATGTGGTGGAGAGGTGGGCCTCCAGTGTGGGCTAGCTCCATCATAGAAAGTATCACCAGGAGGGTCTGGCGT 
TATGGCTCAGCTGGAATAAAGGTGCTTGCCACAAAGGCTGAGGACTTCAGGTGATCCACGTGGTAGAACGAGAGAATTAACTCCTCCCAAGCTGTCT 
CCTGTGACCTCCACACCCACAGGGCACACATCACCTACACATACACACACTCATATATACATTAAGACACCATGACCAAATGTAACTCTGGGGGGAA 
AGGGTTTGTTTGTTTCCTTTTACAGGTTGTAGTTCTTTAACCAGGGAGGTCAGGGCAGGAACTTAAGACAGGAGCCTGGAGGCAGGAGCTGAAGGAG 
ATGCCATGGAGGGGGTGTTGCTTACTGGCTTGCTCTCCATGGCTTACTCAGCT^ 

GGACCCTCCCACGTCAATTATCAGAGGAGAAGATGTACCCCAGGCTCGCCCACAGGCTAATCTGGTGGGAACTTTTTTTTTTCAGTTGAGGTTCCCT 
CTTCCCAAATTACTCTATCTGATGTCAAGCTGACATAAAAGCAGCCAGCACAGTCAGGGTTATAAAGTCAGCAGAGGGGACAGCCAGGAGAAAACAC 
GCTACTGCCTTTGACTCCCCTTTAGCTGCATCTGATGGAAATTCCCCCAGACATTAATTTGGGGAGTATGGTGGGCTTAAGAAATGTCCTCAAGTAT
TGTGTGTGTGTGGGGGGGGGGGGAGGTGTTTCTCACTATGTAGTCTTGACTGGCCTGAAGCTAGCTATGTAGACCAGTCTGGCCTTGAACTCATGAA 
GATCTGGCTGCTTCTCTACCCCCACCTCCACCCCCAAGTACTGAGATGACAAGCACGTGCCACCACAACCAGCCAGTCTTCAAATTCTTTAATATTC 
CTACCTTAAATAGGTGGGGGTCTTAGCCTGTGGTGGTATACACCTTTAAGATCAGra 

CACTTGGCAGAGGCAGGCGGATTTCTAAGTTCAAGGCCAGCCTGGACTACAAAGTGAGTTCCAGGAAAGCCAGGGCTGTACAGAGAAACCCTGTCTT 
GGGAAAAAAAAAAAAAAGACCAGCACTTGGGAGGCAGAGGCAGGTAAATCTCTGTGGGTTCAAGGCCAAACTGGTCTTCATAGTGAGAGTCTGTCTC
AAAACAATAACAGAGTTTGGAGAGATGGCTCAGAAGTTTGGTTCCC^CATTGATGTGGTGGCTCACAGTCATCTGTAACGCCAGTCCCAGAGAATC 
TGGCGCCCTCTTCTGGCCTTGTGGGTATTGCACACACATGGAACACAGACACACATGCAAACAGAAACACTCCTACACATGAAAATTAATTAATTA 
TTAATTAATTAATTTTTAAAAAGATTGAAAGCAATAACAAAAGAGGTGAAGTCTCTTTAGTCTCCCCTTAAGAGTGAGGTGTTAGTGCTTTGCCTGA
GAGTAGCAAAGCAGAAGAGAAAGGGGACTGAGACATAAATGGCTCCCTCTATTAACAGCCATGTGGGGACTCCCACTTAGAAGTGAATCTCCTAGTT
CATCAGGACTTCAGATGACAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNm 
NNNNNNNIWNNNNNNNNNNNNNNNNNNNNNNN^
NNNNNNNNNNNNl^NNNNNNNNNNNNNNNNNNN^ 



NNNNISTNNNNNNNNNNNNNNNNNNNNNNNNm 

ATTGAATGAGACCCCTCCTTCC^TACAGAAGACATCATTTGTATTTCTCTTCTCACTGGTTCCTAGTACTTA^CACTGTACTGACTCGACATCATT 
GAGTCAAAGCCACCACTGACTGCTGAGTGATAGACTTTTCAGAGCTAAGAACTTTGTGCACATTATAACTCATAACTACTTTGGTTGTTGGACTATG 
CTTTTCTTTCTATGAAAGCAGTGTCTCTGTCTTGCTTATACTTATGATTCTTGATTCTTGTTACAACACATAATAGGTGGGAAATCTGTCGGGTGAA
ACTAAATGCAATATAAATAAGATCATTATAATCCTTAAGGGGTTTTTAATATACTTAGGTTTAGAGA^ 
GAGCTTAGTCGGACCTTATTCAAACAGCCCTTTATTGGGTGC^ 

GGGCCACATCTCACCACCTTCATCGCCACCTCTGTGATTCATGCTACCAGTATCTTTTCTTTTGTTTAGGACAATGAGATTACAATGTTAAACCCAA 
CATTGTAATCCTAACTGGTCTTCCCTCATTCACTTTTCTCCCCACAGGAAAATCTCAACTGTGTCTAAAGGGATCCCAGGATACTCCAGTCAGAGCA 
TGTAGTTCCCCTGCTCTCACACTACCAACGTCTGCCCACCTGGGGGAGGCA^ 

GGACTGTATCTTCAGCCCACTGAACATTGAGCGACTTTGTGACTTTCTTTGGTCAATAGGATGTGGTAGATGGGACTCTGAGCTAGACCTGATCATG 
GATGTTGGCAGGTTCTGAGTTCTGCCTTTAGAATCCGGCACCTGCCAAGGGAACAACAGGCAGGATGGCTGGGAGGGTGGGGTGGGGTGGGCCAGGG 
AGAACAGACCAGTCAGTTACCAACCAACTCAGCAGCTGAGAGGTCTGGATGAGATCAGAGCTACTGTGGCCTTCTGGAATCTTCTGAATGACCTACA 
AACTCAGCCAAAAGCATGCCTGTTTAAAGCTCTCGTGAAGTCCTTGGTATGTTACACAGTGGAAACTTACTGATAAACTGTCTCGTTCAGGGGAAAA 
AGTCAAGTGTTTTGCCTGGCA(^TGGTGGGTGCTCAGGGGACGCTTGATTAAATAAGAAATTCCCAAATGCAGTTTGCCTTCCAGTGAAGAAAATAA 
GTTCTAAAACAATCAGCGGCTACGATAAGTGCTCTGAAACAAACAGGCAGTCGCAGTGGCTAGATCTGAGGATCTAGGGAACGGGGCACGGGCCTGT 
GGTTGGTTCTCTGGTTACTTTAGAATGAGGCAAAACTCCTCCCAATTGACTCCAAGAGCCTTGAGAGACGTCCCGGCTGATCCACGTACCGAACTTA
TCCCCTACCAGTCCCCTGCCACTCTCAGGACCTCTGATCCGGAGACACTGGGCTCTCTGGTGTTCCACCTCAGGTGTGTGCCTGGACCTTTCTTCAC
TTAGCCTCTCTGCTCCCCCTCAGGTTCCTGCTCACGTGTGACCCCTTCACCAAGCCCTTCCTGGCCAGCTTACAAAAGTGGCCACTCTTCGAATTCT
CCTACCCTGCTCTATTTTTATCTGTGACACTTGACACTCCCTGATGTATTATGTCAGCTCAGTTGTTTTACCTGTCTGCTCTCACCACTAAATACAA
ATTCTACAAGGATTGACTTTTATTATTATTATTATTATTATTGTTATTGTTATTATTGTTATTATTATTATTTTGATAGGTTTGTCCTTAGAACAAT
ATTGTCGTCAGGCAGAATTGTAATCAGTGGATACAAAAATGCTTTAGGAGCCATAACATATTGTCAGCCCTCAGATTAATCGAGGTGAGCATTTGAG
CCACACTGGGAATAGAGTAAGAGCTTTTATTTTAATATGTGTGCGTGTCTTGCCTGCACATATGTATGTGCCCCGTGTCCATTTCCTGGTACCCGAG 
GAGTCAGGAAGAGGACCCCTGGAACTGGAGTAACAGATGATTGGGCCACTGTCTGGATGTTAGGACTAGATCCCAAGTCCTCTGGACGAGCAGCCAG 
TGTTCCTAACTGCTGAGCATCTTTCCAGTCCCCAAACTTTAAAAAAAAATAGCCAAAACACTGGGATGAGAATGGGACTGTTCACTAAAGCGACAAG
CGTGTTCGTGTCATGGGACACATGCCAAACACCGCCACTCCACAGCATGCTTTACAGAGTGGGGGACATTCCTATTCAAATTGTCATCCCTCATTGC
AGTAGCCCTGAAAGGTAGTAGGTGGGGCAGGTTTTAGGTGTGTGTGTGTGGTGGGAGGCTGGGGGGGGGGGGAGTAAGGGAAGGGAAGTCACTGGAT
TCTCATCATATCGATCTGACTCCAAGTCCAGAGCCTCTGTCCCTCATTTTGACGTTCTCTCCTTGTTGGGTTATGACGTCACTGACTTCTGAAGACT 
CAAGGTCAACCTTGAACTCCTGAAGGAGCCCCTCCACAGAAGGTAGGTATCCTTTCTCTAGGGAGAATGAAAGTGTCTTGCTAAGAGGACAGGGGAT
AGATAATAAACTTCTTTAGCATCGATTCCAGAGACTCCCCGAGACTCTCAGCTAGCCCCAACGTCTTGCTTGGTCATCTGTAAAAGGCAGTGCCCTC
CTTTCTTTTTGTCCCCAGCCCACCGAACTGTCCTATAGGTGAAGGAGGAAGGCCGGTCATAAAACAGGATGGGCAGTTGCTTTCGGGTGAAACCCAG
AAAGGGTCCTCAGGCTTGTCGCTGACTGTATTTAGATTTAAAAATCAAGGCCTGTGACCTGGAGGCTCTTTCCTGCTGCTCTCTGCCATGTTGACAG 
CAGAGAACTTGCTAATTCTGGCCGGGGATTTTCCCTGTGTGGCTGTTTTTCTTCTTTATCTTCATCAATTGAAGAGCGCTCGCTCTGCCTTTCAAAC



TAACTTGTTGACGTTTCCGCCCATGTGACTTCCAGCTTGAATTACCAGAAACCACAAGAGAGGGCCTGTGCGCAAGCCCCAAAGCAGCGAGATGTCC 
CTGTGGGGAGCAGTCCCTCTGCACCCGAGAGAGAGGAGGACGCAGGGGTCTGTCAGAGGTTGCTGTTGGGCAAGCAGGGGAGGTACCTGTGGAAGGT 
GGTGTGCTGGTGGCCCCCTAGCAGTCAAGAAGGGGAGCCAGCTAGTGAGAAGATCGCCCAGAGGCATCTGGGACGGTGTGGGAGAGCCCGGAAGATT 
AGAACCATGCCTCACAGCTCCGACAGCAGTGACTCCAGCTTCAGCCGCTCTCCTCCCCCTGGCAAACAGGTACCCCCTTCGTCCCCCTCCCCCACGC 
CCCGCATGCCTTGCCTTCTGATTTTCCTGGGGTGGTAGAAGAGGGGTAGGCTCCCTTGTCCTGCCCAGGGAGCTCTGTCTGGGGAAGGAAGTAGCTC 
GTGGCAGGAACTCTCTAGCAGACTCTGGACCGTGTGGTGTTTGCTATACAGA 

TCCAAGAGGTACCCCTCGTCTCTGCCACTTAGGTTCCCACCGTGAAACTTTCTAGGAAGCTGGGGGCAGGGTGAATATCACAGAGCTTAGAAGTATG 
GTCAACCATAAGCAGAGGAATGTTCCAGTAGATCGCTGCTTCGGGCTCCACCCAAATCCTAGAGTTGTAGCTTTCCAGGAAAAAACCAAGCAAGTGG 
CAAGACAGAACCGAGGTCAGGGAAGAGAAATATAGCAAATCAAAGCAAGGTAGACAGGAGGAGAAGAGTGGCAAGATCTGCCTGCGGATTGCCTCGG 
TAACTCCAGCAGTCCGTGGGAAGGGAGGCTAAAGCTACAACCTGTATCTTTCCCCCCCACCCCCCATGTGGCTTCCAGATAGAAGAAAGAAGGCCAT 
GGTCCAGGAGGCGTGTCAGAGACTTTCACTAGTAGCTAATATCATCCCTGAAAAACCAGGGATGGAGGGAAAGATTCTGAGACTCTTAGACTCTTGA
CTGATACCAAGCCGAAATGTCTCAGACTGAAAGATGCCTCCGATGGGATGCCTTTCTCACCTTGTGGTTTCTGGGTCTCTCTACCACGTTCCTGATG 
CCCCTTGTTCTAGACATTTCTCCTCCTCATCCAACATAGACCCACGGATTAAACTTTTTTACATCTCATCTTAAAAACTATATTCTTTCCTTAGCAC 
ACATATTGGGAGGGGCTCAGAGGTAGAGTGTTTGGCTCACACACATCCAGGGGGCTGCAAAACGCTACAAACAAAATAAACGTAAACTACCTGACCA 
CCGGCTCCCACATCTGTCACAAGCCCTCACCAAAAGCCATACTGGTGAAGACCTCAAGAGGGAGATAGTCCCTCGCTCACGATGGTATCAGAGCAAC 
TCAGACTGCCTGGAAGATGGTACCCCAGTCTCTTCTTAAGAAGGAAATTACTCAC^^ 

ATATATCCTTGCCCTCTCTAACCCAATGTCCGTTTGCTAGAATGACCTGGAAAGTTTGAGTAAAGAGGGAAGGGAATCAGAACCTGACCCAAGGGTC
TTTGCCCCTCAGGCCAGTGGAGGTAGGTGACAGGGTGGGGGTTGCAGAAGGGCTTCTCTTCCCTGATCTTCTATAGGAGATACTTAGAAAGCACTCA 
AAGAGCACTGTGTCCTCCAAAGCACCCGTCAGTGGCTCACTAAAGCAGGCGACCTGCACTTCATAAAATGATTTGGTGGTGGTGGTGGTGGGGTTAC 
CTCATCTGTAGCAGGCATCATGTTCAGGACTAGGAAGGTCTCAAGATGTCTGCACAATGATGGACCGAATAAAACAGATCTGGGCGGCGCCCTCTGC 
CTAGGGGTGCAGCACAATGAAATTAATAGTCAAGGGGGAATTCTCAGAAAGGATTGAGCATCAGTGCCAGAAAGATGTCATACCCCTGGATGTCCAT 
GAGCTCAGGGATCAGGGTGGAGAGCCTCAGCTTCGTACTGTGTAAAATAGAAAAGACGGTGCCTGCTCAGAAAGCTGGGGAGTTGAAGCATAGCGTC
TCCGAACGCGCTTGTACAGAAGTTGGCACATGGTAAATGCTCAGCGGAGTTTTCAAAGACAACAGCAAAAGGAGCIATCAAGAAGGTTTAGCATGTGC 
GCACACATGCACATATATACATGCACACACACTCACACGCAC^CATACCTCAOITGCATACATATGTATACACACATGTACACACACACACACACAC 
ACAGAGTGAAGCTAGAACAGCTCTGGAGAGAGCAGAAGTGATGGCATCTTAACTGGAATTTCCCAAAGTAGCCCTCCTGAGGCTTCAATTTCTCTTC 
ACTTTTGAGTAGCTTTTGATCATAAGAAGACCATAGGAGACCCAAGGGTGGGTAGCAGGGAATGCTGAAGAAAATGTCTTTGGGTACAGGCCTTTTG 
TAAATAAGGGGAATGGACAATACAGAAAGAACTGAGGACCGGGAAGGAAGGAAGGAAAGGAGGGAGGAAACCAATGTCCTATTTTGACTAAGAGGAA 
CGATCAGACCTACCGTTGATATCTCTGCTTCCTAGGACTCATCTGATGATGTGAGGAAAGTTCAGAGGAGAGAGAAGAATCGCATCGCTGCCCAGAA
GAGCCGACAGAGACAGACACAGAAAGCCGACACCCTTCACCTGGTGAGTACCCACACTTCCTACCATGCTCCTGCCCTTAGGCTCTGCGCCCCACAC 
CAGTGAGGCTGTGGGGCCCAGCTCTGATGATTCTGCTGTGGTAGAGCTAAGTACTGGGGGAGAGTAGTAGAGGGCAAGAGCTGCAGGGGTCAGGTTG 
TGGAAAGGGCTTCGCTCTCCCATGCATGTATGAAAACATTTAGTAGCAGAGAACAGCACTGGACACTGAACCTTTCTGGCCCAGGAGAGTGCAAGGG 
TAGATGTGTGGGATGTTGACAGCCTCAAGGCTTCATGGGAGGCAGAGCCACCTGTGTGGAAAGGGACAGCAGAGAGGGTGTCGTCCTTCCTGAGCCA
CTGTGAGCAGAACACTGATGCTGAGAGGAAGTTAGGTGTTAAGTGCTTCAGGGAAGCCATTCTTTCAGAGCTGCACATCTCATCCCTCTGGAAGGCA
ATGGTCTGCCACCCAAGCACACTTGCCTTTGGGAGAAGGAGGTCGATGATGCTGCCCTTATTTTTTTTCCCTGAGCATGGAGAAAGTTCTGTGTTGA
GATTTACATTGTTAATGGTCTCTCTAGGGTCTGCTAAAGACAAAATTTTAAAGCACTTAC^ 

TGAACTGCAGCCTCACTTGGGAGACCCAGGGCCTGGACCAGCGACCCCGAGTCCTGACTGGGCGCAGTGTTCCCACCCTTCATACCTTTC 
CAGTCTGCTCCTTTCACTGTGTTCTTGCTCAACrGTTCCATC 

GCTAATATTCTAGAAATTTCCTAAAAGTGCAAGAGGTGCCAAACCCAGATAAGCTGATAATGATCTGATGTTTAGCATGGCTAAGGGCTGCTTTAAT
TTTATCAGGGATAAATTATCGGGGGCTGGGGAAATGGCTCAGGGGGTAAAATGCTTGCCATTCTAGC^TC^ 

ACAGTCAAATGGGCAAGAGGCAGAAGCAGCAGGTGGACATCTATGAAATTGAAGCCAGCATAGTCTACACAGTGAGCTCCAGGATAGCCAGGTCTAC
ATAGAGAGCCCCTGTGTCAAAAGAGAAAAAAGAAAAAAAAGAAAGAAAAAAAAGCTGAAAAGCTTAATGGATGTGGCAGCCTGACAGTGTAATCCCA 
GCTCATGGGAGGTGGAGATGGGGAATCTTAGAGTACGTTGGCTAGCCAGACTAGTCAAAACAGTAAGCTTTAGGTTCAGCAAAAGACTCTGCCTCGA
TTCTCAATACGGAAGTTGGAAAGTGATGGGAGAAGATACTGACCTCAACCCTGGGCCTCACACACGCATGTGCATCTATGTACACGCGTGCCCACAT 
ACAAGAAAAATACATGGAAACACCACACGCACATACACATGCAACAAAAGAATAAGTAACATACAGTTTTAGAAGACAAAGGGGAGGATATCGCCAC
TTCTTTCAGCCTAAGGACAGAGAACGGTCTGCCCTAGTCGCTCTGCCTGTGCTTCTGCTCTGTTTGATAGGTTCTTAATATAGTCTCCAGTACGTGC 
TAACATCATTTTCAATTGGCAAATATGTGTCAGACTAACAAGAGAAAGAATAAACAGCACTCAGCACAAACAGACCAAC 
CTGTCCCGGTACTCAAACCCTTCAAGATTTCCTGGGGGCTTAAAAATCTCA 

ATATTCCATAGAGCAAGAAAGCCACACTTACGATAAACTCCGTTTTGTTTGTGCAAGTAGACAATGTCCTAGATGGGCAAACTCCTGTAAGCTTTGT 
AAGTTTGTAATATTAAAAGAAGGAAGCTGAGTTCAGCCTCTCTCTGGCAGGGAGGCCAGACTTGAACCACCGCTCTGCCGACTGCTGTCTGTGTGTC 
TGGACCCACAAGCATTCTTCATGGTCCTTAGCCTTGGGGCCCTTATCTATAGAATGGGAAATGACAGGAACTGCTTATAGAGACAGTGCAGGAGAAA
AGCTCAGTATGGTGCCTGGCATACAGTAAATACTCAGAAGATTCTGTGCAATTAAAGTTATAGTAATAGTGTAAGAGGCGTGAAAACTGAGACTGAG 
CAGGCATACATATGTACATTGCATAATTTTATATAAAAATGTACGTGTATTTCCCAGTATGTATATATGCACATATTACATGCATGTATTGTGTGCC 
GTGTGTATTGTGTGTGTATCTACCATGTGTGTGTGTGTGTGTGTGTATCGTATGTATGTGAGTATGGGTGTGAGTGTGTGTGAGAGATGCATGCATA 
TGTGTGTGTGTGTATACTTTTTCTCCACATCATCTCAGTGCTGAGAGACTTCTTAACAGCTGTAAAGCTCATGAGACTCCCTTGCCACACTCACTCC 
CGGCAGAAGGTCCCCAGGGCACGCCCAGTCACCTCTAATAGCAGAGTGCTGCCTCCCTCTTGACACAGGAGAAGGGCCTCCTGTCTCTAGCTGCAGG 
CC^CCTTCCTGTAAACTCAGCCCCCTGGTCTCTGTTGCCTAACTTCTTCTGTGAGAGCATGAGAAACTTACAATGAAGCATTTTGAATACCTGTCTT 
GTATTCATGCATGGCGCGCTGATTGTTTTTTAAGTGACAGAGGTGGGGAGGAGCCTCCTGCGACAACTGTGTTACAATGTCCCCAGCATTGATTAAC 
TGCACATAATTCATTTGTGATTCTGTTCCGATTATGAGATATACGTGATGGTTTCCCAGACACAGATGTAAACATAAGATAATTAGGATGGCATATC
AACCTAGTCAGAGAGGTCGAATTATCTAATAAATAGAGTTGGGACACCTGGCAAGTCACTTGAAGAGACAAAAGGCCAGGTCTCTCATTCCTTATGC 
AATGCACAGACAGTTTAAATATTTTTAAAGGAAACTAAAGAAGTCTTGGGGGAAAATATGTAATTTGGAAATGTACCTTGGCATGAAAATTTTTCAA
ATAAGATAGATAGAAAATAATAAAAGAAAATTGATATACAAATGTGATTTATACATGCATCAAAAAATAAAAAAATTTGGCTGGGCATCCTAATTGG 
GCTAGGCCTTCAATCCTAGCACTTGGGAGACAGAGGCAGGTGGATCTCTGAGCTCAAGACCAGCTTCATCTACATAGTGAGACGCCCATCTTGAAAA 
AAGGATGAAGTTTACCCCCCCTCTTTCTTCCTTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCTTCCTTCC 
TTCCTTCCTTCCTTCCTTCCTTCCTTTCTTTCTTTCTTTCTTTCTTTCTTTTTTTTCCTTCTTTCTTTCTTTTTTCCAAGACAGGGTTTCTCTGTGT 
AGCCCTGCCTGTCCTGGAACTCACTCTGTAGACCAGGCTGGCCTCAAACTCAGAAATCTGCCTGCCTCTGCCTCCCAAGTGCTGAGATCAAAGGCAT 
ATGCCACCACTGCCCGGCTGAAGTTTTTCTTTCATTCTTCAAAATATTATATACAAAATCAAAGGCAAATGGAAAGCCAGGAGGGAATCATCTAGAA 
AATTCATGAGCCAGAAAGGCAGGTTTACTACATATGTGGGAACTCTATGGCGAGGGAAATGATTGATCTTTATTAAAAAAAAAAAAAAAAAAAAAAA 
AGACTATAAAGTGAGGGAAAGGGAGAAACAGAATGTACCTACACATGTAAGGATGCGCCAGGCOTCCTTGGGAAAATGCCAATGAGGGTGAATGTAA 
GACTTCCTTGCCAGGCTGACAGAGGTGAAAGATGGAGCTGACCCAGGAAATGGCCAAGGGAGACGGAGAGTGAGCATCGCATTCCCTTAGCAGGGCA 
AGTTGCTATAACCCCCTAAGACGGCAGCTTGAGCAGGCTCACTCAGGTGGCAGGGGCACCTCCCCTGTCTCCCAGCAGGGTGCTGTCCAGAATCCGT 
CCACACCACAAGCACAGCACAGAGGTTGTAGCTCTCTGAGCTGAGTGTGAGTCATTGTTGCGATGTGCTGCTCTTGGTGTCTCTTGGTTTTTGTTTA 
GTGGCACTGGGGATTGAACCCAGGGACTTGTACACCACTGAATGACTGGAAACTAGTTACATATTCATTAGGAGAATGGGGGTTGAATGAACACTAT 
TGGTCCATTCTGAGTGTGCTGTTATGGGACTGTTAAGTGAGTTGCTTCTGAGTGTGGTAAAACTATCGTTTTCTTTTCTAAAACAGTTTTTCTAAGT 
TATGGAAGGTGGGGTATGTGTACATGAGTGCAGGTGCCC!ATAAAAGACAGAAGGGGGTATTGGTCCC^ 

GTCCTCTGCAAGAGCAGTAGATGCCCTTAATCACTGAGCCATCTGTCCAATTCCTCTTTAAAGCTTTCTATGTTGCAGTTATTTATTGTTTGTTTGT 

TTATTGTGGGGGAGTGCATGCATGTGTGGATGAGTCGGTTCTCTCCCTCCGCTGTGTTGGGTCCGGGCATCGAACTGAAGTCATAAGACTTGGCAGC

AAGCAGCCGGGCGTGGTGGAGCACGCCTTTAATCCCAGCACTCAGGAGGCAGAGGCAGGCAGATTTCTGAGTTCAAGGCCAGCCTGGTCTACAGAGT 

TCCAGGA(^GCCAGGGCTATACAGAGAAACCCTGTCTTGAAAAACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGACTTGGCAGCAAGCACTCT 

CACCCCCTGAGCCCTCTTACCGGTCCCTGGTTCCCAGACTCTTAAGGAAGAAAAGTGC^ATGTAGCATCTAGCTTGCGCCAAGCCACACTTTTTAA 

TCTGTCTGAGCATAAGTATACAGGCAAGTTTCAAATGGATCACCCGAACTTATTAATATCATTTTCAAAAGGTCACACTGAAGTTAAAAGACAGGAA 

AAGTGTCCTTTGCATTTTCAATCCTTTTTTATGTCAGTTAGGAGTAGCTAAGATTTGAGTGACTCTAAGCTCCGTGGTACTAGATGGAA 

AGAATGGC^AGTCGTTTTCCTGTGGCCAGCCAGCCAGTTGAGAGTGGGAGCCTTCTGGAATCTTCTATCATAGGGGGATTTTGAGTCAATGTCCTGG 

GTCAGGGTTGGGGTGGGGGTGGGGGTGGGGGAGAACTGAGATCAGTTTAAAAGCATTGCTCTGGAGAGGGGCTCCAAGCAGAGGAGGTGGGGGTTGG 

ACAGCAGAGTCTCCCAGAGTGTGCTGTTGGCAAGTGGCGCCTGGTCTCCAGGTTGCTTGGCCATCCGTGCTGGATCTGGAAACTGGAGTACGAACAG 

GGTGGCTCCTTTCTGTGAAGATGAGGCAGGATATGACTGCTGTGGGCTCCTGAAGTAATCTCACCACAGTTCAGGACAGGCAATGTCTGTGATTTGT

AAAATCACATAAAACAGAAAAATCATGTAGACTGAGCTGATGCTGTGAAAAATGAGGACCATTGGAGGAAATTAAATAATTGAGAAATTCCCAAAGA

TGTTAGCTCCCAGGGACACTGGGAGCAAGTTTTCAGCAAGGCCATTGCTCTTGGCTTGCTGAGGTGGCGGGGAAACGAGGCTGTGAGGTTCCTCCAA 

CATTTCTCATTCTCTGTCTCTAGGAGACCACATCCTGAGGTTTATTAACATAACAAATTCATAGATCTGTAAAACAATAAAACATGAAGGGACATTC

TGGCTAGTTTCACCTTTTAAATGCTTAATCTAGAGCCAGTGAATCAGCCGCCTCCCTGGCATCTCTGCCTCTCTGAATTTCTATTTAAAATATGGTC 

ATTAGCTGGGCATGGTGGCGCATGCCTTTAATTCCAGCACTCGGGAGGCAGAGGCAGGCGGATTTCTGAGTTCGAGGCCAGCCTGGTCTACAAAGTG 

AGTTCCAGGACAGCCAGGGCTATACAGAGAAACCCTGTCTCCAAACACACACACACACACACAC^^ 

AACACCTGGCCATGGTCACGCTAAGTGCGTTAAGTGTCCTACCAGTGGGGTTGCACTTGATCTGAAAAAACAAAT^ 

GAAGGTTGGACAGCTTGAACTTCAGAGCCCTCCTTTGTAAACATAAACATCTATCTGTATCCTTGCATCTCCTGCTAAGGTCTTGTTTGTCTTGTTT 
CTATTTAAACAATACACTCTGAAATTCTTATTTTTCCCTGATCAT^ 

TCTGTCTGTCTACCCTCCCCCCTTTCTCCCCCTCTCTCTCTATCTCTCTATCTGGCTGTATGTGTGTGGGTATATATAGATCTTTATATAAATATGT
GATACATATGTATGTATAATATTTATACACTAGTTTTTAGAAGATTGTTCCATTTTAACAGTGTACACAAGAATAACCTTTGGACACAGTCCTTCTC 
ACTATAACATTGTAAGGCAAAGTCCATAGATGAGGATGCTGGAAACATGGATATCTGAACAGCAAACACTTTATGGGTGTCATAAAATTAGGCAGAT
GGTTAACAGGATATTTAACAGACTGTGATCCACTTACGCAATAGAAATTGAGGCTGACACTAGAAGTGGCCATGGAGATCTTTGTGTATAGACTTAG
AAAGATGTCTGCGAGGCTGAATTGAGCTAATCAAGCCAGGGGAGTGTGCAGAGCACCATCTGCTAATGTGAAGCTTGCAAGGTTTGGGGATGGAAAC 
ATAGTCTCCAAATTACCAATGGTGGTTACTTCTGAGTGCCTGGGTTGGGGTAGCTTGCCTTCCTCCTGGTGCTCTTTGCAGTGTTATTGTGACTTAA 
ATGTGCTGTTTAAACAAT<^GAAATAGTCCCAGAGTCTTTACAAATCAGCTATTTTAG 

GCT TAT C CA AGG ACGATGGG CTG CTCCGTTAGGG TGGGAGGG CCCCAG ACTTGAGGAATGTGTGGCTGAACTG GAT TGTTAG T AC C AGCT CAC AAC C 
CAACAGCTACCCCACTCAACAGGTGTCAGGAGGCTCTCCTCAGCGGCACCAAGTGTGGCTTCTGCTTCCCACAGCAGTGGACTGGTCCACAGGArAA 
AGGTTCCTTTCGGCTTGTTTTTCCATAGCCTCACCATGGGGCCCCATAGTAGCTTTCAGAGGAAATGCCACTAAGTGTGGCCACACTGTCTTGTTCA 
TAGGATCAAAGTGTTAAAAACGAAAACTTGGGGTTTTTAGGCCAACCTGTCATGTTACCAATGAGGAAATCAGTGTCTTTCTCGGGGACACAAAAGG 
ACCCCAGGTTCTTGCTTCCTTACCAGCCTGTATCTGTTTTGTGATATTCTTAGTTTTGTTTTTGTTTGTTTTGTTTGTTGTTCCTATCTATCATTTT 
GCAAAGGGTTAGCCAAGAAGTAGACTGATGGTCAGGATGCTTAC^CAAGACTCTCA^ 

GGAGTTACAGTGGATAAAGCCTAGAGTTTAAAGAATTATTTATTTATATGTGCATTGATTGGTGTTCTGCCTGCGTGCTTGTTACAAGGGTGGCAGA 
T CTTCTGGACTTGG AGTTAC AGACCATTGTG AGCTACCACGTGGGTGCTGAG AATTGAC CTCAGGT CCTCT ATAAGAGC AGCC AT TGC TCTTAACT G 
TCTCACCAGCCTAGGCCTAGAATTTAAAGGCCCCGTTAATTAGA^ 

TCATTAAATTAAATACATGATGATTTAATATGTATACTAAGAAGGAAAAAC&TCCTTTGAGAGAAA^ 

AAACTACACAGAAAGCTTACACACATTTTTCCTACCCAAATGAAAATAACCTAATTTTCAATTCTGATGACAAAAACAGTACACaTTGAGACAGGGT 
TTCTCAACCTCAGGGTGATTGACAGTCTGAGTCTTATACATTTTTCTTTT^ 

GCGCACGCCTTTAATCCCAGCACTCAGGAAGCAGAGGC^GGTGGATTTCTGAGTTC^AGGCCAGCCTGGTCTACAGAGTGAGTTCCAGGA<^CCAG 

GGCTACATAGAGAAACCCTGTCTTGAGAAACTAAAAAAAAAAAAAAAAGACCTTATACATGCCAGGCTAACACTCTACCGCTGAACTATAACTCATC 

TTTTCCCCACTCTACATTTAAAATTTTGAGACAAGGTATCACAAAGCTACCCAGGCTGGCCTTC 

TTCAATACTGCTGCCTCGGCTTCCAAAGCAGCTTGGGATTGTGGGCCTACACCCCCAGGACTC 

TTGTGCACCGCAAAATGTAAACTCTACCCAGTAGATGCCACTAGATTCGTCTC^ 

GAATAGTTACCCCCATTTGGTTTTGATGTATTAGGTTCTGTTTTAAAAAGGTGTTTTGTTTTGTTTTGTTTTTTGCTTCATTAGAAGTTGACAGCTT 
GCA CCAGTAAG CAGGCAGCCGGAAAGGATGGCGG ACGGCCTG G AACCCTTTGGAGTTCCG AGACTTGC CGCCGGGCCGGG CCGAGGCGGGAGGGAC C 
CTTCCAAGTTGACTTTGCGGGTGGAGGGAAGTCCTGGAGTTAGCAGGACTGTGTGCTTGCGACTGTGGGCGTGGAGAGAGCGAGTAGCGGCTGGAGC 
AGCCGTGCTCCTCCCGAACCGACCGCGGCCACGCTGCCGGGACTTCCCGGACAGCGTTTGCGCCGCCCACCCCGTGGCCACGGGCCCAGGATCACAT 
GGCTAGACCCGGAGCGGCCGGGGTGGGCGTCCTCCACTCTCTCCGGCCTGCGTTTTGCGAGTGTAGCGGTGGAGGCTGAATCGGATCCCACCAGAAA 
CCC C AAC AAGTGGGC AGT TGGGACT AGGGCG TTCACT AGGGCGTT AAGAAAT CTT CACAAAACTAGTTGGCACAATTTCTTC CAT TAGAAAGCATGG 
CATCCAGTGCCTCTCTGTGACTGCTCTGGGTGGCTTCAGGTGCAGCAACGTTACATCTCAGGAGCTGTAGTCATCTTGGGCAGGCACTCACTGTTGG 
TTATACCGTCTCACAACACAGTATGGATGCATCAAGGTGGCATTTCTAGTGCTATTTGCAACAGGGATATTTTGACAGCCATCAGAATGTCCCATAC 
CCGGCAACCATCTCAAGTAAATTCCTGATCATGCACTTGATGAGGTGGCCATGAAACACAGTACTTGTGACACTGTAGCCATTACAAAGTTATAAAA 
AAGGCAAAAGACAAATTATCTTTTCACTAGGGTTGAGCTTCTACAGAAAGAAGTATACACTTTGTTTGAAAAAAACCAGGATG 

TGCGAGCAGGTGTTAGGAGCTAGGATTATGGGTGCOTTACATCTTAAAAGCTCTCGAAGGCTAGGCAAGGTGGTACAAGCTTTAATCCCAGCACTCA 
GGAGGCGGAGGCAGATGGTTCTGATTTCGAGGCCAGCCTGGTCTATATAGCAAGTTCCAGGCCATCAGGGGTTCATAGTGAGACCCTATCTCAAATT 
AATTAATCTTTAATATAGTTATACTGTGTGTATGTATATATATGTATATACACATATATACACATGAGTGTAAGTGTGTGTGTGTGTGTGTGTGTGT 
GTGTGTGTGTGTGTGTGTGTGTGTAAAGAAGATGGCCAAAGTTGAGAACTCTGAAGGAAACTGAAATATCGCTGTTAGGTTTTGGCAGGAGGTTATA 
ATTTATCTTCTCAGCTGGAGCTCTTTAGGGAATGAAAAAGGAGTCCTAGCAAATATCATACCAGGACAGCAGAGATGAACTTGAGCCATCCCCAGAA 
AGCTTGGGAGAACCAAGGACAGTGAC CT GGA TCA CTCACT AGTGC AAAGGAAAAG AT AG AGTT CAAAAC CAAGC AGATT AGGACC AAAAGGCC ATT C 
TGTGTGCCCAGCATGGAAGCCTAAGCCATTGGTCTCATGGGCTTTACATTCAACTGTAGGTCTGGAAACCAATTCAGAATCTCACTAGCTACACATC 
ATG AAAACCGCTTTGAC ACT GAG GAT CAACTG CAAAC CAC AG AGGAGGCTGGGAAGACCTGGGGAAACGGGGCCTGTGG AGAGAC TTGGGGCCAAA C 
AC^C^TGAAGCATCAGAAACGGAATGTGAAGCCAGAGTCGGGGTGACAAGGATAAACCAGATATGTAATACAAGGATGGAAGATAGGAAATTGGAG 
CACACAAAG ACCATTGTG ATCCAGGCTGGGG ATTGCT CTC AACTG GAG ACTTTTT TAGGT AG ACC AGCTTTTAT TAATT CTT AGAC AGTT T CAT AC A 
TATAAGCAGTACATATTGATCACTCTCTGTCCCCACTCTCACCGATACATTCCCTTCTCCCTGATCTACATCCTCTGATACTGATACCTGTGTGTGT 
GTGT ATGT ATGTATGTGT AT ATG TGT ATGTAT ATGTGTATATGT AT ATGTGTGTAT ATGTGTATGTGTGTGAATG TG TCTATG TGTATT ATGTGTGT 
GTGGTGTGTGTCTGTGTGTGTTTTAATCCACTGAGTTCAATTAACACTGCCCATTGTATTGTTCACTGATCATGTCAAGTAACGTCAGCTGCAGTGA 
GTTTATTGGCCACATTACACTCAAACGAGAGCTTTCTCTAC&TTA^ 

ACTAAGT CTGTGGT ATAAAT ATG AAC AAGAC AGT TTG ACAGCTTGCCATTAATCAGAAC AAC AGG ATC AGGTTACGT CCT CT ACC CTA CC ACC CTAC 

CATCCCATGCTGCTCTCACCCCTTCCCACCCGGGCCTAGACCCCCTAAGCAGCATGAGGCCTAAGTCTCTTAAGCCTTGAATTAGAGAGAAAGTAAG 
AATGTTCATCTCTCCTGGGGAGAAAAGCAGAGCTCCAGCTGGGGTGTTGATTTGTCT 

CCTGCTGTCTTGTCACTTGTGAGTCTTGGCACACAAACCAACCTGCTGTCTTGTCACTTGTGAGAAAGAACTATGCCCTTCTGTCTGTGGGTTCATG 
AGAACTCTATGGTCAACGCTTCAGCCTCCTTGGGTTGGTCTGATCCGGGAAAGTCAAATACAAAGGAGGGAAATTACATTCAAAGTCAAGTGGTGCT 
ACTCTCTATATGGCTGGGCCTTTTGAAAAAGGAACCTCTTAGATTCTTTTGTAACTGTT^ 

AGCCTCGAAGATGTCATAGGTCTACTGAGTAGACACCGTCAGGAGCATTTGTCACATTCCAAGCTAGCAAGCAAGTAACACGATTTGGCTGAGCATT 
CACATTGTGCTGTCTCTGTCGCAACGCTGAGGACCAACGCATGGAGACCATAGCAGCTTCCTTCTCTGCCCTGGAGGAGCTCACCAATCCAAAGGGA 
CCTAGTGTGAGTGAGGTTTAACTGGTGAGTTCACACAGATCCCCACCTCTAGCCTTGCCATCCC^U^GAAGCAGGGGTGCTGGGACCGGGGCAATGCT 
GATGGAGTAAATGCAGTTGTAGCAGAGCCTGCTGACATAAGTTTGACCCCTTGAACTCACAGAACTGACTCTGGGAAGGTAACCTTTGACCTCCAAA 
GGCCTGC CACT C ATT CG AGT CTG CCCTCACACACATACATCATAC ACACATGTAATAACAATTAATAAGTTTAGCAC CT T AAGAGCTG TTCTGGGC C 
AGGCATTGGTGGCGCCCGCCTTTAATCCCAGCACTCGGGAAGCAGAGGCAGTCAGATTTCTGAGTTTGAGGCCAGCCTGGTCTACAGAGTGAGTTCC 
AGGATAGCCAGGACTACACAGAGAAACCCTGTCTCGGAAAAAAAAAAAAAACAAAAAAACAAAAAAAAGA 

TAATCTATTGTCTTTAAATGTCAAAGTATTCTAGAAAATTTTTAACTTAGAAATGTTCATTCAACTCCTAAGACTCTGTTTTATGCTACCTATGAAA 

ATATATTCTTAACTTAGCTCGGTGGTGGTGCACACCTTTAATCCCAGCATTCAGAGTGAGTTCCGGGACAGCCAAGACTATAGAGAGAAAAACAAAA 

C AAAGAAAG ATTAAAAAAAGAGAGAAATATGT CTAATGAG ATAAAACATGAAAT ACATTT AATAGTGATTAGT TCITG CT ATT AT ACT 

ATCTGCAATGAATATGATTCTCAATCTGGCTAAGCACTGGGCTCACCAGGGAAGTATTTTTAAAATCTTGATGCCTACACCCCATTTGTAGCA^ 

AATTGGGGCCTCTAAGAGACAGAGGGTGGAGTCTAGACAATGTGGAGTCTGCATGTGTGCTTGGTAATCACACAGCTTGAAAAGTGCTTTGTCAAAA 

AAAAAAAAAGTGAAATAATGCAAGGCTGAAAGTGACCTCGGGACAGGGATGGGGCAGGGATGCAGGGAGACCTTCCTAAGCCCCTAACCTCGGAAAT 

GATATGTGAATCAAAGCAGAGGCTGAGACGGCCTGGGGAAAGTGACCAATGGCAGAGACTCTGGGCCACAGGCAGAAGTCCTCAGGGACATGCAGCC 

CAATGTGGAAGGAGTCCATAAGCTATGCATGTCCTCGGTCCCAGCACCTCGTGGCTGACACACACTCCAAGCCTCTGCTCTCTGTTTTTCCACATTG 

GGCTTCCTGTGGTACCCTGGGAGCAAGCCATCCATCGGGATTTCTGTCAGCTACAAGTCATAAATAGGCCATGGAGGGGCGGCAGCAGCAAGCCTCG 

CTTTCTTTCTCTCACATACCAAGCTCAGGAATGGAAAGAGCAGGTCT 

CGGCCCTCCAGGCCTC^GTCTGTGTTGGCGGGGGGATTGATTTGTCCTTGCCCAGCAACTCTACCATGAAGAGAAAAGAGAACCTCTGTCCCTGTGC 
CTCTACCTACATATTACAGTCATAATTATGTCACACGCACACACACACACACACAGCCCCAGGTGCAAGGAAGCTGGGTATTTGTGTCTCTCTAGCT 
GAAAAC AGTGGGGT ATTTGGGGAGG ACG CTAAGTGAG CCAAC CTCT AGT AGGCCCATGG AAAAAATGAGGCCCAGGTAA CATCCTTCCAA CCCTGTG 
ACTATGACAGGACCACGAGAGCATGATTGGTGTCTGACCAGCTATTGGTGGCCAGGAGCTGAATGAAGCCAACTCCAGAACCCAATGTGTGGTGACT 
CCTCCCCACCATGAAAGTC^TGGTGGCAGATGGAGGACCGGTCCTCTCTCTCCTGGGGAGCAAGCCCACCTCTCCATTGGAAACCTTCATCAAGAA 
CTGTGGGCTCCACTTGCCATGTGTACTTGATGTCTGGCACATGGTGGCACCAACTCCAAATGCCATGTGAATTGGAAGGTTACCATGCTAACGATAG 
TGTGCATTTTCCTTTGGGCAGAAAAAAACTACAGAGGGTATTTCATCCTCTGGAAATGGCTGT^ 

CTTGGCACAGGCTGCTCTTTCAGATGGACTGCAGCACCCACACAATGGTGTGCAACCATCTGTAACTCCAGTTCCAGGGGATCCGACGCCTTCTTCT 
GGCCTCTACGAGCACCAGGCACATCTGTGGTGTACTGGCATGCAGATACGCACATAAAATTTTTTCAAAGAAAATGCAGGi^AGTCAAAGCACTGGA 
GAGCTGGCCATGAACAGAGCAGAACAGAGCTGTAAGGCATATGCCTGGGGAGTCCCTGCTGAGCCCCCCGCATGAGCTCAGGCTGGACCCCAGCCAG 
GCAGTGCCAGCTGCTCCCAGTCTCTCTCCAGCTCAGCTGCCTGGGGCTGAGGGACTAGCCTGGGGATTCCCCTGTCACTCAGGAGCTCTGAGTGACC 
TCCTCTGCAGCCCCTCCAATTGGGGATTTCCCAGAATGTTCCCACACCTCAACGCCTCCTCCTTAGCCACCCCCAGCTCTTGATAGATAGAAGTGGC 
ATCAGG TGA CCCTACAGGTG AGGGAGGATCAGCT CGAGGG AGGGAGGG AAACTGGGAGG TTGGCC AGTGAACT GAG CCTGAG ACAATCTAAGCTCAC 
AGGCTCCCAAGGCAGGAGGCAAAGGACGGGTGTTTGTGCTGCAGCCTACAAGCTGTGTGGTCTCCTGACAGGTTGCTTAACTTTTCTGTGCCTCAGT 
TTCTCCTCTTTTGTAAAAGGGTGTGTTATGCAGTGGTGTTACCAAGATTAAATATGATGATAGCTATAAAATATACAAAAGGGTGGGTGTCATGAGG 
CATATTCCTTTAACCCCAGCACTCAGGAAGCAGAGXK?AGGCTGATTTCTGTGAGCTTAT^ 

GGCTCCAATAAGACTCT CAAAAAAATAAAAT ATGTGTGTA CC ATAT ATGCA C AT C TATTT AAATTGTGGTGCCAG AC ACACCC ATAGGCAC ATGCAC 

AAAAATATGAAAAGGCTTGGGGAGGAGATTACAGGCGCTAGTCTACTAATACATCTAGGTAGCCAAT^ 

GGAAACTGGAGACACCCCC^TTCCTGTAGCTGTCGACATTAGCTGCCCAACGGGCAC^^ 

GATCTAGTTAGTTAGAACCTGTCACCAACAATACTAGTCCTCCCTGGAGGACCCAATGTCAGGAATAGCAGGCTCCAGCTGAGGATCTTTTTGGACT 

AGTCAT CTTGCCTTCAGGCG AAG CAG CTGGCATC CAATAGGAAAAGAG AGT CCT C CTGGGGC C ACGCAGCC AAGAC C AC CAATGATTCNNNNN NNNN 

NNNNNNmWNNNNNNNNNlWN^NNNNNNNNN 

NNNNNNN^NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN1WNNNNNNNNNNNNNNN 

NNNNNNNNNlTONimNNNNNNNNNNNNNNNNNNlWNI^ 

NNNN]^NNNl!^NNNNNNNNNNNNNNNNN 

NS^N^NNNIWNNNNNNNJTONNNI^NC^ 

TGAGAACCCTGATGGGCCTAACCACATTCTGAAAGGTGAAGAGAGCAGGCTAAGGTGGAGCAGGGAGGGAGGCAGGCTCGCTTGATCTCCACACTGC 
CTAGTAGAAGGACATTTCCTTCACACACAC^CCCTC^CCCCTCTCCCTC^^ 

CATTGTCTTTGTGGATAAACAGCTGACCCC^CTGTCCCTTTAGCTAGATGAGTCCTCTCCAGAGGCCCTGCTGCCCTCTACAGCTTAGAGGGATGG 
GGCCCAAAGCAGCCACTTCCTCCTACAAGTCATTCTTTCTCTAGGGAGTGGCCAGTGTCTGCAAGGCTCCGAGCTTACAGACATTCCTCTTGCAAGC 
AGCCGTCTGGCTTTAAGAGGTAAAGACACACTTTGCCTCATGGTGAGAGGCAGCAGGCAGTGGTTAGTCAGGTCTACCCCAGCGTGGGCTTTGGCAT 
CTGTAGGCCCAGGAGGTGACAGGAGAAGGCCAGCGGGAGCTCCGCAGAGCGCCTGCTAGTGACTTGCCCTGCAGTCACTTCCCGAAAACTAGGCCGG 
CCTCTAGGCGGGGAGACGATCGGAGCCCGCCCACCCAGTCCGTCCCTCCTTGCTGCCTTGGGGTO 

TAAATGGTGTAACCAGTGGAAACTCTCAGAGGGGGCGAGGAAAGGGCCAGGACTCAGGGCCCCAGTTCCCGTCTTAGCATACCGCTCCACCAGGTTC 
ATTTCGCCCACCTTTCAGTAACGGGGCGCTGCCTTCCTGGTAAAAGG^ 

GCTGGCCTCAGCCTGCAGGAACAGCAGCAGCCAGGGATTGAGCATCCAGTTGTCGGCTTCCCTCAGCCCGCTTGAGATGGGGGTTTCCTTTGCACCA 
GACTAGGGGCCCTGGTTAGCTCAGAGAGCATTCAAAGTGTCGCTGAGGCTCAGCGCCTTCCCATATGACCTTCAAAAGAAATGGAAGCTGTCTTGAT 
GAGCAATGTCAGGGTCCTCTGCTCCTCCACAACTTAAAAGAGAAACCCCGACTGTACTGAGTTTCCGTGATTGGCAGTATAGCTGATAAATCCCATC 
TCTCCTTAGCCTGCAGCTCACTCCTGCTGGATCATTTAGGGTTAATTTTAAATGTCAAGTTATCGTACATCTGCTAACCACTATACTTTTGAGTCAA 
CATTCTCATGCCAACAATrGATGTTATCAGCCAGTCAGTCCACAAACACTTAAGTTCACTTGGAGCTTAAACGGGCATTTAATACATGCTTAAATAA 
TTAAAGCAGGC TTT AATTATTTGTAT CATATTTAATAGGG TTTTT AAACTTGCC CGGGCGTT AAA ACG ACC TAAGAC AC C TGT TAAAAGCT CAGATT 
TCCAGAACTGAGGTGTTGGCTCCGTTGGTAAAGTGCTAACGGAGGACCTGAATTGTGTTCTCAAGCACCCATATAAAAAGTGGGGCGGGGGGAGCAC 

gtggctggaatgtgggcgatggcgagattggaggcagatccctgaaagcccgagggccactggcagttccaggtaatgagacatcctctctcaaaca 
aaatgtggacggtgcccgtgggagttcttctgacctccatgt^ 

gaaaagcccagattccagagtcx:aaatttcttcccaaatcccagatctgcagaagagctgcctgcaagcttctggcctgaaatgaggttcctttgcc 

TCTCAGGGGACTTGGGGGAGGCCCCCAGGACTTGGCAAGGTTCTGTTTATTCAGGTCCGGGAACCCCATTATCTGGAGAAGTTGTTGTTTCTCAGCA 
ACTTCTATGCCCCAGTGCTGTGGTGGCTGGTAAATGACGCAGTCCCCACAACACTGGAAGACATTTCTGGACCAAGCGCTTGTTGCCTCTGCACAGA 
GTACAGGGCTCCTCCTTGTTTGAAGTTTCACCCTCTATGGTTTTAGTTAGCCAATGTCAACTGTGGTCTGAGGATACTGAATGGAAAGTTCCAGAGA 
TAAACAATTCATACGTTTTAAAATAACTCCAGCTATAGTATACTGTCATAAATGTTCCATTCTGTGTTTATTTATTGATAATCGCTTCCTATGTTTA 
ATAAACTTCATCAGAGATTTACAGGTGTAGGGAAGCTCATAGTGTCTTCTGGGTATTGTCCAAGGCTTATGGATGGTTTGGAACAGGATCGCAAAAC 
CCAGGAGGG AGC AGAATAGAATG CC AAGT CCATCGTAG AT CCT CAATATTC AGAAAGGCT ATAAG C AC C CT AGCAC CTG AGC AAAGCAGG C AG AGCG 
CGTGAGGCAACCACAGACACAAGGCACTTGGCGTGTGTTGGGGAGTAAGAGGCACAAACTGCTTTCCTCTACCTGGGGCTGGCCCGACTGCCAACCT 
AGAGGTGCATACGNNNNNNNNNNNNNNNNNNNNNN1TONNNNNNNN 

GAGGTTTGTCCCTACCAAACTTGTATGTGTATTTCATCCCTAGTGAAACAACAGAGTTGAGGGGTGGTGGTGGCCACAAGGGGGGATTGTGTCACTA 

AAAAGTACTAATGTCTTTCTCAGTTTCTCTTCTATTGTTTTTCTTCCTTCTCTCTTCCCTCCTTTCCTTCCTTTTCAGTGTTGCAGATTGAACCTAG 

GGCTCCATTGCTTCTTCACACTAGGCATATCTCCCAGCCCTTGAGAATACTCATCCATTTAGAGACAGGGTGACAAGGGACTGTCTCTCTGTATTTC 

CTTTCTAGCCTGGGCTGGCTTCAAACCCCCAATCCTCCTGCCTCCCCGTGCTTCATGACAGGACTGTGCCACCACTCCCAGCTTGTCTCTCACTAGT 

GAGGGGATGGGACTGGTTATTGTAAGTGGATTGTTAAAGAGACCCCCTCTCTCTTATACACAAGCTCCTTTTCTAAGCTCCCC^CTTGCCCATCI^AT 

TTTCCCATCATGCACCCCTCAGCCACCATGTCCTTAAACCTTCAGAACTCTGATCTAAATAAACTACCCAGCCTCCTGGAGTTTTCTTATAGTGACA 

GGGAAGAGACATCTTAGGTGAAATTGCTATTCCTATTAGTCAAAATAGACAGCAATCAGCAGTTGCCCACGGGTCAACCTTATACTTACGCACACCC 

A CATACAAATCTATGTGTATGTGTAT GTGTG C AC AGG AATTCT C AGT TGC CCT CTTCATGG CTGTGCTAATAGAAAACTTTAGCTCCCTGTGT ACC A 

CCTTTCTGCC^TCATGTACCACCAACCTTCCCCCGACCCCraACGCGAGTCATCATGCCGATTGTCTCTTTACAGGAGAGTGAGGACCTGGAGAAA 

AGAACGCAGCTCTCCGCAAAGAGATCAAACAGCTCACCGAGGAGCTCAAGTACTTCACATCAGTGCTGAGCAGCCACGAGCCCCTGTGCTCCGTGCT 

GGCCAGTGGCACCCCCTCGCCCCCCGAGGTGGTATACAGTGCCCATGCCTTCCACCAGCCTCACATCAGCTCGCCACGCTTCCAGCCCTGACCTTCT 

GGACAAGAAGGGCGATGCTACTCCCGTGATCCCTTGGAGGGGCATGTAAACTGAGGCCGGGCTGCCCTCATACCTCTACCCAGAGGCCCAGTGGCAG 

AGGCCTGGACAAGTATTGAACACAAGAACTGTAGTGGTCAGAGGGACTTAAGGCCTCCCAGGGAAGT^^ 

GTCG AGCC AAT GTACTGGAC CCAAAAAATGAC AAGTC AAC CCTGGACTGTC ATG AAT GATGCCCAAAATACACAGC ACAG AGGGAGGAGGG CAGGGG 
GTGGATAGTTTTCTAAATAAATATTTTCTAAAAAACCAGCCCTGGAAAGOSGCTTCCCTCACTTCTCCCATCAGCATCGTGCGAACTACTTTCTCTT 
GATTTGGCACCTGGTTTTCCTTCTTGGCTTCAGGGAGCTCTTCCAGACTCTATGAATTCCACTTAGCTGTCTGCTCCATGTTTTGTTCCCTGTTTGG 
GAAGCTGGTCTCTTCAAATCATCTTCAACCTCTAGCTATCTCCCTCACACTCCTCCACAGTGTGACACCCCCCCCCCCAGCCGTGCCTCTTGCACGG 
TGCTTTCTCTAAATGGTGGAGCCAGTCACTCTACAGTTAGAGAAAGGGCTGTATAAGACAGGTCAGGGAAATCTCAGTTTCTCCCACTTGTACACCT 
TCCTGGTTGGCTGCTGCAGCCCTACAGTCAAGTCTCAAGGATGCTGTGGCAGGTTCCIATC 

TCAAGGCCG TAGGT AACCTTTTGCC ACAC CCTGCCAAAGACG CCCTTAGAAATAGGA CACTCAACCTG TAC ACTGAGTGTCC CAAGGTTG AGT AGC T 
CAGGGCTGTGTTCAATTTGGACATCTGGTAGAAACCAGTGTTCTTCATTTCCCTTTGATGTCCACCTGTTTCTCAGAAGTCCCCAGGCCACACATCT 
GTTGGTCTTCATGTTATGCCTTTACCCATAACAGTGCCCAGAAGCAAGAACAGCCGTCTCCrTCCAAGGTTGCTTTGGAGCCTGTCCTTCTCCATAGT 
CTCTGCTGACTGGCTGGGCACCAATGAGGTAGATAACCCAGGGCCAGGGCTCTGTTGTAGGCCCAGCCCGTAATCCCTCAGAATAGCAGAATCAGTA 
GAAGCCACCTTCAACTGCCATGCTCTCATGAAGGGGAAGGAGGAGTCGGCCCTGTGCTTGTTAGCCAAGTCCTGACCTGTTGGTTAGTGACTCTGAT 
TTCCCTAAAGCCTCATTTTGATGGTCCCTGCCCAGGTGTCTCTTACCCCCTGGAATAAGGGCAGTGTTGAGCTGATGTGAGCATTAAACTTCATAGA 
GGCCTGACTAACCTGCCTCCTGCCTATGACTTCCTTGTGACCTGTCTATGAGAGGATCTCAGAGCAAATCATGGTGGTAGGGACAGAGTGGGCTCTC 
ATCTTGGAAGCCATTGCCTCTGCATGCCCCAGAGTAGGGCTAGGCTGAGCCTGATTCAGAGTCTGCTTAGGATATAGGCTGAGCCTGATTCAGAGTT 
TGCTCAGGTTACCGCCACTGAGTACCCCCTTGTCCTTCAGTGCCATCTCTGGA^ 

CCCCCCTTCCAGAGCAGAGGCTCCATATGTCTGGGATTTGTGACGGAGAGCGGCCTTCTCTTCAGTTCCCTACAGGACCAGGATTTGGGGGTCCTGG 
C CC CTCTAT CCC ACC CAGCGGCCCCAGCATG AAACCACTCACTTCAGGTGC CCCGTGTGTCATCTGTCATC ATCTTG AGACAC AGAACTC T AGGTCC 
AGTTTTCTTTGCAGCTGACCACTACTTGGCCTTTCATAGGCAGGC^ 

CCCCCTAGtAAATCCCGCAC^GACACATGACTGGCCCGGGCATCTTCTGAATGAGACTTGGGGCCAGAACTAGAAATGCGATCTCACGTGCGCTGTT 
CCCATCTCTCAGCAGTGACTCCTCTTCCCCTTTCATTCCCCACTTT^ 

CAGACAGCAGAGGTGCAGTCTCTCTGCACGGGGACTGGGAGGGGCAGCCACTGAGTTCAGACATCACAACCCAGCT 

ACACATCACAC CAATGG CTGGAC CAGTTCTT AGGT ACACACAGGCACCGTG AATGCAGTGGATCAG AGTATAACAAATG AAT AGTATGTCATGT CTC 

TGTAGCTCCATCCAATGGGGTCATGCTATAATGTAAGTATAGTGATTGCTATATCTCCCAGTTTGTAAGACAAGCTAAAATCATAATTCCTCTATGA 

GTTCCTTTTCTCTCTCTCCTCTCTCTCTCTCTCTCTGTCTCTCTCTCTCTCTGTGTCTCTGTCTCTCTCTCTGTGTTTACCCCTTTCCAGTATTGAG 

GGTTGAAGCCAGGGCCTCCAATATGGTAGGCAC^TTCTGTGCCACTGAACCACACTTCAGCATTGAAATCCAAGTTTTAGGATTATGAGTCTGTCTG 

TCTATTTCGAGGCACTCAGGATCTGTAACCTTGGCTGGCCTGGCACTTAAATAGCTCAGGCTGGCCTTGAACTTGAAGCAATTCTTCT 

T TATGGGTG CTGGAATTAC AGAC ATC TTCT AC C ACACTCTTCTCCAATT TCTAAAATAATCTATGGACCGAC AAACT ACATTG AT AGGCCAGATTTG 

CCCCTTAC^TCTGCTTGACCTGGGACAAAGTATAAATGTCCCATGTTCTAGGTCTCAATACCTAGACACCGTTCTCTTCTCTGGCAAAGAT^ 

ACATCTGCCCTGATGTGCCAGCTTCAAATTTAGGATTT^ 

CAGTCTGTTCTTGCTCCATCCCTGGCTCCTCCTGTCCACAGCATCAAACTCTGTTCACACTCCAGCCCCAGGAAGGACAATTCTGGAAAGAACTGCA 
GCCATGGGAGGAGCATTAGCTTAGGGAGTCTGGGCATCCTGGAGACACAGAGC^CTGCGGGCTCTGAGGAGGTGTGCCT 

CTTTCCTCGAGCCAGGCGTGGTGGCACACGCCTTTAATCCCAGCAGTTGGGAGGCAGAGGCAAGTGGATTTCTGAGTTCGAGGCCAGCCTGGTCTAC 
AAGTGAGTTCCAGGACAGCCAGGGCTACACAGAGAAACCCTGTCTCAGAAAAAACAAAACAAACAAACAAACAAA 

GCCCCACTAGAAGGAATAAAAAACCAGGAGACAGACCAGGTTACCAGCTGCTGGACCACAACTGGCTGGCAATGTATA7TTGAATGTATACTTCCAG 
CCCCTGTCACAGTTGCTCCTGTGGTTTTGTGACACAGGAGGTGAGGCATCTGGGTATCCCACTTACTTGGAGGACAGGTAGGCTGTCTCCATCCTCC 
TTCTGCCCAGGTTCCCTTCTCTCCAACTGATTTCAGCATTCCAAAGTTGTTCCAGGGAG^ 

CTAACTTCCAGTTCTGCTTTGTTTTGTTTATTTATTTTTAGACATGGAGTCTTTCTGTGTCTATGTATCTAGTCTGGCTTCTAACTTCAAATCCTTC 
TGCCTCATCCTCCTGAGAGTAGGGATTGTAAGCATAGAGAGAATTAATACCTTAAGACCCGCCAAAGGGCAGTGCGTGACTCAAAGGAATGTGTTTC 
T CTTGAT TT TAGGT C AC C AAC ACTG ATGCAG GGATGTTTGTT AGAATTG TTGCT ATAGAGATGAGC AGTGT CT GCCCTAGTC TTC AGT AC C AC ACAA 
AGAGTGGGAGGTAGGACTAGAAAGGGGACAGAGATGTGCCAATCCTAAAACTCTTGTGGGGTATGTGGCTTAGTTAAGTTCACCATGTGATGTAATG 
AAATACCATGAACAAGAACTTGGGGAGGAAAGGGTTTATTTGGCTTCTATTTC^^ 

AAACAGGGCAGGAACCTGGAGGCAGGTGCTGATGCAGAGGCTTGGAGGGATGCTGCTTCCTGGCTTGCTCCTCATGGCTCGATTATCCTGCTTTCTT 

TCTTCCTTCTTTCTTTTTCCCTCCTTTCTTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTCTCTCTCTTTCTTTCTTCCTTCCTTCCTTCT 

TTCTTTCTCTCTCCCTCCTTTCTTTTCTTTCTTTCTTCTTTCTTCTTTCTTTCTTTCTTCTTTCTTTCTTTTCTTTCTTTCTTTCTTTCTTTCTTTC 

TTTCTTTCTTCCTTCCTTCCTTCCTCCCTTCCTCCCTTCTTTCTTTTTCTCTCCTTTCTTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTCCTTCCTT 

CTTTCTTTTTCTCTCCTTTCTTTTCTTTCTTCCTTCCTTCCTTCTTTCTTTTTCTCTCCTTTCTTTTCTTTCTTCCTTCCTTCCTTCTTTCTTTTTC 

CCTCCTTTCTTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTCTCTTTCTTTCTTCTTTCCTTCTTTCTTTCTCTCTCCCTCCTTTCT 

TTTCTTTCTTTCTTCCTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTCCTTCCTTCCTTCCTTCCTTCTTTCTCTCTCCCTCCCTCCTTTCTTTC 

TTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTAACTTTTTATTGACTCCTCATGAGTTTCACATCATACACCCCAGTCCCGCTCCCATATCCCCT 

TTCAACTTTGCAACCTCCCCCCCAACATAAAAAACAAACAAACAAAACCOU^GCATAGAAAACATCTCATTGTGAAAGCTGTAGTATGTCACAGT 

TGTCCCACAGTATATCCTTCTGTCCACACATCTTCACTTGCAAATATTCGTTGCATTGAGTCAGAGTCATTGGTCTGGTTAGAGATCTCTGGCTTCT 

GTGACAC CATC AAT ATT GGATTC TT ACCAGG ACT CCTCATGGTTATCC TGTTGTTG C CC CGTGT C ATGGAGATCCTGCAGCTTTGGAACAAC TGGAC 

TGACCCTTTCACGTGACCCAACTTTTAGCAGATGATATAGATTTAGGGTGTACCAACTCAGATCTTGGCCTACCAGGCAGCCAAGCTAGTCAGCCCA 

CTCCCGCCCAGGTCCACCTAGACCAATAGGGTCTCCCTTATCTACATCACCAGGGGAGCTTGCAGTACTACTCTGGCTAGGCCACCCAATGCACCAT 

CAGCAGGACGCAGGTTCAACTCTGTTCTCATGACTTCTCACCGGCTCACCCACACCTCCACCATCAAGGCCAACTCCATGGTGTTGCCTAGATGAGG 

TGTACGGCCTGCTCTCCTGAGTGGTGCAGCTGGTGAGGGGTAGGGCCAGCTCTCCCACACTCATGACCCTGTGGGCAGCTTTCCCGACTGTCAGAGG 

TGATAGGGGCAGGGCATGATCTCTGTGCCCATGCCACCCCACGACAGTAGAGTGCTGTCCCTGGTGGCCACCCTGGCCCCCGACTCTCATCTTGTTT 

TTATGCTGATATGTGATGTG AGT TAACT CCCTTAAGAT AAAC AAAAAC CGAATAC TGCT TGAGTAATCT ACTGGTAAGCAGAGCTAAGAAACCCTCT 

CTTAGAGAATTTCATAGGAAAAGGCACAGATGGATCCTGTGCTCAGGGGTCTGCCTGGTAGACTCCTGTAAAAACGCTGCCAAGCCTGTCAATGACA 

ACCATGGTTGAAAGGGGTGGCAGAGACTGGTCATTGTCTTATATTTTCTTGGCAATAAGATAATAAGTTCCATCTTCCTGCCTACATTGCAGCTAGA 

CCTGGCCATGTGACACAGTTCTAACCATGAACTTTAAACAAGTAGGTCTTACAAAAAAAGCTCTTTAAAAATTGAAGCTGTCAGGGGACTGAAGAGA 

TGGGTCAGTGGTTAGAGCACTTGTTCTTGCAGAGGACTAGGGTTCAGTTCTCAGAACCCATAGGGTGACTCAAAGCCATCCATAATTTCAATTC 

GGGATCTTATGTCCTCTGAACTATGAGAGAACCAGGCACACATGGTGCAAATACACGCAGGCAAAACATTCATACACATAAGAATAAAGAAATCTAA 

GTAATACTAGAGAGG GAG CC ACT AT CTTAGG ACC ATG AGCAAACCATG CATGCATGGAAAGAAAGG AAAGAG ATT CTAGGTCACTG ATAATTTT AAA 

AAAAATGGGGGGGGGGGGCTGGAGAGATGGCTCAGCGGTTAAGAACACTGCCTGTTCCTCCAGAGGTCCTGAGTTCAACTCCTAGCAACCACATAGT 

GACTCAC AAC C ATCTGT AATGGG CTCCAATGCCCTCTTCTGGTGTGTCT GAAGACAG CT ACAC TGT ACTCATAT AT ATTAAAT AAATAAAT AAATAA 

ATAAATAAATAAATAAATATTTTTTTTAAAAAAATGGCTAAAACCATAATGGTTTAAAAAAAATACCATTCTTAACCGTGCATGGAAGTGCATG 

TTAATCCCAGCAATCAGGAGGAAGAGACGGGNNNNNNNNNNNNNNNNN^ 

NNNNNNN^NNNNNNNNNiOTTNlOTNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNTTON^ 

NNNNNNNNNNNNITON^NNNNNNNNNNNNNNNi™ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^NNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNN1TNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNMNNNNNNIWNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNI^NNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^NNNNNNNNNNNN^ 

NNNNNNNNNKNN^NNNNNNNNNNNNmNNNNNNNNNN^ 
NNNNNNNNNNNN^NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNl^NNNNNNNNNNNNNNNNNNNNNNNISNNNN^ 
NNNNNNNNNNNNNNNNNNNNN]^NNNNNNNNNNNN^ 
N^NNNNNNNNNl^NNNNNNNNNNTCTCCCTCTCCCT^ 

AGTTATGTAAATAAATCAGGAAAGGCAGGCTAAAAGTCAAAAGAATATAAGTAAAAT^ 

TATGTAATTCTTATGGTATATCATGATTTTCGTGAATCTCTTAATATAGTTACTGACTCAATATTCAGAAAGAGTTGTTTTGCATATAGAAAACTGC 
TGATTT AGAATTAACATCAC TGTTTATT CAACAA CAGCAAGT AAT CAG AAATAG AAATC CTCC ATTAT AT AAC ACATAT CCAATCCC ATACAGGTCT 
GCCAGGCCCTCTAGCACAAGATCATGATGAAATCGGTCAACCATTAATAGGATACTTGCTAGAAGCT^ 
AAAGGGGATTTTTCTATCACTTGACAACACACTGAGGGAATTATAAGGAAATGTCCTGCTTGTTCTTTGT^ 

GTAACC CTAAAGGTCC CCCAAAGAAATTAAATTTTGCAAATGGAT ATGTTTCATTTGCAGAGTTTAG AAAGATTAAAATATGTGCA CCA 
ACACGTATTCAAGATTTCAGTGGGCACAGCTTTAAGTTC 

C CCT AC AAATT AAAACTGAC AAGGCTCC AGC ATATGTCTCTAG TAAAATGAAGCAATATTTTTGC CTACTACAATAGAAAG CATATTGC AGGTAT CC 



AGCCCTGGCTGTCCTGGAATTCATTTTGTAGACCAGGCTGGCCTCGAACTCAGAAATCCGCCTGCCTCTGCCTCCCAAGTACTGGGATTAAAGGCGT 
GC^CCACCACGCCCGGCAAAGGGAGTCCAAAAAGCCTACAGCACCT 

TAGCTTCCGAGATCAGACGAGATTGGTCACGTTCAGGGTGGTATGGCCATAGACCTAATTGAGTTTTTAAAGGGATGCATAACAAATGAAAAGGGAT 
AACAAAGACCCCCAGAGAAAAATTACATAAGATTTTACTAACTTCATTTTTTTTTTGGTTTTTCTTTTTTTTTCTTTAAGATTTATTTATTTATTAT 
GAGTA CACT AT AG CTG TCTT CAG ACAC ACC AGAAGAGGGC AT CG AATCCC ATT ACAG ATGGTTGTGAGCCACCATG TTG TGGCTGGGAAT TGAACT C 
AGGACCTCTGAAAGATCAGTCAGTGCTCTTAACCACTGAGCCATCTTACC^GCCCAACTTTAAATTTTTTAAATGCTAATGAACAACAAAATAAACA 
AGCAAACAAAACAGCTGTTGAGAGACATTGGATTACAGGAAAAAAAAAAAAAAACGCTGAGCTAACCCAGCCTGTTTATATTAAAGATATCCTAACT 
TCAGAATGGAAGGAAGGAAATATGTTATGTTGGGGAAGAGGTTTTATCTTTGTTTTAACAAGAGAAGAAAAGCTATGGACTCCTTCAAACCTAATAA 
AGATTAGATTTGACCAGAGAAGACTTCCAGAAGACCCTGACTTCAGA 

MOUSE SEQUENCE - mRNA 

GCAGTCCCTCTGCACCCGAGAGAGAGGAGGACGCAGGGGTCTGTCAGAGGTTGCTGTTGGGCAAGCAGGGGAGGTACCTGTGGAAGGTGGTGTGCTG 
GTGGCCCCCTAGCAGTCAAGAAGGGGAGCCAGCTAGTGAGAAGATCGCCCAGAGGCATCTGGGACGGTGTGGGAGAGCCCGGAAGATTAGAACCATG 
CCTCACAGCTCCGACAGCAGTGACTCCAGCTTCAGCCGCTCTCCTCCCCCTGGCAAACAGGACTCATCTGATGATGTGAGGAAAGTTCAGAGGAGAG 
AGAAGAATC GC ATCGCTGCC C AGAAGAGC CG ACAGAGACAGAC ACAGA AAGCCG ACACCC TTCAC CTGGAGAGTGAGGAC CTGGAG AAACAGAACGC 
AGCTCTCCGCAAAGAGATCAAACAGCTCACCGAGGAGCTCAAGTACTTCACATCAGTGCTGAGCAGCCACGAGCCCCTGTGCTCCGTGCTGGCCAGT 
GGCACCCCCTCGCCCCCCGAGGTGGTATACAGTGCCCATGCCTTCCACCAGCCTCACATCAGCTCGCCACGCTTCCAGCCCTGACCTTCTGGACAAG 
AAGGGCGATGCTACTCCCGTGATCCCTTGGAGGGGCATGTAAACTGAGGCCGGGCTGCCCTCATACCTCTACCCAGAGGCCCAGTGGCAGAGGCCTG 
GACAAGTATTGAACACAAGAACTGTAGTGGTCAGAGGGACTTAAGGCCTCCCAGGGAAGTATAGTCAATGTACTGGACTCTCCCAGGGAAGTTCGAG 
CCA ATGTACTGGA C C CAAAAATTGAC AAGT C AACC CTGGA CTGTC AT AGAATGATGCC C AAAAT ACAC AC AC C AG AGGGAGGAGGGCAGGGGGTGGA 
TAGTTTTCTAAATAAAT ATTTTC CAAAAAACC AAAAAAA 

MOUSE SEQUENCE - CODING 

ATGCCTCACAGCTCCGACAGCAGTGACTCCAGCTTCAGCCGCTCTCCTCCCCCTGGCAAACAGGACTCATCTGATGATGTGAGGAAAGTTCAGAGGA 
GAGAGAAGAATCGCATCGCTGCCCAGAAGAGCCGACAGAGACAGACACAGAAAGCCGACACCCTTCACCTGGAGAGTGAGGACCTGGAGAAACAGAA 
CGCAGCTCTCCGCAAAGAGATCAAACAGCTCACCGAGGAGCTCAAGTACTTCACATCAGTGCTGAGCAGCCACGAGCCCCTGTGCTCCGTGCTGGCC 
AGTGGCACCCCCTCGCCCCCCGAGGTGGTATACAGTGCCCATGCCTTCCACCAGCCTCACATCAGCTCGCCACGCTTCCAGCCCTGA 






HUMAN SEQUENCE - GENOMIC 

ACCCTGGCCTGGAGATGTCCAGCTGAAGAACAGAGAATGCTTGGTTGGGAGGTAGAGACTTGAGTTGCTCAGCAAAGTAGGAATAAAGGTAACAGGG 
CTGTGAGATTTGGGAGCAGTAACAGAGAAAGCCTCAGAAGGGAAGGGGTAGACAGGCACAGAAGGCATGAGAAGGGTGACTTTCGAGGGAAACTGTA 



ACTTCCGCCTCCCGGGTTCAAGTGATTCCCCTGCCTCAGCCTTCCTAGTAGCTGGAATTACAGGCACCCGCCACCATGCCCGGCTAATTTTTGTATT 
TTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGGCTTGAACTCCTGACCTCAGGTGATCCACCCACCTCAGCCTCCCCAAGTGCTGGGATT 
ACAGGCATGAGCCACGGTGCCTGTCTAGAAAAAGGATTTTTATAGAAAGTTGGTATGTGATGTTATTCCCTTTTTCCCTTTTCTTTTTTTTTTTTTT 
TTGAGACAGGGTCTCTCTTGTTGCCCAGACAAGAGAGAGTGCAGCAGCATGATAACAGCTCACTGCAGCCTTGATGTCCCAGGTTCAAGCAATCCCC 
C CA C CT CAG C CTCC AGAGTAACTGGG ACT AC AGGTGTGCACC ACCACATCC AGCTATTTTTTAAATTGTTTTT AGAGGTGGTCTTGAACT CCTAAGC 
TCAAGTGATCCTCCTGACTCAGCCTCACAAAGTGCTGGGATCATAGGTGTGCACCACCCCATCCGGTCTTACTCTCTTTTTTTCAATATTTCTATTG 
GAT GAG ACT CTACTTGG ATAATC CC AGTTTTATATGGGTT ACATAAAACAAATA TTACGGCAGTAATAGCT AA TAGGCACTGT AT GCT TACTAGGCA 
CTAATCATTGCACTAAGCAATTTAAATACATTCAGTGGTATGAGGTGGCACCATTATATAGATGAGGAAACTGAGGCACAGAGTTTTAAGTGATTGT 
CCAAGATCA CAGAGTTATGA AGCGACAG AGCCAG AATATTGGATTTCA TATT GAAACCC AAAGCTAAG AGGCCGTTTTCGTGT TC CTGCTTTC CTCC 
TTGACC ATGTCTCTCTCTGT ATG CTGTCAGGGTTGAGTGA GTGAGGGAAGG ATG CTGAAAAGT CATTT GGTTC AGGCCT C ACATT TTC AAAGC ACTT 
AACTCCAGTGAGGCCGTATATC CAG CCTGAGATATTT AG CCGGGCCTG AGAGATGAGGAGATTTACCC AGCTT TTG AGT T ACTTT CC ATTTATTGAT 
GCTACCACCTCCAG AGAC CC AGGAATTAAAT ACT C AT ATAAATTCC AT C ATTTT CTGGTTGT AGTATAAGT AC ATT ACTTTT AAAACATACCATATT 
TTTTGTTAACCTTATTTTGATATATCTTCATTTTTAATCTGATGGGAAACT^ 

CTT T CCTAC TGTCTG AAAGGTGC AGGAAG AGGAGG CCCTGCAGGAAGC AGCTCTGATGCCCACTCCGGCTGCTGCCAGTGGCTGG CTGGATGTGCTA 

TCAGCATGGCCATACCCACAGGTTGAAGTAGGTCAACAACCTGGGGACCCAGAAGCTGCGTGGTGCAAAGTGCATCTATCCGGAAATTTCCTGAGCT 

TCAAGGGGGTTTATAAGGCAGCCACCAGGCCCAGACTGACCAAGAGAACAGGAGCTTTGGATGCATGTTCTTTAACCCAGTGGCTCTCACCTGGGGC 

GATTTAGCATCTCCTCCCCGTAGAACATCTGGCAATTCTAGAGACATTTTTTTATTTTCACAACTTGGGGTTCTACCAGCAACTAGTGGGTGGAGAC 

CAGAGAAGCTGCTAAACATCTTACAATCCACAGGATGGCCCACCGC 

TGTTCTAACCCTCTGGGGCTGGCCTTATGAAGCX3CAGCCACACCCCTCCTAAG 

GCCACTCCGCCGCGGGATTTCCCCTTCATCACCTGGCCACCCACTCTGGCCCCCTTACTGGAGGGCTCCGCCCTCAGGTAAATGTTTTCCCTACTTG 
TTTGGGAAACTTTGAGAAAAATCCCCTTTAGCC^TCTTCTGAGCAGTGAAGATGTACCAT^^ 

GGGGATTTTTATTATTTTAGCTCAAATAACTAAATAGTGGCAGCCAACTCCTTTAAAGGCGACTGTCAGCCCTCAAAGGGGCGCTCAGATGGAACTC 
AGA AAA AAT TAAAAAAAGAG TGAGTTT AGTATCAGTGG ATCAGGGGAG TGGGGATAACCTGGGAATTCTGGTCTGGTGGGGC AGG CGACAC AG ACCG 
AAGGCCTCCGCCCACCCCCAGCCTCAAGCCCCCTCTGCGTGGGGCCCCTGGGGGCCTGGGGACATTTGCATGTACTCTCAGGAGATTCTCCTTTCTT 
CTCCCTGGGTTTCATCTTTAATAACTAGATGATGGGCTGTTGGGGAATGCAGGGCCAGCGCTCTAGGAAAGTGAAACTGCCTTGCAGAGGGAAAGCA 
ACCCACTGTGGCCCCGAAAGACCCAGACCGCCAGCGGCCCCGGGCAGCCGCCGAGGGCACCGGCTCACACTTCCTGCATTTCACAAGCTCTTGCTTG 
GCATGCATTGATGACTTGAACTGGGGGGTTAACCCAAAAAGATCAAGTGCGAGCTCAAGGCGGATGATGAGAACAGATTCGGGGCTGGTTAACCTCA 
GAGAGCGGGAGTCCGTTGGTGCTAATGAGGCCAAGGGAGCCCGGCTCCTTTCTGCCCCCAGGCAGGTGAGAAAGAAAGAACGGAGACGAAAGAGGGC 
CCGCTGTGTCTACTCCTCTTGGCCCAGAAGCCCAGCTGGACCCTGAGGCCACAAACTGCCCCCCACTGGAGAAATCGCTGCAGGATTTCTCCTCCAT 
CAAATCGCTGTGTCAGAGCCAGTTGGGGGCTTTTAAAAGACGCCCACGCTCAGGCCCGCACCAGGCCACCTACACTGGAATATCTGAGGCTAAGTCC 
CAAGCTAGAATTTCCCATCTTCTTTCTGGCCTTGGGAGATTTTGCATCTCCTTCCAGGGAAGGAAGCCTGACTTCTGAAATGATTCTCACGAAGGAG 
AAGTGCTCCGTTCTGCAGTGAGCCTGGCCAGGGGGGCCACACC 

ACG ATTGGG CCAGAGCATCTCTG CTCTTGTAAAATAG C AGGC AATT AG ATCTCG AAGTCTTTGAAGTGGGCAG ATT CAC CAGG CGTTGGACTG AAGT 

TTCTATTTGTCCTGGCACTGAGAATAGCAAAGGAACAGGGGGACTGAACTGGTAAGGATATAGCA 

ATTACCTTAAGAACCCATTTATGTGCAGTATATCCTGCTGCTGATTTTGAACAAGGATGATCCAAT^ 

TTGACAAGTAGCTGCTTCCCTAAATCCTTGGAGTTTCCTTCCTCCCTCCTTCCCCTCCTTTGGCACCAACCCCCTTCCCACCTCCCCAAGATCCAGC 
AACTAAAAGGTTATTTGCTTGGCACGGCAGGGAGTAGGGGCTGTGCTGAATGGTGGGAACCCACGCTACATGGGTTGTCAGCTGTGTCACCCCTAAT 
TAGAC&TCATTTCAATCTACTCATGAAAGCAGCTTGCTGCAGGACTCTGTGCTGGCCATAT^ 

TAAAAACTCATATTGTATTGCCCATGTACCAGAGACTATTCTAAGAACTTTACAAACATTAACTCATTTGATCCTTATAACGACCCTATGATATTGG 

TGGTGTGATATCCTCCCGTTTTCTAGGTATGGAAAGAGGCATAGAGAGGTGAAGTCTCTTGCTGATATCTTCTAGACCGGGCTGGCTGCAGGGTCTG 

TTGCTCTCAACCAGAGGAAATAGAACCATGAAAGAACAAGGTAGGGTTTTCCTCCTTCTGCCAAACTGAGATTCACAAAGTGGACTCTAGCATTCTT 

CCTCTCTGCCATTCCCTCCCACCAAATTTACACCTGGGGAAACAGTGGCCCAGAGAGGGAAGGAGACCTTCCTACAATCTCAGAGTCAATTGGGGGC 

AAAGCTAGGACCAGAATGCTGCCTCTCAACTCCCCCTCGACCTCATATCAT^ 

TATGGTGATTTTAAGTCCATAAATTCTTTATTTAATACTCCTCCCTCTAAGAGGTAGAGAC^ 

GCTCCATTCTACTTTTTTCTTTTAGGTCAGCCTGTTGCCCAGGCTGGAGTGCAGTGGCTCAGTCTCAAATCACTGCAACCTCCGCCTCCTGGGCTTA 

AGTGATTCTTCCACCTCAGCTCTGCCCAGCTAATTTTTTGTAGAGACAAGGTCTCACCATGTTGCCCAGGCTGGTCTCAAACTCCTGGACTCAAGCA 

ATCCTCCTGCCTCGGCCTCCCACAGTGCTAGAATTACAGGCATGAGCCATGCCACCCGGCCCCAAGAGTAGAACAAAGATGGAGGACTAGGTCATAA 

AAGACACTGCAACTCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTCTCTCTCTCTTTCTTTATTTCTCTCTCTTGTTGTTCACTCTCCAGG 

GAG C CAGTTGC CAT GTC ATGGGG AGGACACTCAG GCAG CCCC GTGGGGAGACTC ATGTG AGG AGG AACTGAGATGTCTTG CCAACATCCA TATGAGG 

GAG C CATCT TGTAAATG AATCTT CC AGT C CCAATCAAGTCTTCAGATG ACC AGAGCCTC AGC TGAT AACTT GGCTGC AG CCT C ATGAGAGATCCTG A 

GCCAGAACCACTCAGCCAAGGTGTTCCTAGATTCCTGCCTCTCAAAAACTGTGTGAGATAATAGATGCTTATTGTTTAAAGTTGTTTAGGGTAACTC 

ATTGTGTAGCAATAGATAGCTGAAACAGGAGGCTCTACATTTACTCTTTTTTTTTTTTTTTTCTTGGAGACAGAGTCTTGCTGTGTCACCCAGGCTG 

GAGTGCAGTGGGGCAATCTCCGCTCATTGCAATCTGCGCCTCCCTGGTTCAAGCAATTCTCATGCCTCAGCTTCCCGGGTAGCTGGGACTACAGGAG 

TGCGCCACCATGTCCAGCTAATTATTTTGTATATTTAGTAGAGACAGGGTTTCACCATGTTGGCCAGGCTGGTCTCAAACTCCTGACCTCAAGTGAT 

CTGCCTGCCTTGGCCTCCCAAAGTGCTAGGATTACAGGCATGAGCCACTGCACCCAGCCTACCTTTATTCTTAAAATGCCATAAATGCACAGAGTGT 

TTTGAGGAGTCTTTTATCCCAAAGTATTCCAACTATATTTCTTTTAAAAATTTAGGGCAATTAAGGCATTTTAATGCATGTGACCCTCCCCGCCCCA 

CATTCCCCCACCCTTCATTATTTCAAACTGGAGTGGTCTTTTGAATTTTTTCCTGAACGTGTTTCCAGAAAGTTTATACCAAGTAAAACAGCTATAT 

CCAAGTTCCTAGTAATTATTACAGCAATTACAAGGGTTCCAAGCCTAATAAAGTAATCTAAGGGCAGGAGGTTTGGAGGTATGGTTATTTCTTGGCA 

TTCTTCTAATGACAGCCTGGAGCTGAGTAGCATCTTGGCAGTTGTCTACTGTTTGAATACGTTAGACTGCTCCCAACTCTGTCACTTGCTAACTGTG 

TGACCTTGGGCAAATGATCGAACTCTCTGTGCCTTAATTGCATATAAATTTTAGAATCTAATTATGATACCTGCTCTGTGGAATCCTGGGGAAGATT 

AAAATGAGATGTGGTAGAGCATGGTGCTTATAAGCATTGAACCAAATTGCCTGAGTTTGAGTCCTAGCTCCAGTGACTTTTCTAGCTGTATGATCTT 

GGGAATGTTACCTAATCTCCTTGTGCCTCAATATTCCACATACATAAAATGGGGTGAGGTGGGGAAACGGTGCTTACCTCCTGTGACTGTCATGAGA 

ATTAAATGGAGCCAATTCATATACTATTCCCGGTACACAGTAAGTAGCATATGGCAAGGATGCTAGGTATAGTTGTGCAGGTTGTTCACTGTTCAAG 

GGCTCCAACCTGAGAGGTCAAGTGAGGGCTTGCCAGGCCATACGCTTTGGTATGAGGTTGCCTACCTAGAGGTGGCGCTGTTTTCTAGTTTGTACCA 

GGCCCT ATATCAGCC AGCTGTCAGC ATAAGGGTT TGT AGT AT CCAGCAACC ACTACCTT AATACTATCTTTAGG CT TTAGGCT GATGTTC ACT GAGC 

ACATGCCAGTTTTATTATAACTGTTACTTG<3TAGCTTCCTGGAGTTCTCCAGGATAACATCAGTGGGATGTGTATATCCCAAACCTTGCCATTACCA 

TTCAGATACAATGCTGAAAATC^ATAAAAAATATTAGCAAGGTAAAAAGTTTAAAATCCTGCAGGACAATATTATATATTGCTTAGGCATACATAC 

AGGTTTGGTTCAAGTATAGAAAAATGCTTAAGAATGATAAACCCCACATCTGGATAATGGGTTACCTGGGGTGGGGAGGAAGCGGAATTAGAAAGGG 

ATGAAGGAAGCTTCGGCTCTATTCATAATTTATTTCTTCAGCTTAATGGTGGCTACATAAGTGTTTATTATTTTTT AATCTT AAATATTTAAAAATA 

ATGTATAATTAATACTATTAAATAAGACCCGTCTTTTTATGTAAAAACTAATTTTGCATTTCTCTCATTAGTCTCTGTTAAATATTAAATCAGTACA 

TTGATTGGGAATGACAGAATCAAGCAATCATTCATTCAACAAGTGTTTACTTGTTGAATGATAGGATGTTCAAAGAGCTAAGAACTTCATCCATACG 

CTGTAATACTTAGAAOSCTTGTAACATTTATAATAACCTGTTTGGTTATTTGACTATGTTTTCTTTTTTCTGTGAGAGCAGAGTCTCTGTCTTATTC 

ATATTTAAGAATTCTGCTCTGTAATAGGTGATAAATAAATGTTAATTTAAACCGAATGAAACATAAATATGACTATTATTGTCCTGAAAGGGCTTTC 

AATACACTAGGATAATGCAGGTCATAAGTGCCATAAATGAAGAGCAAAGTGAATTCACTTATGCATTC^TTCGTTTGAAAACCATTTACTCGGCATC 

TGTTAGGTACTGAGCATATGACTATAAACAAGAGAAACACAGACTTTGGTTTTTACAGGGCCCACTTCTCACCACTTTCATCACTGTCTAACTGGCC 

TATGCTGCCAGCATCTTTTGTTTGGATGGTTGCAGTAATCCAATACTGAAATCCTGACTCATCTCCCTGCTTCCACTTTGCTCCCCTATCGTAAAAT 

CTCAACTGTGTCCAGAGAGATCCTCTAAAAACCTAAGTCAGATCATGTCACTCCTCTGCTCTGACACCTCCAATGGCTTCTCATCTGTAGGAGATTG 

TAGAAATTGCCACAATTAGTTAAATTTTTCCTATCGAGAAGTGAAGTCTGTTTCTCAACTCATTGAGTCTGAACTGGCCTTGTGACTTTCCTCGGCC 

AATAGAATGTGGCAGGTTCTGAGTTCTGTGCTAGCTCTGAGCCTGGGTTCCATGGGGCTTCGAATGCTTCTGCTCCTACTCTTGGAATCCT 

ATCAAATAAACATGCTCCGACCAGCCTGTTGGTGGATGAGAGACCATGTGGAGraAGAAATGCCCAAGGCAAAGCCAGGCCAGACCAGTCAAT 

GCC AAC CTACC AGC TGTCTG AGG AC ACATGACCG AGCCTAGC CGAGAT CAG C CTAGC CC AGC CTATCCCCAAAG AACCTT TCAGATGACCCACAGAC 

TCATGAGCCAAATAAATGGCTTGAGTC^T<^AAGTTTCTGGTAGTTTTTTGGGCAGCGAACACTGATAGACTATCTC^TTTAGACAAAAAGTCAAGC 

TTCTTTTTACTAACTGTATCCTCAGTACTTAGTGCCTGGCACATGGTAGATGCACCATGAACGTTTGTTGAATATATTAATAAAGGTGTAAATGGTT 

TTAACATTCTATTGAGGGAAACAGGAAAAAGATAAATCAGTAAACGAATAAAATAAGTATAAAATTGGGTAACTGCTATGGGAGAAAT 

CTG AGGT AGAGG ATAAGAATG AAGGTGGAAGGGGAAAC CCTCC ATGGGCTCAC TGTGT CTTTCAGAATGAAAGCTAAAAT CCTGAT GTT GGCCT CTA 

AGGCCCTCCCTGACCGGGCCATTACCTCACTGACCTCATCGCCTGCCACTTTTGCTGCCTCTGGTATCCAACTACCTGGCCCCCCTGCTCTTTTTCC 

AATACTCAAGCACACTTCCACCTCAGGTCCTGGAATGTCCTTCCCTGAGACACTCACCATTCACTCCCTCTCTTTCTTCGAATCTCTGCTCA 

CCCCTCTTTGGGC^GGCCCTTCTGACCACCCAGAATTCCCTAACCTCCAACCCTGCTTTATTTTCCTCCATAGGAAAGGAATTTTGTCTTACTTAA 

TTT AAC CTT AGGGC AAT ATTGTC TT AGT CTAGAAATT CAGCT CATGGATACAAAAATCCTCT AGGAGCCGT AACTT CCT GTC AAT CCT AAAATTTAA 

TAAGGCAAATGTTTGGCTTCGAGCCATACTGGGAATAAGGTAAGGCCTTTTGAAAAAGCCACGATGCCC^ 

CCCTACAAATGCGCTCATGGCAAGGCTGGGAATACCCTACACTTACCCTCTCGGTCATTTACTTCAGAGTCTGCGGGGCATTCTAATCCGCGTTGTC 
ATTTGCTTTGTGCCACATTCCTAAGAGGTAGGCAGGGCAGGTTTTATCATCCTCTGTTTACAGATGAAGATAATGAAGGTCACATAGCTCAAGTGAG 
CTACCCAGGGCCACAGACATACAAAGTGGTAGAACCAGAATTTAAATCTTGGTTGGTCTGACTCCAAGTGTAGTATTTGTTTCACTTTTTCTTTCTT 

TCAATATGGAAGGCATCTTTTCTCTAAAGAGTGTGAGCTCCCTGCCAAGCAGCACAGGCGCTTAATAGGCTTTCAGAGCAGCAATTCTAGAACCTCC 
CTTGTGGCTTTCTGATCTCCCACAGCCCTTGTGTTTGATGGTCAGGAAAGGAGAATACCCTTGGATCGTCTTATCTCCAGACCACCCCGAGGCTGGC 
TGGACTTAAGGGGTGAGGAAAGACAAGATGGGTGAGCAGCTGCTTTCGGCTGAAAACCAGAAAGAAATCCCAGTGTTTTTGCTTTATTTTGGTTTAA 
AAATCAAGTaSTATGACAATGGAAACATTTTCCTGGTTGTCCTCTCCTCTCTGCCC^TGTTGACAGAAGACGGACTTGCTAATTCTGGCTGGGGTTT

TTCCCTGCCTGGGCTGTTTTTCTCTTTTATCTTCATCAATTGAAGAGGGCTGCTGTGTCTTTCAAACTGTCGGAAGGTATCTGTATTTTTTTTTCTT 

TC^CATAATTCTGTAGGTCTTTTCCCAGATATGTATTTTTCTCTCTAACTTCTAACTCGCTGAAGTTTCCGCCCATGTGACTTCCAGCGTGAGTTAC 

C AGAAAC CACAAGAGAGAGAGAG AGCGTGCAAGCCCCAAAGCG AGCGACATGTC C CT TTGGGG AGCAGTCCCTCTG C AC CCC AGAGTG AGGAGGACG 

CAGGGGT CAGAGGTGGCTAC AGGGC AGG CAG AGG AGG CACCTGTAGGGGGTGGTGGGCTGGTGGCCCAGGAGAAGT CAGG AAGGG AGC CC AGCTGGT 

GACAAGAGAGCCCAGAGGTGCCTGGGGCTGAGTGTGAGAGCCCGGAAGATTTCAGCCATGCCTCACAGCTCCGACAGCAGTGACTCCAGCTTCAGCC 

GCTCTCCTCCCCCTGGCAAACAGGTAGAGTCCTCCTTTTTCTCTCTCCTACCTTCTGATTCTCCTGGGGGATGGAAAGAGAGCCAGGCTTCCTTGTC 

CTGCCCAGGGAGCTGAGGATGGAGGAAGTGGCTCGTTGCACGGGCACTCTGTT^ 

GAGATCCTCATCCTTCCACCCATTCTTCCAAAGCCCCTTTTTCTCTCC^ 

AAAAGCTTTCTAGGAAGCAAAGAGGAGGATGAACATCAAAGAATGCAGAGAAAAGAGTCTACTGTTCTCCAAGGCTGTAGAAAAGTGAGAGGAGTTT 
TCAAGTGGG C C GCTGGCATGC ACAG CTC AAATCCT AG ATATGCAG TTTCCCAGGAGACAGCC AATAGGGAAGGGGAG AAAAGGGGTTAGGG AAAAGT 
TAAAAAAAAAAAAAAGAAGAAGAAGAAGGAGAGGAAGAAGGAGGAGGAGGAGGAGAAACCCAGGGCTAGCATAGAAGGATCT 

GCC T CTGTTGCT AG C ACAGCCAGCAAGGGGAG ATGCTAGAGGTCCAGC CAATAT CTC TTTGCTTGTGGCTTCTGGAT AGGAGC AAGGAAATGAATGA 
GGAGGTGGCTCTGGAAATTTGTTGGAAGACATTTGCTGGTAGTTCATCTTAGCTCTGGGGAAACCAGAGAGACTGAGATAGGGATTGTGGGAGCATT 
TGG AGT CTAGATTGTGACTTCCAAGT CC AAAGGTTTT AGCTTGAGGAT TACC AAAAAAGGAGTGCCCTGCCCATCT C ACAGCTTCT AGGG TTTTGTT 
CTGCACCACACTCTTGACATCCCTTGCTGTAGGAAGCCCTCCTCTTTCTCATCCGACATGTATCCGTGGGTTAAAATTTTCTCAACCTTCATAGTTT 
AAAAACTTCATTCTTTCCTTAGCACAGATATCCACAAGGGTTTTCAAAATGCTGCAGCCAAAATAAAAGTATATAAATATTGAACAGGGCGTATATC 
CTAAGCCTCTAATAACGGAGCAGCAGGCTGGGATGAGTCTGGGTGGAGACCTCTAGAAGGCACAGAATCCAGTCTGCTGCTCAAGATGGTGTCAGAG 
TGGCTCTGACTGCCTGGAAGTTTGTACTTTTCTTAGGAAGGAAATTACTCATGGCTTCCATGGTAACCTGGTGCCCCATCCCCCTTAGAGGAAGAGC 
ACTTCCTTCCTATCTATGCCAATGACCTCCTACCACAAAGGCCTGAAAGTATTGGACATGTTGGTGAGCCCAGTACTTAGTGATCCCAGAGCTTTGC 
CCTTGGATCTGTGCCCCTCTAGCCTGTGGGAGACGTGGAAGGGAACAGGGCATGAGTTGCCAGGACTCCTAC^CCTAI^ACTGCAGCATCTGGGAGAG 
AACTGGAGCTAACCTCTCTGCCCAGGCACAGGACATGCCTCTCTCTGGGTGTAATTAGAGAACACCCAAAGATCCCCTTATCCTACCAAGCACCCAT 
CTACTGCCCACCACAGCCAGGTCACCTCCACCATTAACGTTCTTTTTGGCATTGATGTGCCAGGCATTGTGCTGGGGCCATGGGGAATGTCACAGTG 
GCTGAGCACAACGGAAGGAAAGAGGAGGCAGATCTGGGATGCACCCTCAGCTCAGGAATGCAGCAGAAGACCACATGGGGTGCAGAGTAGCTAGCGG 
GGAAGGGTCTTGGAAAGCCATTGGGCTCAGCACCAGAAGATGTAGCACTCTCAGATGACCAGGAGTCCTTGGGCACACTCTTAGAGCCTCAGTTTCC 
T C AC AT ATAAAATGG AAATGATG AT ATC AACTTCACAGCATTGGGGAAT TAAAC AAGAG CACCTCTGAAAG AAC AC TTGCCC AGAGGT TGGCATATA 
GTAGGCACTCAACAAAGTCrrGAAAAAAAGAAAACAGCTAAAAGAACAGCAAGAAGAGCCTAAAGGCCAAAAAG 

CTAC AG AGG AG AGGTGATAAAAC AC ACTTCATCG CCTTAACCATTTTTTTT TTTTTTGGC CA AAT ACAGCT CT ATTGAGT CCT CAAGTTCTCTTTTC 
TTTTTCCTGTAGCTATTGGTCAAAAAAGTCAGAGACCTGGGAATGTGTAAGCAGGCATGGAGAAGAGCTCTCCTTCGGGTGCAGGCTTTTGGTAAAT 
AGAGGGATGGAGGGTTCATGCAGAGACGAAGGCATGGAAGGATGGCTGGACGGATACCACACACCACCAGAGCCTGCTTTCACTCAGAAGGGAGATG 
C AG ACC C ATGT AAT CCT CCCCGT CT CTG CTT CCC AGG ACTCATCTGATGATGTG AGAAGAGTTCAGAGGAGGG AGAAAAATCGTATTG CCG CCCAG A 
AGAGCCGACAGAGGCAGACACAGAAGGCCGACACCCTGCACCTGGTAAGTGTTCAGATCAATCTTCATCCTCGTGAGCTTTAGGCTTTGCCCTCCGC 
CATCTGGGAACCCTTGGACCATAGCTTTGATTCTGCTTGTGTGGGGATGTGTATGTGGGGTGGGTTAGTGGAGGGTGAGGGCCGTGGGGTGAGGTAG 
GGAAGGGAAGCTGTCGTGAGAGTGCTTTAAGAAGGCTCTGTAGTCCGGGCGTGGTGGCTCACACCTGTAATCCCAGCATTTTGGGAGGCCAAGGCGG 
GTGGGT C AC CTG AGGTCAGG AGTTC GAG ACCAGCCTGGCCAACATGGTGAAAC CTC GTC TCT ACTAAAAATAC AAAAAATT AGC CAG GCATGGTGGT 
G CATGC CTGTAGTCCCAGCT ACTTGGGAGGCTGAGGC AGGAG AATCAC TTGAACAGGCAGAGGTTGCAGTG AGCCGAGATCG CGC C ATTG CATTCAA 
ACCTGGGTGATAGGAGTGAAACTTCATCTCAAAAAAAAAAAAAAAAAAATAGAAGGCTCTGT^ 

GGGCCTTTCTACCCCAGGAGGGCTCAAGGGGTGGGAGAAGGCAGAGCTGTATTGGGAAAGAATTTAGAAGGTATCAAGTGCTTTGGGGACGGCCCAT 
TGT C TCG CAGTGCAAGG ACC ACACATAGTCG CATCCC TTGGCAGAGACCCAAATGCC CC TGC ATTTGACAC AAGGAAATT CAC AGGGCGCTGT ACCA 
GATTCCTCTTTTCAGAGCATTATTAAAGTCTTATTTTGAAATTTCTGTCTTTAATGGTATCCCAAGGGGTCTATGAAAGATGCAATTTCAAGTGCCC 
TTATCTGGGATGATCACCATCCTAGTTGAGTTGCCTCTAGTCCCAAATTTTTCTAGAGAAAGCTTTTACCTCTGTCTTGAGTGGTCCAGGGGCCTTG 
AGCAGTGCTGAACTGCAGTTCCGCCAGTGGCCAAGGCCCTAGATGGTGTTTCTAGGTTCTGGCTGTGCCCCTTATAAAGTACTGCTTCCTCCCCATT 
GCTCATACCTTACCACCTTACCTCTCTTCACTCTGCCCCCAGTATGGTTCTTGTTCAACCAACTCTTCACTAGTTCCTTTCTCTCCATCCCCTCATC 
CCCTTCCCTAGCAACTCTACCTTCCTGCCCTTGCTCCAGAAACATTCCCTTACCTCTAATAGCCTAGCAATTCCTTTCCATGGAAAAAGTGCAATTG 
G CAC C AAAAATC AATAAGCTTGT ATTTATCTGGAAGATTAGCATG ATAATCAGC C AGTTAGAATGGCTGTGTT AAC CGT ATC AAG AAT AAGTAAGT A 
AGATGTAGTTTTAGGGGAAATAATGCATCTTTCAGCTTTACAAGGAAACAGAGCAGACTGCACCAACAGCTCTGATTGGGTGCTTGGGTTTTATTTG 
AAATGTATTCAGCTTACACTCCATTGCACACTAACATTAAATTCACAGCTATTTGTTAACTTTTTGGAAAGAATGAATGAACAGAAAGTCTGTACTA 
ACTGTGCTCAACAGTCTGAATGTCAAGTTTTTGGTGGCCCTGCTGCTTGTATGTTCTCCATGATTTCACTGTGTCCCCACATTCTGGTTTACCACCT 
TAAGCTTGAACATCTATGATCCTTACCTCTTTCTCTATCAGAGTCTGATTCTGATTCAGTATTCTTTAGGGCAAGCAAACAGACCAATCAAAATTTA 
AAAGGATAAAATCTTCCTTGTGTACAAGTAGACAATGTCCTCAGACTGGCAAACTGCTGTAAGCTTTTTAAGTGTGTACCTGTTTTCTTAAGTGGAA 
TGTAAGTATAGTTCGGCAGCTCAGCAGTCAGAACAGGGATTGAATCCCAGCCCTGCTGCCTACTATCTGTGTGACAGAGACGTTCTTGGCCTCCCTT 
GGG CTTGGT TT C CTT AT CTAGAAAATGG AAAATG ACAATAACTACCCTATACAG ATCGT ACATTT AAAGAGCCC AAC AC AATGCCTGT CATTC ATAA 
ATACTCAAAAATTTAAGTATAATAAGAGTAATAATAATTATCATAAGAAGCAAGATAATCACAAATTATACACACATTTTATATATATATGAAAACA 
TATATACATTTTGTGTATATTTAATCACAGAAGCTCAGGGTTGAAAGGTTTCTTAAAGGCTAGAAAGTTTAGACATAAGTTTGATGCTTCAGACTTC 
TCTGCAAC^CCTCCAGCCAATGGTCCCATTGTATGCCTGGTCACTTCT^ 

CTATGTTAAGCTGCACCCCTGTCTCCCTGTAAATTCAACrCACTGGTCTTCATCCAAACCCTTGGGGAACGTATTAATGAGGGTGACACAGATGTCT 
CTTTGCTGCGGCAAGGCTTTCAGCACTGAATCAGCTCTCTCCCTCTCCAAGACAAATGTCAGAGCCCAAAGAGTCAGCTCATCTAATTATACAAGTC 
ATGTGACATTAGGGTAATCTCTAGAGCTTGAGGGTCCACAGAGCCCCAAGTCATATAGTCAACATATCCATAGC(^CACCCCAGCCTCAGTTTCTAG 
AATGTTCCAATCTTCCCCACATCTGAACACAGATTCCTCCTCCACTGCTTTCTTTTGAGCACCACAGCATCGTCTTTACCTTTTGCCCAATGAGTAG 
GT C GCAGCGGAGTTGATGCAGCAAG CCTG CGGAC TGCTGCCAATGCAA CTTATGT TT CT TGGCTTC AGGTAGAAGCTTTTGGC AGC AACGTGTTCAA 
TCTGCCCTACATCAAATTAGCTGTGGTTTGACATCTTCCTTTCCTCCGAGATGACAATCACCTATCTTCTCCATCCATTATCCTCTTAATTATGTCT 
GAGATTCAGGTGAAAAGTCTTCTGGTTTACCACCAGTGTCAGGGCTCAGTGTGTGGTTTGCCCTTGACTCCTTTCCATGCTTGGCTCCACATGCTGG 
GGT TGT TCT CTGGC CCTCAG ACATGG CC A TAGGC CTCGCCTACCAGTG CTTTACAGATGAGGGTTTGGGAGCTACTGAGAGT ACC ATATCAAAAAAG 
GACCACATTCCTATTAGGGTTCTAGCCTCATATATACTGGTATGTTTTCTTAGTTTCAACTGCTGTCACTATTATCAACCTTAAAATACTGAAACAA 
TTATCTTGTTCCTCTAGTGAGAGTGTGGGATGCTTACTATTCGACATTTTCAGCATCTATCCTATAAATGGCTGGCACCAAGACACTGTACATCGAT 
TGC TTT AAA AATTG ACAGAGTAGGG AGAGAGCCT CTT TTTATGATTGTGTT ATAAAT CC CTGGCCTTG ACT AT CTCTGAC AT AAAATGGATCGGTGA 
TTCCTTTCGTAATTGTGACATCAACAAACAATCTAATTCTGTTTTAATTCTC 

TTTCTCCTGCAGACAACTGTAGGGTTCATTGTATATAGTAC&GATGTTGATTAAAAAAGAGAAAGGC^^ 

TCAATAGCTAATAAGTGGACCTAAGATAATCAGCTGAAAAATTATTTGAAATAATAAGAGAATGCAGTCAGCTTCTAAAATTAAAACAA 

AAC GTT CCTATATG CAAAACATAACC AA TTAGGAAAT TAATGGAAGGAAAG ATC CTG TTT AT AAT ACCAATGACAAC AGC AAATGGAAGGAAGGAAT 

AATT TA ACAAA AATTATG AAGAGTG TAT CCTAAA TGAT TTTT AAAAGAT TC AAAT AATTGGAAAGATAT ACCTTATGTAAAGTTAGCAATTTTCCTT 
AAGT TAATTGAT AAATTT AATGAGAT CC C AATAAAAAACC CAATAGGGTTTTCTGGG ATTTGGGCAAG AACTAGATTAGCTATTC TAAGTT TCC AGT
GAAAAAAAATCATATGACAATTGCCAGGAAAATTCTAAAGAAGAAGTATAATGAAAAAAGATTAACTG^ 

AGAAATCAAAACAATGTGGAAAAGACAGAACATTGAACAGGATAGAATATGGAAACAGACCCAAAAACAAATTCAGATTTTAAGACCCAAATAC 
TTTCAACTCCTAGGACACAAGTATAAATACAAAATGGGCTGGGKACGGTGGCTC^^ 

CACTTGAAGTCAGGAGTT03AGACCAGCCTGGCCAACATGGTAAATGTTAAATTTTTTAAAAAAAGGAAGCTAAAGAAACTTGAAAAGACAAATAAC 
CATTAAAAATTTATCTTGGAATGAGGAATTATTTTCTAAATAAAACATAAAACCCTGAAGCCATAAAGGAAAAGATTGATATACATTACTTTTATTC 
ATAAAAAGTAAAAGTTTCTTTATAGATCAAAATACTGTATGCAAAATCAAAGGCAAATGACAAGCTGAGGAAAAATATCTGTAA 
AAAAGGTATGTTTACTTACTATATGAGAGAGCTCTTATGAATAGGAGATGATTAACAACCCAAAAGAAA 

CACGCCTTAATCCCAGCAATTTGGGAGGCCGAGGCGGGCGGATCACGAGGTCAGCAGATCGAGACCATCCTGTCTAACACGGTGAAACCCCGTCTCT 
ACTAAAAATACAAAAAATTAGCCAGGCGTGGTGGCGGGTGCCTGTAGTCCCAGCTACTTGAGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGGAGG 
CGGAGCTTGCAGTGAACCAAGATAGCGCCACTGCACTCCAGCCTGGGAGACAGAGCGAGACTCCGTCAAAAAAA 

TGAACAATGACTATGAATAGGCAATTTACAGAAAAGAGAAATAAGTGGCCAATAAATGTGTGAAAAGATGGTAGACCTTATTTAGTTAGGAAAATGC 
CAATTAAAG CAGCAGTGAGGCTGGCAAAGATGAAACATGTTGTTGATAC ACG ATGTTGC C AAGGTAATGGAGAAGC AGG C ACC AT CAC AC ACAGCT A 
GTAGTAGAGTCTGCATAACCCTGGTAGAAGGCAATTTGCCCACGTCCATTCAAATTATAGGTGCACATACCCrGCGACTCAGCAATTTG 
GAGTGTGTTCACAAACAAATCC^CACATCATACAATTCTGTACATGTAAAGATAATC 

TAAAAAAAT CC ACC AATAAGGGATT CAAT AAATT ATTGTC CATTCTGTGGAATATTG CAGAT ATGTTAAAATAATG AGTTAC ATC TGAATGTGCTAA 
TCT GGAAGGATT AT CAC TTC TGT ACTTT AAGTTAAAAAGG CAT AAAT AT AGT CC ATAGTTTGCTACCATTAGTTTTTTAATTT AATGTTTTATCT AT 
TTATTTATTTTTTATTTTTAGAGACAGGGTCTCACTATGTTGCCCAGGCTGGTCTCGAACTCCTGAGCTCAAGTGATCTGCCCTGCCCACCTTGGCC 
TCC C AAAGTGCTGGG ATT ACAGGTGTGAGCAACAGCACCC AG CCCCTTCTGTGT TTT TAATAAAAAGAAAAAG TTACAT ATATGC ATAAGT ATAAAT 
GCAAACTTTTTCAAATAGATTATAAAAAATTGTTAATGATATTTATTTCTGAAAAGTAGGACTAAAGATCTAAAGCAAGAGGGAGATACTTGGCATT 
CTT T ACCTTTT ATACAGTTTGGG ATTTTT TATCACATGCT TTTATCTT TAT AAT ATAGATAT C AAAATT CTCAGCAT TG ATC ATT ACC AC ACATGC A 
TTAGGAGGATTTCTTTTCTAATTTTGACATCAAACATTGAATAATCTTCC^TCACGACATGGAAGAGTATTCAATCTTGACGTTATCTTTTGAACA
CATACATAAATATCTCAGCCCTTGGGAGAACCAACTGTAAGTTGGGATTTGAGTGACTCTAAGGTCCACAGTACTATATGATAAAAGATGGCAAATC
CGTTTCCTTTGTGT AGCCATCTTGGTTGGTTGGT C ATGGCTAT CTGGTGCCGTACAG CTGGAAGGATT TCAAGT CAAAGACCAGGATCAG TGG AAAA
GAGAGCTGAGGTCAATT AAG AAGCTTT ATT CATGAAAAGGCTAAAAT C AGTGGAGTTG C CGGG AGC AGTGGC AAATAG CAGTGTTAGCC CTGAGC CG
AGCTGTTGACAAGTCATATCTGGCCTGGTGGATGGAGCTGAAAGCAAGTGAGAGTAACAGCCAAGTTTGAAAGCTTGAGATTTAGGTACTGTCTTAG 
ATGGAAGTGGGCCAGGTGCTTGCCTGGGGCTGCCTCCTGAAGGAGTCTCACACCAGTATAAGAAAAAAGAGAGGTTTACAGGCTCCAGGGATGTGCT
TTGTCATATATGACAGAAAAGTCATGAGTATAAATTATGCATTAAATGAAGCTTTGAGCTAACACTGTGAGAAATACGGTATTTACAATTGGAGAAA 

aattgagtaatggaggaaaagatgtttgggtagcactattgttcttggccttctgaggaggcagagaaggatgcttacggtcctatttcactcctgg 
aattcttttgctctctctgggagaccatgccataatgtttgttggcacagataagtcacaggtcaggaatgttcaggctagttccacccttaaaaag
ttcaatgtagaatcagtgaaccatctatggaaagagatgcgacccttctgtctccattgctatgaatttcttttaagaaaaaacc
caaacgactatgtattaaccactgtgcctcaagtgctctccctgcagcatctcatttaatctgcataaaagctgtattttataattccttgcattaa
gtaacttgatggagatgacacagcttgtgactaggtctgtctgacgtcagaacccattttcctaaccattaagttttactgcctctttcttcaaact
ctgatgaagatgtattcatcttccctaacctatattttccttgaatctctacatgaattaatttttttgtcttttaagcaatatcctccaaaatcta
cactgtcccccagccatcatcaggcgtgatcaagaaagtaaagacatgtttact

atatatatcatgatatcaacatatatagataggtactatatatagatagatacaattggtatatcaaatgatacaatatgtcaattgaaatcttttt 
tgagaagtctgtgtagcaataaatgttaacatctttcacaggaattacctttgacacaacagagtttgaa^

ttcatcattatctgtgaatagtaaataccttaacaatgctataaaacttgacagggggcttacaacagaagatgaagcatattatgatacattcata
caataaaaaaatgtatgctggtactaaaaatgattatgtagatctgtatttatcagcatagaaagatgtcagtgatacagcattaagtgaaaaaggc
aaattacaaagagcatgcagagtataaccctttcacgtaaaatttgtgggttttgggcatggtcgcatgcataaagagatatccggaaggatattct
ccaaaatgctaatagcagttatgagtggttgagtttgggtagttcatcttattcttggtacttttctct

tttaaaaatcagaattttaaaaattaggctattttcatttggggagggaaaacacatcaaggaaatccaccacccctaactgcttgtcttaccgatg
acgggtcggcatggctaagtttgggaggcatattggaaattgggagtccagctccatgctggtgttgatgctgtaaagaacgactgtttgggaggcc
aaggtgggcagatcatgaggtcaggagactgagaccatcctggccaacacagtgaaaccctgtctctactaaaaatgcaaaaaattagccaggcgtg 
gtggcgggcgcctgtagtcccagctactcgggaggatgaggcaggagaacggcatgaacctgggaggcggagcttgcagtgagctgagatcacacca 
ctgcactccagccaaggcaacagagtgagactccgtctcaaaaaaaaaaaaaaaaaaagaatgactgtctgtctacataggatgatggtcctaggtc 
acaaactgacagctgcctgttcagcagatgtcttgatggagctattctcagcagcacccagacttccttctcttgagcagaatgggagtctccccag 
agtgcccttgtgcaagcttctcagaggttttcttactcccctcttttgaacttagtctagtcccctgcttcctaggaggggtggg ttgggtccacag 
tataaaggccccttcagattgtgggatatctgtagagggttccactacgggttcacatgatggcttgtggagggaataacaatcataatagtcatac 
taccttgttgataaaatttcagaattaagtgggaactcagagatcttctagggtgacttgccacatttccaatgaggaaactgaggacgcccaagaa 
cccaaacctcttgccttcccaccagtctgcttttgtttcatgccactttgctgaacagagctttactaggcaggtgtttacataaggcctattgcaa 
aggtgggccaagaagcaggccattaaccagaacacttgaactgcctagacactcatcagctccagaacatacggctatacatgagaccagaaagaga 
gtgagtccaaacttgacaggggctgaagtgtaaatgtttctttgtttaaagagattttaaagttgtaatccaaaattagaaagggcagtgacatatt 
ggggaaaatatttgcatcaaattaaagaaatgaaggtttaatatttatgctacagaaagaaactctt^ 
aagtgggc^gaggcaatgaacagaaagcttacataatatttttcctaccaaaatg 

actaaggcagggtttctcaaccccagtggaactgatacttcgagtccgataattctttgttgttggggattgtcctgtgccttgtaaaatgtaaaat 
ttacccacaagatgccagtagaacaccccctcaccctgcttagtttgataacc^aaaa^ 
tcagtcctagttgagaacgctacactaaggagaaagacacatggaaagttgttctttctccttaatacttc 
aagaagccttcatcaagctaattggcaacaagtaaattatttaaaattataaca 

gctagtggtgacattatatacttgatatgaccccttttcagtaatacttctcaggactgggaagattgaatgcctcctttgcctcattcatttcatt 
cctaggaatttctccaaggaaatcactgaacagagacaaaacccagtgtgcatcacgatgttcatctcagcattatttttaatagtgacaatttgat 
agcaaccaagatgtccagcattagaagaccactttaagtaaattgtggataatcaatttgatgatgcagtcattaaaaattatgttgactatgtaga 
tgtgacaaagttatgacagaaagcaaaatacaaattgtatgtgtactatgattgaaattttgtaaaaataagtatgcatgtttgttcaaaaggatga 
gatggccaagtgggatggttcatgcctgtaatcccaatacatacggaggccaaggcaggaggatgacttgagaccaagctgggcaacagagtgagat 
cctgtctctacaaaatataaaaataaaaataaaaataattagccaggcgtggtagcrtgcacccagctactttagaggctgaggtgggaagatcact 
tgagcccaggaggttgaggctgcagtgagccctgattacatcactgcactctggactgggtaatagagtgagaccctgtgtcaacaaaaaggatgag 

ATCTCAATATGTGTAAATGAAAACAGCTGTATTAAGGGACTAGGATCATGGGCGTTTTTAACTTTTAAAAAATATCTTTAATGTAGTTATACTGTAT 
TTTTTTATTTTTAAAGAAACAGAAGAAAATGGCCAAATAGAGAGCTGAAGGAACTCTGGAGGACATCA^ 

AATT TGGGG ATCAGGTTTTGGTAGG AGA CCTTATAAT TTATA C CCCAAATAGGAATACTTCGAGAGTGG AAGATGG AACTTGT AAATATT ACAC CAG 
GAAAACAGGTGTCCCAGTAAATGTCTGGTCACCCTAGACAGGGAAACCAAGAAAGATGACATGAAGCTCTTATGAAAACTCAGAAGTGATAAAGTTC 
AGAGTCAAGTAGAGGGGACTGGATTCCTATGACCAAAATGTGATTTCCTGCAACTGAGGACCCTGTAACTAAAAAAGGGTACTCAGCTGGCATAAGA 
ATATGTCTTATCATTAAGAGAGTAGGATGACACAGAAAATTATAAAGCAAAGGCCATATTATCAGTACTAGAGCTGAGATCAGCGGAAGTGATTCAT
CTACTCAACAAATATTTATCAGCCAGGAACAGTTCGTTCTAGGCACTGAGAAT^ 

ACAGTCAGTTGGAAGTCAGGCTGGAAAACAAATAAACACACTAATCACATGACATGTAAAGTGCTTC^^ 

TGTCTAAGGGACGGGATTGGGGCAGCGATCAGGGGAGGCCTTCCTGAGGCCCTGGAATTTGAGATCAAAGGTGAATCAAAGAGGAGACTGGAAAGAT 
CTGGGGAGAGGGTGATCTCAGCCTAGAGAGCCTTGAGGCCAAGCTGCGAGGCCAGGCAAAGGTGAG 

AAGACCATAAGGGAGTGAGGAGGAACTCAGTAATAATGGCAATTGTAGGGCTGAGCACAGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCG 
AGTTGGTTGGATCATTTGAGGCCAGGAGTTCGAGACCAGCCTGAGCAACACAGTGAAACCCCATCTCT^ 

GCATGCACCTATGGTCCCACCTACTTGGGAGGCTGAGGCGGGAGATTTGCTTAAGTCCAAGAGGTCGAGGTTGCAGTGAGTTGTGATAGCACCACTG 
CACTCCAGCCTGGGCAACAGGGCAAGATCCTGTCTTTAAAATAAAATAAAATAAAATAAAATAAAATAAAATAAAATAAAATAAAATAAAATAAAAT 
AAAGGCAATTGTGCTGCTAGGAGC^GGAACTGGATGGGGCATGTGGTGGGGACCCAACT 

CTCTGATCATAAGTTGAGAAGC^TAGCACCTCACTAGGCTGTTTGACCGTTTAATTGAGAGGGCAAATGTAAGAACAAAGCAATATTGTCATTTTT 
TTTTTACACTGTTCCTTTGTTTAATGCAGACAGCCTGCCCTTTTCCTAGAAATCAATCAGATATATTATCCTACATGAACACACACGTTCTTCTTTC 
TGGTTGCTTATAAACTCTTTAAGGATATCAGTAGTTTCAGGGGCTTGGCCTTTCACAACTAGGCAAGTCTCCTTTGTTGGGTATGAATGGAGAGAGG 
GTCAAATTTGAACTAGGGAAATGGCACGACGAAGTTCTTAAAGTGGTTGTAATTCT^ 

GGTTTGACCTAGGCCTTTTG AAAAAATGAAT CTCTTGGAT TCATTTGG AAGT AT CCC AG ATT TAC AAAGTAGAAGCTATT AAAGTGTTTG AGTGGTG 
GTTAATC^CTCTCTAGGAATTCTCATGGACATCACAGATTTACTCAGCTGACCCTTCTTCATAGGCACATCTCAGATTTTGGAAAATTATTCCAGAT 
TATAAGTTGGCAGTCAAGC^LACGTGATTTGATCGAACACTTACTCTATGGCAGGCACTGTCCTTGTCACT^ 
GCTCCTGCCCT CACAGAGCTCAAC^TGTAAATGGTAAGGAAACGAAACGGTTAAAGTTAT 

CTTTCCATTATATGAATGAACTGTGCTGGACTATTTAGAATTGTTTCCACCAATTTAAGAATATAATAGTAGTGTGCTTAAATGGACATTTTGTTTT 
AGAAATTTTTTTAACTAGAAATAGTCTCCCAACTCTTTAAATTTTTTAATATTATACTATAAATGATAAAATAAAATACATATTTAATAAAATGAAA 
ATATGT AAAAATAT AAACTCTTT AAGGATAT CAGTAG TTT CAGGGGCTTGGCCTTTCAC AAC TGGGCAAGT CT CACAAAT TT AATTTGTT TGTT AAC 
AAACAAATTTAATAAAATCAAACACTATATAGTTATGTTATATTTTTATTATTTTGAGATA^ 
GCGATCATGGCTGACTGCAGCCTCCACCTCCCAGGCACAAATGATCCTCCCACCTTGGCCTCCCAAATAGCTGGGACTACAGGTGTGTGCCACTATG
CCTGGTTAACTTTTAAATTTTTTGTAAAGATTGGGTATCACTATGTTGCCCAGGCTGGTCTCAAACTTCCGGGCTCAAATGACCCTCCTGCCA
AGCCTCCCAAAGTGCTAGGATTAC^GCAACTGCACCTGGCCTTATGTTATTATT^ 

ACT TAGGAAGC ATTT TT AAAGTCTTGATGCC TGCAAC CACTCCAGAACAATTAAATTGG AAC CTCTGGGGGTC TGGCAT C CAGGAATC CAGACTGTT
TAAGGCTCTCCGGGTGCTGGCAGTGTGCACAGCCAGGGTTGAGAATCAGAAGCAGTACATTTGCAGCCTATCAGGTCTGTACCTGAATATGTGGAGT
GAGTAGAATGTGTGGAGTGAGCAGAATGTGTGGAGTGAGTGTGGGGCMTTGTGTGGGTAAGTACA<^GATACTCTGTCACATACAGTGGCCTGATG
GCTTAAAACAAAACAAACGGCC^GGTGCGGTGGCTCACGCCTGTAATCCCAACACTTTGGGAGCCCGAGGCGGGTGGATCACGAGGTCAGGAGATCG

AGACCATCCTGGCTAACACGGTGAAACCCTGTCTCTACTAAAAATAC^AAAAATTAGCCGGGCATGGTGGCGGGTGCCCGTAGTCCCAGCTACTAGG 

GAGGCTGAGGCAGGAGAATGGCATGAACCCGGGAGGCGGAGCTTGCAATGAGCCGAGATCGTGCCACTGCACTCCAGCCTGGGCGACAGAGCGAGAC

ACGGTCTCAAAAAACAAACAAACAAACAAACAAACTATTCAAACCCTTGCTTTGATGGCAGTTATTATT^ 



TAATCTTGCTAGTGGTAACCTTTGTAACTCGAAGCATCAGTATCAGCCAAGATTGCATGCATTCAGAGACAAGTGAGAGACCAGCCATCAAATGAGG 



AATCAACAAATCATGGCCTACTTTTCTCATGTAACAAGAAGTTCAAAAGTGGAGAGACCAGGCTACAGCAAGGAAGAACCAAGCTCCTTCTCTCTAA 
GCAGCCATCCTCAGTATGAGGCTTTCATTCTCTAGGCTGTGAGATGGGTGCCGAGCCTCTCGGCATCAGTCTATGTTGAAGGAAGGAAGCCTTTGTC 
CTTATGAGCAGGCTGTGCCTTTTTGGTAGGAAGGGAAGCTCTGCCTTCACCTCCCTCCCGCCCATCAGGTACTTCTGCTTACGTATCTTTGGTCAGA

TAGAGGAAGGCAAGGCAGAAGGGGTTTGGGATGGACGCTGAATGAACCAAACTATAGTTTTTTTCACATTTGAGGGCAATGA

AACCAAGGACCAGGCGATAGCCTTTTCACCCTGTGACTAGACAGGTCCCTGAGGGCACGGCTGATGTGGCCAATCTTCTGTAAATGCAAAGGTGCTT
ATGGCCCTAGAGCTTCAGCTCTAAGACATGAAAGTGAAGGTTCCTCCAGAATCACACACGTGC ATTATTCAGCGCAAAGATCTGTGGTTTGGTGATT
CTTCTT CTCCTTTC ATAAGG TGAGAGGC AGCAGTTGGGGTG AATTTC CCCAGG AATGGTGGG AG ATGTGGGC CTAGATCCT C CCGCTGCTTGGGAAG

AGTCGTACTTTTCAGTTGGAAGCCTGCTCCAGGAATGGCGGGCTCCACCCTCTGCATGTGACGTACAGGGGCTGGTAACTGATGGAGCCTGAACATA
T ATCTGGGTGTGAGGGATAAGCTTCGAACGC C ACTT AAGTGT AGGTGGGAGCCAAAAGGT CAC CCTGC C AATACATGATTACATG TCCTG TTGAGGA 
GAAAATACACAGGGACTATTTTCACCCTTCTGGAAATTGTGATTAGAAAATGCy^GGAGAAGGCCAAGACACTAAACTGGAG 

CCTCAGCCCTGCTCAGGGTGCCTGGACAGTCCCCACCAGGGCCCAGCTAGGAAATGCCAGCCTTGTCCTTGGTCTTTGTCCCTAATGGCTCCCGGCG 

GGTG AGGGAACTGCCTGGGGATTCCCCAT CGTTGAAG CGCTCTGTG AGTGACCCCCTCCTAAG CCCCTCCAGTAGGGGATTT C CCAGAATGTTCCCA 

CGCCCCCAGGAAGCCTCTTCCTTAGCCACCCCC^AGCTGCTTGCGGTTCATGGTAGAGGTAGGTGGCATCAAATGACCCCACAGGTGAGGGATTTGC 

CTGAGGTGGGAAAAGAGGGAAGATAAAATTGGAAAGCCACCGGGGCTGAGGAATTATCTCATAGGAGAGAACTGTCAAACAGTATAACGTTTAAACA 

GACTTTAAAGGCAAGAAAGGATGGAGGGCACAGACTCTGAAGCCAGAGTGCCTCAATTCAAGTACCGGTGCTGCAACTTCCAAGCTGCGTGATCTTG 

AGCCAGTGACTTAACTTCTCTGTTCCTCTGTTTTTCCTCTTCTGTAAAACGGAGATAATATTTTCTACCACACAAGTTTGTTGAAAATATCAAATGT 

AATGATAACCATAAAATACTTAAAAGAGTGATTGGCACACGGAAGTATTCAATAAATGTTAGTTATTGGCCAGGCATGGATGATTCACGCCTGAAAT 

CCCAGAACTTTGGGAGGCTGAGGCAGGCGGATCACTTGCGGTCAGAAGTTCAAAACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAGAAAT 

ACAAAAAGTTAGCCAAGTGTATTGGCGGGCACCTGTAACCCCAGCTACCCAGAAGGCTGAGGTGGGGAGAATTGCTTGAACCAGGGAGGCAGAGGTT 

GCG GTG AGCTG AGATTGCACCAC TG CACTCCCAGTCTGGGC^^CAG AG AGAG ACT CTGTCTGAAAAATT AAATAAAC AAATAAA 

AACCTAGGCTATTTTTTTAAAAAAATTTTGGTTTCTATCCACCTTTTTCGTATAAATTGAGCCAGGTTACACCTTAGACCTATTAGATCAGAATCAG 

GCATCAGGATTGCTAAAGCTCCCCAGGGGACCCCAGTGTGTGGATGAGGTTGAGAATCATTGTTTTGGTCCAGTCTTTCCATTTTAAAGTGAGGCAA 

CTGGA G CTGAATAGGGAAAT CGAGTCACTGAGGATCACCCAGCCAGGA CCAGGGCCCAAGTT CCTATTCCCATTTACCGCTGATGGAGCT CACCGTG 

GTC ATG AGTTC AGG ACCTGG CTCTGTGTCTGG AGGAAGGT CTGGC CTCGAGTCAGCCCTTGTCCAGAATATCTGGCCAGGTGACCCCC TCT CCATG C 

AGC C CAGGAGGC AG ATGGTC AGCTCT ATGATTGG AAG AAT GTGAGTGACTC AGG G ATCTTGTG TTGTGTTGTG TTGTGTGTGGGTCAC CAAAACCG A 

GACCCAGCTTTCTCCTGGGAGACAACAGGGTAGCCAAGGACTTATTTTTTTGTTTTGTTTTGTTTTGTTTTTTGAGACAGAGTCTTGCTCGGTTGCC 

CAGGCTGAACTACAATGGTAAAATCTTGGCTCACTGCAACTTCTGCCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCGGAGTTGCTGGGACT 

ACAGGCATGCGCCACCACGCCCGGCTAATTTTTGTATTTTTTGTAGAGACAGGGCTTCACCATG CTGGC CAGGCTGGTCTTGAATTCCTGACCTCAA 



CTCAAGTCTCCAGTCTCTCGGGGGAAGCAGTCAGTGTGAGAGAAGGAACACAGGGCCCCGAGCCAAGCCCTGCTCTGCGCCTCCCTCGCGGGCAATG 
AGTCTTTGCATCAAACACTCAAAGTTTCTGAACTTCTGAGAAAACAAGTGCCTTGAGGTGAATCCTATAGCACAGCTGTCACCAGCCTGTGGTCATT 
TTGGAGGGTATGTGGCATAGGTTTGTCCCAGGGTCTCAGTTGTCCTCATCTGTAGAGTGGGGATAATAACAGCTGACCTTCCCTATCCCACACAGAG 
CGTTATTGTGAGGAGTCTGATGGACCTGGATGAATTCTGAATAGGAGAAAGATGGGTACATGGATGCCCAAATGCAGGCTCACCCCCACCCCTCTCA 
GTAGCAGAAGACATGCATATGGGAGAAGCTGCATTTCCTCCCAGCCTCACTGTGCTCTAGGTGTCC^GCCCTCATTGTTAGGGGATGTCACTGTTGT 
ACCTCTGGATAAACAGCTGCTCTCATGCCACCTTTGCCAGTTGTCCCTTCAGCTGGGCGAGCCCTTTCCAGGGCATGACTGCCCCAAGGAGGGCCAG 
AGGGGACTGGGCAATAGCAGGGACATGGAGGGACGAGGCCCCAGATCAGCCACTTCCTCCTGCAAGGGAAGTCAAACCCTTTCTCTCATAAGTGTCA 
GGCTGGCTGCACGGTCCCTGGGCTGGCAGGGCAGGTTTCTTGGGGGCCCTAGCAGGAAGCACTCTTGCTTCCCAAGGTAAAGACAAGTTTCTTATCA 
CTGGCGGTGGTTTTCTGGTTAGAGGCAGCAGGCAGCGGTTACAAACTTTGCCTAGAGCTCTGGGATC^

G AG CTAGAGTGAGCTCAG CAGAGCTTGCT TAGGG AGG AAG GAC CCG CTGTCACTTOC CG AAAGCTGGG CCAGCC TCTAGGCGGGGAGAAGGTC AGAG 
TGGCCCTCCCTTCTGCCTCGGGGTTTGCCCAGGCTCTTCAGAGCACAGAATTT^ 

AAAAGGGCTAGAGGAAGGAGCCAGGACTCCCGGCCCCCAGTTCCCGTCTTACCAACACCCCCTAGTCTCATCGTGTTGCTTGATCTTACCT 

AAAGAGGGGATGCCCTCCTTCTGGGATGTGCCAGGAAATACCCAAATGGTAAAATGAA^ 

CATCATCGTGGCTCTTGATAACTCCTCTGCCCCGTTTTAGAGGGGCTTCCTTTGT^ 

AATGCCAGGTTGAGGCTGAGCATGCTTGTGTCTGTCCTTCACAGATGATGGAACCTGGCTTGATGAGCAAGGCCATGTTGGTATATGCTTCTCTGCC 
ATTTAGAAAGACCCCCCCCAGGATGTGGGGTTTAAGATGATTGACAGTATAACCGATTAACCTCATTAGCCTACAGCTCATGCACCTGATGGATTCG 
TTAGGG C AAATTTT ATTTTATTATTTTATTT T AT TTTTTGAG AGAGGA CCT C ACT CC AT C AGTGAGGCTGGAGTGC AGTGGCATG ATCATGGCTCAC 
TGCAGCCTCGAACCCCTGGTGCTCCATAGCTAATTTTCTTTTATTTTTATTAT^ 

CTCCTGGGCTCTCAGGAGGTAGAGGTTGCAGTGAGCTGAGATCGCACCATTGTACTCCAGCCTGGGAGACAGAGC^GACTGTCTCAAAAAAAAAAA 
CCCAAAAAACCAAAAAACTGCTGGGCTCAAGCGATCCTCTCACCTCGGCCTCCCAAAGTGCTGGGATTATAGGTGTGAGCCACTGCACCCAGCCTAT 
TTAGGG C AAAT TTT AAATGTC AAGTTAT CTT ACATCTGCT CACCACCC ACCCTATTTTTG AGT TAC CTT CTTC ATTCCAG CAG ATGATGTCATAAGT 
CTGTCAGTCAACAAACATCTGAGTTCATACTATGTGCCAGGCATTGAGCCAGGCATTAACAAATAGGCATTTAATAAGTGCTTGAATAATTAA 
AATTTAAAATAATATTTAACTAGTGCTTTCCAACTTGCCTGATCM 

CTAGTGGCTAGGATTTAAAAAAAAAATTATTATTATTTTTTTAATTCAGCTCTGCAGCCAGGCATGGTGGCTCACGCCTGTAATCCCAGCACTTTGG 
G AG ACC AAGGT GGG CAGATC CCCTGAGGT CAGGAGTT C AAGACCAGCCTGGCCAACATGGTG AAAC CC CGT CTCTACTAAAAATACAAAAACTAGCC 
AGG CGTGGTGGTGCCTGCCTGTAAT C CC AGC AAC TCGGGAGGTTG AGG CAGG AGAACTG CTTG AACCTGGGAGGCAGAGGTTGCAGTGAG CCAAGAT 
CACACTACTGCACTCCAGCCTGGAGGACAGAGTGAGACTGTCTCAGGAAAAAAAAAA 
TCCCAAAACCACTGATTCACATTCCTAGATTTTAGGATGGGCCTGAAATTCTA^ 

GGGAAATACTGACCTAATCCAAATAGATTTAGAAAGGTAAGACCTGGTCTCCAAATATCAGGTTGCCTTTTGAACTGAAATTTTTTTTTTTTTTTTT 

TTTTTGCATCTCCAATGATATTTGCCTTTCCTCTGTGGTTACCAATGCTTAGCAGAGAAAGTCTGTTTGGTTGAGTCGAATAATGTAAACCTTA 
TTATACCGGAATTGCGATCTCCTGGGCTTTCTGAGGATGGGATTGCTTTTCCATAATTGCTAATTACCAGATTGCAATTGTTGATTAATTAATGATG

TGATCAGAAAT CAG CACTTCTTAGCAGGCATT TTT CTGAT TAGCATGT ATTTGATGGTTCCCC CAAGC AAC AGGGAAAAG AG TTT C CAGAAAAAAG A 
GAAGCTCACAGGCTTGGGACACCCCCAAAATACCGTCAGAATGCTTTCTGGTAGGCCGGACAGGGTGGAACAAAACGTTGTAGCAGACTTTGACCAT
TTAGTATTTATTTATTTCATTTTTATTTTTATTTTTATTTTTGAGATGGAGTCTTGCTTTGTCACCCAGGCTGGAGTGCAGTGGCATGATCTCGGCT

CACTGCAGCCTCTGCCTCCTGGGTTTCAGCAATTCTCTTCCCCGAGCCTCCCGGGTAGCTGGGATTACAGGTGTGCACCACCACGCCTGGTTAATTT 

CCTCCCAAAATGCTAGGATTATAGGCATGAGCCACCGCACCTGGCCCCATTTACTATTTATAAAGACCCTGAGCATTCTAGAATCTGGGTGCAGTAG
ACAAGTATCCAGTGACTTTATTTGTAAGACGAAATGCAGAC^CCGGGAGCTTTTGGAAAACACGTGGCATAACACAGTGGAAAAGAGTGAAACTGGT 
GTAAGACGTTCTCACAAACGACCTTCCATTTCCTTAGGCTCTGGCCTGCCAGCTGCTTTCCCTTGCTGTATTCCCGCCACCTCCCACCCACACACCC 
TCACAGAGACCCCTACATGCATACACGCAGAGGAACACACTCACACACACCCCTCTATCAGGTTGAACCATATGAAATTGCTTTTTTGCCGGGAGGA
GAGATGTAGTCTTGCTGTGTTGCCCAGGCTGGAGTGCAGTGGCTCAATCTCGGCTCACTGCAACCTCTGCCTCCCTGGTTCAAGTAATGCTTCTGTT
TCAGCCTCCTGAGTAGCTGGGACTACAGGCGCACTGCCACACCCAGCTAATTTCTGTTTTTGTArTTTAGTAGAGACAGGGTTTTGCCACGTTGCCC 
AGGCTGGTCTCAAACTCCCCACCTCAGGCAATCCGCCCGCCTCGGCCTCCTAAAGGGCTAGGATTACAGGGGTAAGCCACCGCGCCCGGCCTGAAAT
TGCTATTCTTATAGGTCACAACAGACAAATATCAGCAGCTTCACGTGGTTCAGCCTCATATCTATGTGTATACATGCACACCCAC
GTAAACGCGCCCATACACACCTACAAACATATATACACTTGTGCACACATTCACACGTGCAGGAAAATTCTCTGGTGCCCTTCTCCATGGCTGTGCT
AGTGAACAACTAATCTGGCCCCTCCTGTGTCCCTCCGCACCCCACCCACCTTGCCTCCTTCCTAGACACTAACCTCCGGTGCTGATCCCCACCCCTA
CAGGAGAGCGAAGACCTGGAGAAACAGAACGCGGCTCTACGCAAGGAGATCAAGCAGCTCACAGAGGAACTGAAGTACTTCACGTCGGTGCTGAACA
GCCACGAGCCCCTGTGCTCGGTGCTGGCCGCCAGCACGCCCTCGCCCCCCGAGGTGGTGTACAGCGCCCACGCATTCCACCAACCTCATGTCAGCTC
CCCGCGCTTCCAGCCCTGAGCTTCCGATGCGGGGAGAGCAGAGCCTCGGGAGGGGCACACAGACTGTGGCAGAGCTGCGCCCATCCCGCAGAGGCCC
CTGTCCACCTGGAGACCCGGAGACAGAGGCCTGGACAAGGAGTGAACACGGGAACTGTCACGACTGGAAGGGCGTGAGGCCTCCCAGCAGTGCCGCA
GCGTTT C GAGG GGCGTGTGCTGG ACCCCACC ACTGTGGGTTGCAGGCC CAATGCAGAAGAGTATTAAGAAAGATGCT CAAGT CCC ATGGC ACAGAG C 
AAGGCGGGCAGGGAACGGTTATTTTTCTAAATAAATGCTTTAAAAGAAACCAGTCTGACAAGGCAGTTTCTCTCTTTCACCCTGTCCCTTGACACTC 
TAAATTCTGCGATGTTGGGACCTTCTTTGTTTTTTGGACTCTAGTACCCTCAAAGAGCTCTAGGGACTCCACTAACAGCTCCTCTTTTCCArGGTTT 
ATTGATGTACTTGGGAAGCTTTCAATTCCTTCTCTAGGTGTCTCCTTCTCCTTCTCCTTCTCCTCCTCCTCCTCCTCCTTTCTCCTCCTCCTCCTCT 
TCTCCCCCTTCTCCTCCTCCTCCTCCTTCTTCTTGACAGAATTTCTCAGATCTTGGCTCAGGCTGGAGTGCAGTGGTACGATCTCCGCTCACTGCAA 
CCTCCGACTCCCAGGTTCAAGAGACCCTCTTGCTTCAGCCTCCCTAGCAGCTGAGACTACAGGCACGCACCACTATGCCCAGCTAATTTTTTTATTT 
TTAGTAGAGACAAGGTTTCACCATGTTGGTCAGGCTGGTCTCGAACTCCTGGCCTGAAGTGATCCACCCGCCTGGGCCTCCAAAGTGCTGGGATTAC 
AGGCATGAGTCACCATGCCCAGCCTAGGTGTCTTCTGGATTGCCCTCCCATGCTCCCCTCACGGTGACCTCCCACCTTCCATTCTTCACTGCTTTCT 
AGACCATGAGCAGGTTGTCCCCTAGTCCGGGAGAAGGCCCAGTGCAGGATGGCCTCACCTGGTGAGGTGGTAAGTCAGGTTGACCTTCTCGTCTCCC 
ACT CAT GACTC CACT CAC C C AGC C AACTGG CT CCATC CTG ACAG CC AAGT CT CCGGATGCTGC AAC CAGGC CCTGTG CCAGGCTGCCCTTC ACATGG 
AAGGCGGTGTCTTCCTAATGCTCTCAAGGGGCAACATGCAGCTTGGGTCCACAGGGACTTCTCTGCCAGTGGTGTACGTGTCCTGCTGAAGACTCCA 
CCTCCCTTAGAAGGCGGTGATACAGAGCACAGGACTTGTCCTGTCCCCGC^GGGCTCCGCAGGCTGCCATTTGAGCACTTTGAGGGerCAGGGAACA 
CAGGGTCATCTTCAGTTTGGCAGAGCCTGAAACCAGCATGCTCCACCCCCCTGCTCTTCAGAGATCCCCAGGCCAAGTGCCTGCTGGCCTGCCTGAC 
ACAGCCTTACCCACACCACTGCCCACGAGCAAAGCCTGCAGCAGGGCCTGGCC^CACACAACCCCTGCTGAGCCAGCTGGAGAa^AAAGCAACGGAG 
ATCATTGTGCCAGGCGCCCACCTGATTGAGCTCCTTCAGTAGCCACCCTCAGTCTCATCCATTTGTCCTTTAGAGGGACAGCCTCTGCCTGCTGCCT 
CCCCGGTGGCATCCCTCATGGGGTGACAGAAGGGAGGAGAAGTCCTGCTTCCCTGTGTCAGTGCCCCCTGAACCCGCTGGCCAACCACTCTGGCCGG 
TTTGAGGCCAAGTTTCAACTCTAGACCTCCCCTGTGTGTGTCTCTCCTGCCCCTCTCAGATGATGCCAGTGTTGGGCTATTGTGATTATTAAATGCC 
ATAGAAGTTTGGATGAGCTGCATCCCAGCAGGTGGTGGGGTCACAGCCCCTTGGGAGGAGTCCAACCOUVCAGCCTCTGATGGCACATTGCAATCTT 
TGGGGCTCTCACCAGAGAAGCTGGTGCCCATGGGAGCCTAGGATGTGGACACTGGGCTGAGCCCCGTAATTCAGAGCCFGGGAAGACAG 
ACCTTCCCTGAGTGCTGGCATTTCTCCCAGCACAGAATCTTGCATTGTAACCTTTGGTCTCCCAACGAAACGTGGCAAAACAGGCTCCCCTTTCAGT 
GCAGAGCCTCCATGTGGACTGGGCAGATGTGTCATGAGGAGGAGCTTTCTTAATTTCCTTCAGAACCTGGATTTTGGTAGCTAACGCTCTGTCTAGC 
TCTTTCCCACCCCTCTGTTAACGAGAAGAGCAAAGAAGCCGATCTTCACTTCAGATGTTGCAGTCATAAAATATAATTCATTTTTCTTTCATAATTA 
TTATTTTTTCTTTTTTTGAGACACGGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCAATCATGGCTCACTGCAGCCTTGACTACCCACACT 
CAAGCGATCCTCCCACCTCAGCTTCCCAAGGTGTTGGGATTACAGGCATGAGCCACT^ 

GAGTCCAGCTTTCTTTGCTACTATCTCTGCCAGGGCCTGGCCTCTTGTGGGCAGGCTTTCTTTGAATCAGAGATTTTCATCTCCACCCATGCTCTAG 
CACTGTGATTTTCCCCAGAAACCCCCACAGAAGGCACATGACTTCTGCCGGCATCTTCTGAATGACATTCAGGGCCAGAACTAGAAAAGCAGAGTTC 
ACGTGCACGGTTCCCATCTCTCAGCAAGGACTTCTCTTCCCCTTTCATTCTCTACTTTCTACTGACCCCAGGCAGTTCCACTTCTCCTGTCTAGTCA 
GCCCGGTAGAGGTAGCAGGTAGCTGAGCCTGGAGCTGGTGCCAAGGCTGACTTCATGTCCATGCAGGCAAATTGGAGAGGCAGCCATGAATAGAAAT 
TTGAACATCCGTAGCTTTAACCTTGGATGTTTCACCAACCAGGTTCTGTCCCAGCCTTCTACCCATGACAGGGATGGCTCCCTCAGGCTGGTGGGCC 
ACTTGACCCACACTCAAACGCCTTCTACCC&GGCAGGCACTGTGAAAGCA^

AAACTAAGGAGAGTGTGCAGCTGTCTGTCGTTCAGTTCTCTCCCATGGTGCCATGTGGGAATTTAGGCCTGGGCTTGCCATGTCTTCCAGTTTGTCA 
AGAAAAGCCAAAATCAGAATTTCTACATGAAATCTCTCAATTCTTTTTTTCTT^ 

TCCCTCCCTCCTTCCCTCCCTCCCTTCCTTCTTTCTTTTTTTCTTTCTTTCTTCCTTCCTTCCTTTTTTTCCTTTCTTATTCCTTTCTTCCTTTCTT 
TCCTCCTTTCTTTTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTCTCTTTCTCTCTTCTTTTCTTCTTTTCTTTCTTTCTTTTCTGA 
CAAGGTCTCACTCTGTGGCCCAGGCTGGAGTGCAGAGCTGCTGTCATAGCTCACTGCAGCCTCGACCTCCTGGGCTCAAGCAATCCTTCCACCTCAG 



TATTGCCCAGACTGATCTTGAACTCTTGGCCTCAAGCCATCCTCTCATTTCAGCCTCCCAAAATGTTGGGATTACAGGCATGAGCCACCACATCCAG 
CCCATGTGTCTAAATAAAATACTTAAAACATTATAAATCAAGCTTACAAGTTGTTAACAAAACACATTCTTTTCTTC 

CAT CAC AAC CTAAAAGGC C AAGTTTG AAC TCAGAATTCTT AGAC ACTTC AAAATTTTATATTAGCACAAGGTAG TGAGAAAGAAGC AGTAC CCGG AC 
TGCAGCTTGCCGCTTTCTTGCCCTCCCCTGGTTCTTCCTGTACCACAGCAGCCTC<^GTTCATGCCGGCGGACATGCCAGCCTGGAAGACCAAATTC 
CGTCCACATTCACCCCCACAGGACCCATCCCTTGGGTCCAGAGATGCACAra^ 

C TT CAGCTC CTGGAAGCCAGCCC AGGGCCAGTTACTAAGGGGATTCTGGTGTCCTTGGAATG TTGGGC ACAGT CTAAAGGGGGAC ACAGGGTT CTAG 
GTAGGTGTCTCCTTAGTTCTAAGGACTTTTTGCCCCAACAAGAGGAGCGTAGCAGGAGGAATGGC^ 

GCAGAGACGTCTTTTGTC<^GGTCTGAAGGTGGCACTGCAGAGACCTGCCCATTTGCAACTCTGATCTAGGTCAGCAATATCTCCTGGAAAAGCCCA 
GGCTACAGG AAGGC CCTGTG TAG CAAGAAATGGAC AC AAGAC AGCGTCGTC AAC ATC TGTCAC ATTGAAATGCGGGTTC CTGGAG AATGTTAGTTTC 
CCCACCTGCCATCCTCTGCATTTATCTCTTACTAGCAGTGTCATTCCCTGAGTGAGTGTCAGAGCAAATTACGAGAAAGCCAGATCTTCAGTCCTCA 
GGATGAGAAGTGTGGGTGGGGAGAGAAAGGGAGGCAAAGTCAACTCAAAGGTCACGAAGTTGCCGTCTGCTGGACCAAAACTTACCAGCAGTTTATT 
AATACCTCCAGCCGATGTTACAGTTGTGGCTTTACGGGGAAGGAGGTGAGACTACTGGATGCCCCACTTCTCTTGGGGACAGATAGGCTATCTCTAT 
TTCTCTTCCTTCTTCCCCACGGGTTCAGTGCTCCAAAGCTGCTTGGGAATTCCAAGGGGGAGAAAGGGCTAGTGAACTAAGATCAACACCAACGTAG 
TAAAGATTTAATACTCTTGTTTCTGCCAGAGGAGACTGTCATTTTAAGAGACACCTAGGAGCTCAAGGCTATAGTGAGCTATGATTGCATCACTGCA 
CTC C AACCT GGGCAACAAAGCGAGATGCAAAACCC TGTCT CTAAAACAAAAAAG AGAGATGC CTAGAG AAAGT AAC ACT TGAC CC AAAGG AAAGAGT 
TTTTATGGTAGCATTTTAGGCTTTGTAGCAAGTGAGCAGAGGAGATATTTAATGGAATTCTTGCCATGGGAATGACATGTGACTGCATTGGCCTCAT
GTAGATACCAGAGGAGTGGGAGATGGGGATGGGAGGGAAAGAGAGATGTAGCCCTTCTAAAACTCTTGTGGGTTTGTGTGTGTGTGTGTGTGTGTGT

GGTGTTTCACCATATTGGTTAGGCTGGTCTTGAAAAATCTTGTTTTTTAACCACCTGATATGTGATTTAATTTAATCACTTATAGGAAGATAAAATA
AAA ACCTGATC CTG GTT TGAACAAT CTACTGGTAGGTAGAGATGAAGAGCCCCC TTTTAGAGGA CTT CAATAGGAAAATTTGCACAAAT ACCTTTGC
TACCCAGGAGGTTTCTTTGGCTTCCTCTGGAAGATTCCTGTAGACAGATTGCTGAGTTTGTCATTACACCTAGAAACCAAGGCTAGGAGGGGTAGCT

gagactgataattgccctcccaaagctattcttccatttttccttactacggaactcttgttttgtctaAgatggcaatatgcctagagaaaagatt

acatatcccagattcctttgatgtttgggatggccatatggcacagttctggccaaagagatatgcaagaatttgctaggtaagacttttatgggaa

agctcttcaaaaggtggggctgtctcagttggcacatgcctttctcctcctttctgtctggaaggcagatatggtgttagaggtgaagcaaccgtgt

acacaaaggaggaagagattctgcgtccctggtaatactgcagaagtgtgaagatgatagaagctccagc^

gaaagtggaaaagagtttatttggttacatcaccttgagttgggttttctaagtgcagcttaactcaattctgtctaatgcaggagccaatggaaca 
accaaataagg aatttaaacaattcccctagctc agtggttc ctaactgggggtgatcctccccattccccaagatatttgacaatgtctggagata 

tttttagttgttacagcttgggagagaggtactagtaacatctagtgggaagaggccagaaatgttgttgctaagcatgctgcaatgcacagggcac

accccacaaacaataatccagcccaaaacatcagtagtacagaggtggagcacccctgcagctggccctagtgcttctagggcaccagcccatcagg

ccaccaacctaccctgacacgcttgtccatccctctctcagaaacaaa ccctgtgtattgac caggctgaaaaagccacgtccttcaccgtatctgg

cactttcctctctctaagcctttgctcatgcatcatttcctctgcctagaatctccctctcatcttttcaggtggggaccatgacrcttcatttttt

ttttttttttttttgagatggagtcttgctcttgtcacccaggctgcagtgccgtgacgcaatcttggctctcggctcactgcaacctacacttcac

aggttcaagtaattctcctgcctcagtctcctgagtagttgggactacaggtgtctgccaccacacccagctttgctttgtgttgggaacattcaaa

atcctctcttctagctatttgaaaacatacaataagttattgttaactgtagtcaccc^tcagtgctaagacactaggacttattcctctcatttag

CTGTAATTTGGTATTCTTTAACCATCCTCTCATTATTTCGCCCTCATCTCTGCCCTGCCTTTCCAGCCTCTAGTAACCACTATTCCACACTTTACTT
CTATGAGTTCAACTTCTTTAGCTTCTGCATGTGAGTGAGAACATGTGGTGTTTATCTTTCTGTGTCTGGCTTATTTCACTTAACATAATGTCCTCCA 
GGCTCATCTATGTTGCTGCAAATGACAGGATTTCATTCTTTTTTATGGCTAAATAATATCGCATTGTGTATATGTACCACATTTTATCTGTTCATCT 
GTTGGTGGATGCTTGGGTTGATTCCATATAGTACTTTGCATTGTTGCTTTTTAATGTGCATAGAAGATAAGGAATACAGGAGTAAAAGAGTTTGGAG 
GCAAAAACTCAAACTGTGTTTTTCCTCTGCTCTCACACCACTCTGCAATCAACACAGATGACTTCTGTGACCAAATGTGGGGAGTTTTCCCCATACA 
CCGAGCAAGCAATCAGTTCTGCAGCAGACACCAGCTGAGTATCCTCCAATTCAGTTCTGACCCTGTCTGCCTGGAGATAGCATCAGCTCCTACAGGT 
TGAGAGCTCAATCCCACAAGACTCCAACCAACTTCAGATGCCAGTCGCAAGTCCAGGCATCCAGAACTTCTGGCCACTGGCTTCAAGTTAGGTTCCC 
ACGACCCTCTCTTTGGATTGGACAAATGTGCTAGAATGGCTCAAAGAACTCAGGGAAACACTTACATTTACTGGCTTATTATAAAGGATATTGCAAG 
GGATACGGATGAGATGCATAGGGTGAGGTACGGGGGAGGGGTGCAGAGCTTCCATTCCCTCCTTGGGTGTGTCATCCTCCAGGAATCTTCTTGTGTT 
CAGCTATTCAGAAACTCCCCAAACCATGTCCTCTTGAGCTTTTTATGGAGACTTCATTGGAGAGCCATGATTGACAATCATGTAGAAATGCAACTGG 
ACAAAAAGGGTCCAATTACATGATCTAGTATGAGTC^CATGATCTAATACTAACAGGCTAAGCAGAGAAACCCAGCAAGACCTGTCTGTGCAGATTC 
TTTTTGGCCTCTCTGTGTAGCATTCCTTCCTTCAGGGTATGCAGCAGGACTGCTTCTGAAGTGAGGGTCTTATGACCACAATCAGAAAAGGGGAGGA 
AGATTAGGGTCCTGCTTTGCACTAGGGAAAGAAGGGCAGAGAAAAGTCAGAGATAAATTTTGTTTTTTAAGGCCTGCCTCTGAGGACTGAAGTGCCC 
AACATTATTAAAAAAAAAAAAAAAGACTGTAACAAGGTGTTATATCACAGAGATAAATAAGACCTCTGTGTTATCCAAACCCAAGATTGGCTGTACT 
CTTCTTTTGCTTAAAAAACATATATATTTAAATTAATAAAGCTTGGCAAATATAGAGTATAGACACTGAAGTTTGGAGGCCTACAAGCATTCATGAA 
ATTT CATCT TTT CATTTT CTCAATAAAGC ATCTCTCTCAATC C ATTTGTGT TGCT ATAAGAAAAT ACCTGAGATTGGGCAATTTATACATAAAAAAG 
ATTTATTTGGCTCTATGGTTCTGCTGTCTGGAAAAAATCGGGCACCCGGTGAGGGCCTCAGGGTGCTTCCACTTGTGGTGGAGGTGAAGGGGAGCAT 
GTG TGTG CAGAGATCAC ACAGTG AG AGAGGAAGCAAGAAAGAGCAAGATGGAAGGGGCTGGGC TCT TTTTAACAATC AACTCTTGTGGGAACTAATA 
G AGTGAGAACT C ACTCAACCCCC ACCCC AGGGAG AACATTAATCT ATT CATGAGGGGTCTAC C CC ATG ATC AAAACT CCTCC C ATT AGCATCC C ACC 
TCTGAGTTTGGGATCAAATTTCAACATGAGGTTTGGTGGGGTAAACATCCAAGCTATAGCAGCATCC^TGTGTGGAAAGTTAAGAGCATCTTTTTTT 



TCAACCTCCTGAATAGCTGGGACTACCAGTGAGTGCCACCATGCCTGGCTAATTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTTTCAAT 
AGAGATAGGGTTTCACCATGTTGCTCAGTCTTGTCTTGAACTCCTGGACTCAAGTGATCCACCTGCCTTGGCCTCCCAAAGTGCTGGGATTGCAGGT 
GTGAGTCACTGCACCCAGCCAAAAGCATCTTATCTTCTTTCTCTATCTTGTGGCATGTGAAGATACTGATACTAAACTTTGCCMTCATCTTTCAAC 
TTG CAGC CATACAC ATAGGT ACC CATTC CTAGGG ATC AGCTC TTC ATATCTGGGCCATGGGAGCTG TG AAATT AGATTGCGAGTCACATT AGC ACC A 
CGTCAATCCTCTCCACCCCAACCACCATTAGCACATCTTCTCTAAGTGAAGTCAGCATTATTCACATAAAATGTTATCCATTGACTACTGGTGCACA 
GTGTGATTGGGGTTTATGAGTTACTCG^GGTGATTATATGATGAAGACTCAATTCTTAGTGGGGGCGCTGGAGACCTTGCACCCTCTGTCCCCCAGC 
TCACTTCATATCACCCTCTTTTCCTCTGCACTGGCATCTGTCCCTTCCCCAAATCTCTGCGTGCTCCCTCCAGCCCAAGAGCCTTGCTCATTCTGCT 
CAGCCCTAACTGAAATGCTCTTACCCTTCTAGCTCATTCATTCCTACCTTTTGTCAAATACCAGCTTGGGGAAATTTTGCTTGATTCCCAGGCTAGA
TCAGAACTGCAATTTCAAATAGTTTCATAGCACCTC^TGCCCTTCCTTTATACCCATTAACACAGACTGAAAATTTACATTTATCTATGTGGTTATT
TGGTTAACATTTGTCTCTGTGACTGGACCATCCATTCTATAATGTCAGGGATTGCCAATGTTTTGCTCACCATTGTATTCTCAGCTCCTTGCACAGT 
AGC AAAGAG CCTACCACAGAGTAGAAAT TGAATAAATATT CATTGGCTGAATGAATAAATGGATGGATGGATGGAAC AC AGTG CCTGC CTTCTAGG A 
ATGTAAAAT CC AAGC CAATT ATTATT CTGAC GATGGAATACAAATG AC AAAATAAAT AT TCAGTT AACATATAAAAAGT AAGAAT C AATC AGAT ATT 
TTAAGAGCAGG AATCAAGGGAAGG CATGTATTC TTGTG AAG ATG 

HUMAN SEQUENCE - mRNA 

C AAGAGAGAGAGAGAGGTGC AAGCCC CA AAG CGAGCGACATGT CC CTTTGGGGAGCAGT CCCTCTGCACCCCAGAGTGAGGAGGACGCAGGGG TCAG 
AGGTGG CTACAGGGC AGGCAGAGGAGGCACCTGTAGGGGGTGGTGGGCTGGTGGC CCAG GAG AAGT CAGGAAGGGAGCC C AGC TGGTGAC AAG AGAG 
CCCAGAGGTGCCTGGGGCTGAGTGTGAGAGCCCGGAAGATTTCAGCCATGCCTCACAGCTCCGACAGCAGTGACTCCAGCTTCAGCCGCTCTCCTCC 
CCCTGGCAAACAGGACTCATCTGATGATGTGAGAAGAGTTCAGAGGAGGGAGAAAAATCGTATTGCCGCCCAGAAGAGCCGACAGAGGCAGACACAG 
AAGGCCGACACCCTGCACCTGGAGAGCGAAGACCTGGAGAAACAGAACGCGGCTCTACGCAAGGAGATCAAGCAGCTCACAGAGGAACTGAAGTACT 
TCACGTCGGTG CTGAACAGCCACGAGCCCCTGTGCTCGGTGCTGGCCGCCAGCACGCCCTCGCCCCCCGAGGTGGTGTACAGCGCCCACGCATTCCA 
CCAACCTCATGTCAGCTCCCCGCGCTTCCAGCCCTGAGCTTCCGATGCGGGGAGAGC^ 

CCC ATC C CG CAG AGGCCCCT GTCCAC CTGGAG AC CCGGAG ACAGAGGCCTGGACAAGGAGTGAACA CGGGAACTGT CACGACTGG AAGGGCGTGAG G 
CCTCCCAGCAGTGCCGCAGCGTTTCGAGGGGCGTGTGCTGGACCCCACCACTGTGGGTTGCAGGCCCAATGCAGAAGAGTATTAAGAAAGATGCTCA 
AGTCCCATGGCACAGAGCAAGGCGGCAGGGAACGGTTATTTTTCTAAATAAA 

HUMAN SEQUENCE - CODING 

ATGCCTCACAGCTCCGACAGCAGTGACTCCAGCTTCAGCCGCTCTCCTCCCCCTGGCAAACAGGACTCATCTGATGATGTGAGAAGAGTTCAGAGGA 
GGG AGAAAAAT CGTATTGCCGCCCAGAAG AG CCGACAGAGGCAGACACAGAAGG CCG ACACCC TG C AC CTGGAGAG CGAAGACCT GGAGAAACAGAA 
CGCGGCTCTACGCAAGGAGATCAAGCAGCTCACAGAGGAACTGAAGTACTTCACGTCGGTGCTGAACAGCCACGAGCCCCTGTGCTCGGTGCTGGCC 
GCCAGCACGCCCTCGCCCCCCGAGGTGGTGTACAGCGCCCACGCATTCCACCAACCTCATGTCAGCTCCCCGCGCTTCCAGCCCTGA

TABLE IX



MOUSE NOMENCLATURE 
ICSGNM Irf4 
Celera mCG4922 

HUMAN NOMENCLATURE 
HGNC IRF4 
Celera hCG20902 

MOUSE SEQUENCE - GENOMIC 

ATGCTCATCATCCTTGAGGGCACACCTCCCTTGCTTTTGCCCTCTGAATCACAGCCTTGTCCTCCTTCAGCATGTCCTATCTTTCCATATTTTACAC 
CTCTATCCAGGCTGGTTTTTGCAAATTTGCCTGCCTAAAATTCCAGTCGAGCTGTATCACTTGTAAGGATAAACTACTAATATATATGCCTTGCCTG 
TTCTCTTTCACTCATGAATGAACAGTCCTCACACTCCTATCCCCCTAATATCTAACAGAGATGCAGCTCCCAAACTCTACAGCCTGAATCCCAGAGC 
CTG AGCT AGCC C ACG AATGC ATC CA ATC CCTGTGGTG CAGAAGAAC CC ATGCTCTGT CC AAATCAATC ACCTAAACGGTTTAGAATTTCTCTG TGGT 
CTTCTCTTTCCAGTGCCTAGGTCTCCACCCCTACCTCTGACTGTTTTTTCTTCTACTAGCATCCCCTGT(^TACTCTTAAGATTTCTCAAC(^TTCT 
AGTCGTAACCTAGCCTGTTCTTTTCCTTCATGTCTTATATTGAAATTCTGTGACTCTGAGGTTC^GAACAGAAGTCTTCTTGCTGGGGTTGCCTATG 
GTTCTCTCAATCATTTCCACTCCTTCCCTACTTGGTACTTTCCCAGAGAATGATACCTTTTGGGCAGTGAGTGATATATACCCAGTACTAAGCTGAG 
TCACTGTCAGCATACTCTTTTAGGCAACCCTGGGTCTTGGCCATGCCTCATTGCCTTTTCCTGTGCCTCACAGGAGATAGAATAAATCACAGGACTT 
TGGAGAACGTCACAGTGGTCCATGCAGTTGACCCTTGTAGCTTGAGCTTTGGGTCTCAGAGTGCCAGAGCAAGGGAAGATAGAGCACAAAGATGTGG 
AGATGTGGAGT CAC AGC AAC AGTGT CACCAATTCTAG AACTATAG TTG CAAGCACT ACC T CTGCT GAAGGC AGTTGAATACGAG ACAACAT AT CATT 
GGCCTTTGGAGCAATGCTCAACACCACCCACGGCAGGACACTTTGCAGAGCAGGGACATGGAGA^ 

GCCTCGTAGCACAAAAGAGAAAGGAAGAGTGTATGAAGGGCAGGCTGGATCTTCCAACACAAGAGTAGGGGC^CCAGCTTACTTGTCTTTTAAAGGC 
CTGCCAGCACAGTGAGTCCACATGCCCTGGAAAGTCATGTTTCTTCTTAGGT^

AAAGACAGTAGATGGTTCTCCTTGTAATTTTTAAGTTTCATCTGATGTTGAATTACCCCCCACCCCCCAGGATGCCTGAGTTCTACTCAATAAAGGA
G GGGGCTAATTTAAAACATATTAAAAGTCAACAC!ATTAAGTTTCCAACGTT^ 

ATCCATTGTTTTGCATCTGAACACACTATC^GGGACTAC^GTGCACCATGACTGTTCTAGGTCTAGCAGAGTGTATCTGGTGAAGAAGTCTGCTAAG
AATACTCGCTGCTGGCCCTCAGGGGCTTTTCTTCTGTGTGGACTGGTTTCTTTTATGCTCAGTGGATTATATCAAATAAGAACAATATATTCTGTAA
TCACAAGTAGTCAAGCCTGTGTACTTATGCAGAGGAAACTGCCTTTGGATCATGAGTTTGTAAAGAGTTGAGGGAGAGGCACAGTGTTTAGATCAGA
AGGTGATCTCCAGATGACGTTTTACATAGCCCTGGTTAGTCTGAGCTGAGGAGGGGCCAGGTTTCCTTCAGTCGTGCTGAAGACTTCAAAGGCTGGG
CAAAGAGAC CTTTATAAGAAGATGCTTCTCAAGGCAC ATGAGAGAAAT AGTACAAGGAGACCCGGAAGGTAGAG AAAGAGGAAAGGAGAACAGACTG
GCTCACATGCTCAGAAGTGAGGACGACTCCATCTCACAGGGGAGTGCAGGGCATTTCTGAACAAGTTTGTGaVATTCATACTCCTTGTGGTGGCTAC
CTGATATGAAACAACAACACATTTTATATAAAATAATCTAAATCACATGTTCCTTTAGAGATTGCCAAAATGTTTTGTATACTGTCGAAAACTAAAA
ATTGAATGTCCACAACATGACCTATGTTCATGCCTACAACTCCATTAGTACTTTCACTCAGTCTTTAACGTTTTGTATACTGTAAAAACTGTTCCTA
AAATTTGAATGCCCCCAACATGAGCTCTGCCCGTGCCCATAGCACCATTAACACTGCCACTCTGTCTTTCCAGAATTCCAAAGCTTGAGACAGAATG
CCTGAGGAACATCAAAATGCAAAGCTTGATGAGTACCTGGCCTTCCTCTGTAGTAGCTTTAAGGGAGACAAAACAAGCACCCTTTGATGGACTCCAT
AATTGTTTTACTTACTCTTGGACCATGGAAACAAACTACAGATGGGGCCATGGGAACAAACCACAGATGGGGCCTGAAACCTCCTCTGAGAAGGGAC
AAGTGGAAAAGGAGACACCAGATGCTCGAATATCACATTAGTTTCTCTGTGAAAGAAGGTGGTCAATGTGGGTTGCAGGTAGGGCAAGGGCATTGGT
AATTGTGAACCATGAGGTATAAAACCAAGGGGACATAATACTTCTAAGCCTGACTTCGTCTACTTGGCTGGGCTTCTCAAGGACCAAGATGAGTCTG 
AGGAGAGGAACAAATGGAG^TAGACAAGGGGATCCCATCCTGGGGGGTCAACCAAGGGAGGTTACATCAGCAGGCCAAAGCCTAGAAACACAGGACA
GTCTCTAAGTGTGGGGTCAGGCTGTCCTCAATCCCAACTGGTTTCATTGCCATCTGGGCAAGTCCAGGTGACCTCCCTGATGCCACCATATCCTTTT
CTGAAATAAGACC^TCGTCCTGCCTCACTTCCCTACCTGTTCATAATAAGAAGTGATTGAGAAAACTGTAGTCTGGTACAGTACAACTTCTCTTTGG
AGTAGCAGACATGAAGCCCATTGTCTGCAGGCTGCCACTGAGCTGAGGGCTGGGCTAACCTGGGATTTTCCTCTACCGTGCCCAGCCTGGAGGAGTT
CAGTACTTCTAACAGAGTTCTACTCATAGGTGGTTTCTCATATGGTAGTTCCTTTGCCCTCTCTGACTAGCCTCATGCCTGTGTCTACTGTGGAGAG
TATCTGCCAAAATCTCAGGGAGGTGCAATGATCTCACCCAAGTCATATGTCATTTGCTGACACTCACTGGCTTATTGCATTATGGATAA

GAGACACGCATGCATCAAAACATTGTACAGCTCCCC^GTAGTACACACAATTGTTATGTGTGGGGCTTTTTTAATCACCTAAAAGATAAGTTTAAAA 

TGAAAA ACG TTG AT C CC CTAATGTGGGA AAGAGCT TGTGT CTATTTGAATGA CATG AG AG CTAAGACTG CTT TGG AT CAGAAACCA CAT CC TTT GCC 

TCTCTCTCTGCCTCAGCAGAACCAAAGAAGCTCAAGAGACCCGTTTCTGTGTGCTCACGGTCATCAGATACAAGCAACGAAAGTTCTAATGCAACTG 

TGAAAGTTTCTCTTTTTGTCCCTTGTTGTAGGATAATCTGCATTAAC!AAACTGTCAAGATGTTGAAAAGGAGATGTGGGGGAGAAGGGGAGTGAGA 

AGGACAGGGAGGGGAAGCAAGCAGGTGAACTCTACACAGAAAATAAGAAAGGTAGAAATTCCCTGCACTGCACTGCCCTGTGTGGATGCACTGTCCT 

GCATGGATGCAGGAGAAACAGCCCAGGTGGCCCTGGGGATGAAGGGCTCTTCCTTTGCCTGCAGAGGGAACTGCCATTCCCAGGGGCCCTCAGCTCT 

TTCTATGATGAAAAGGATAGTAAGGCTTTCCTTCTGCAAGTCACCAGTGCTCAGATTATCTGCACCTCTCCCATCCCATCCTCTGGCTGTCCAGAGA 

G CA ACAGGC AG GGTCAA GGGACTAGATGGCCTGG CTTAGGCTGTG CTGTGCATAGTAGAGGC CTCTACAAAAAGGAC TAACAGTAC AG ATGTCTCCA 

AATT AC CTT AGTTAGAC TTT ATATC TTT AAAAAAAACT ATAATGGC ATG AAAATG AT AC ACATCC ATT AAAAT TAT C CTT CAAATTCTGAAGACAGG 

CAGTTATTGTT C AT ACAATGCAG AGGGC CATGGGGAC C CACTTGGAG ACCCTTAGTTC TTG TTTGGCAG TGTTGA TGGGCAACT CT AAAGGTAAGT A 

TGGAATCTTTTGAACAAGGGCTTGATGGACAGCAATTCAGGTTCATCATGGCAAAAAGCTTGTTTTGTGATGTCCTTGGTTGAGCAAAGGGAGAGGC 

AATC TACTAGC CCA CTTTTC ATGGTCTC CTT AAATGC CTG AAACAAGAGAGAGAGAG AGAGAG AGAGAGAGAG AGAG AG AGAGAG AGAGAG AGAGAG 

AGGGAGAGAGGGAGAGAAAGAGGGAGGAGAGAGGGGGGAGAGAGAGAGAAAACACAAATATTTATGTTCTATGAAGCTACTTTGTCTAAGATCTATA 

TGTATATCTGGGTGCTGTGATTTGAATATCATATAAATGTATATATGATATTCAAATCATATATACATATATATTTATATATGTATATACGTATATG 

CATGTATGCATATATGTGTGTATATATGTGTATATGTATGTATGTGTATATGTGTACATGTGTACAGTGTGCATGTGTGCATGTGTGCATGTGTGCA 

TGTG TG CATGTGTA C ATGTGTAT ATATGTATATATGT ATATGGGCTGGG AGCAT GATGCTTACAACTACTG AAGAGGTTGTT TTG CTG AAT CATAGC 

ATGACACAGGACACTGTGTGGTGATACAAAACAAGCAGCTAGCTAAAGCCTTTCTTTTTGATAATGCCTCTTCTACTAAGAAGCTTGCTTAGGACTC 

CAGCCAATTCTAAAGACCCCACCCCTACCAAATGCATTTTCACT^^ 

GTAT CTGAT AAACATTT GATC CT CAG AC AAAAGAGTT TCACATATT AC TTGGAG C AT AC TTAAAC TGT ACAATT AT ACCCATTTT ATC AG AAAAAAA 

AATCCAAATATAATTCTGAATTACACGCTTTTATGTACCACAACTGGCAAACCTGCTCCAGGCATAGAGGTAGGCAGGATGCCTA 

CTGGAAGGTGTAACAGAAGTGGATCAAAGCAGAACAAGGCACAGACTTTGGAGGACACTATCTCATGATAGCCAGATTTGTTTCGAGAGAGTGATAG 

CTCAAGCCTTTTGGATATAAGACACTCAACCCATCATGTGAGCCTCCAACTCTGAGTCACACTTCCTCTGTAAGAGTCTGGGAAGTAGATTCCCCAC 

GGTACCATTTCCTAATTTCTGCACCTGGATCTTCCC^CACCCATCATTTGGCCTCAAGGCCTCTTCCTTACCGTTCTGTTTCCACCTGCCTGCTCTG 

TGTGAGAACGTGTCTCTCTCTGTGGACTCCTCCCAGGACTCCTGGTC^TTCCTCTCTGTCTGACACGGCCAACCACAGCCAGATAATTCCTCTTTGG 

CTGGGGCTTTCATCTCCTAACAGTGGATGTGTGTCCTGGCGCTGTGAATAACTGATAATTGCCTGCACAATAAGCCAGTGTATGTTTCTTCCTGAAT 

CCTTTCTCTTTGACTGTGATCAAGCATGTGCCAGACTAACCGCTCCTGAGTAGCCCATAGGAACAAGAAGGCACTCACACCCCGAGCCTCAGGTTGA 

CCATAAGCAATGCAACTTTCCCTGAGACATAATCTCTACCGCCCCGTGTGCCCCAGCATCACTCCTGTGTTTTACAGTGTTCTGCTGAAAAGAAACA 

TAGCAGATACAGTTTAGAATAATCCAGAGCCAAGCCACGTTGCCAATGGGTGAAATGAGACAGTGCCATTTGCCTTTCACCTCTTGCTTCAGTTTTC
GAAAAATTGTCCCCAGTGATTACAGAGTATTTGCATTTTTGTTGGCTTGGGAACGGGAATTGGCAGCATC^GTGTTTCCTTGCTTAGAGCCACGTC
ACTATTTACGTAGCAAACATTACAAGTCAGAGAACGGAGCATTACAGGGAAACTTCTTAGAATGCTCACAATCAAAGGAGACAAAAAGGGAACAGTT 
GGTTGTTTATAAACATGACGAAGGAAGAAAGGGACAATGTTGCTCTAGAGTCACAGACTTTGTGACTCCTTATTACATGATTGATGAAAATGACAAG 

ATGATCTGGGGATGTTTGATACTGGAATTGTCTGAAGTCACAAAACACTTAGAGGATTTTTGCAATTTCAGTTAAGGTATATTCGGGACCTTATATA 
AAATAAATAGTTGTATTGTTGCCCACAAATACATCACTTTACTAATAATGTTCAGTTTAGCTCTTGGGAT^ 

TGAGAAT TTGAACT C AGGTCTTTAT GCT TGC AGAGTAAGC ACTCTTACCAACTAAGCCATCTCCCCATACCCC TGA CTTTAAC CC AAAGTGGAGCAA 
ATAGATGCATGCATGGGTAAGACAGGTTCACAAAAGCTTACTTCTTGGGCAAGTCACAGTGATGGCTAGTGGCCTCTCTGGATCTCTCTTCTTATGT 
G CACAA TTGGACTAAAATGC GTTGC CCTG AAC ACCAT ACAGGTAC ATT TGAAGGGTGCATAT CATT ATGTCTGTGA C AGTTCCCAAGGGCTAATGT A 
GGATATACGTAATCATAGTCTAAGTGGGTGAGCACTTGCGATGCACATAATTTTGTTGTCAGAGCACTGGAGAAAGCAGGGTCACCGCCACTCTTGA 
ACGTGGGAAAGTGAACTCACCACCCACGCCTTCACCCCTGTGAACTCTA^ 

GTGGGCGTTAGTCCATGCACCTGGAATCAAAGC^TCAGAGGTTTTGTCTTCGTTTTAAATCAGAGGCAATGCATTAAAACAACAAAGT 
TCCTGTTTTATTTTTTAATGTTTTATAAAATAATTTTTCCCAA^ 

T CATTTTAAAAG AGTGAGGT CATCAG AAAATACATTT ATC C AAAACAAGGT CTC CGTCTCTTGG AAGAGAGAGT ACTGGG AATTGAACCCAG TG CCT 
TCCACACACAAGACAAGCATTCTACTGCCACGGTACACCCTAGGCTGAATAATGTCTTTCTGAGTATCTAACACTGACTGGCTCTATAAACCACTTG 
TTCTTACTTGTGAATGTATAATTTTAAAAGGTCAATTCTCAAAATACTGAGCTGCAGTCTAGCTTTTCTTACCGTCTTTTTATGGGAA 
AAAAGTTTGCCCCATTCAGACCTCGATCACTGCTCAGTGGCTCCTTAGCCCAATAACATTAGTAATCTCCTCGGGGAGTATGGAGCTCTGCCTTTGG 
GAC GGATGCTC AATTGACTAATTTC CTGTGACATTTT TCAAAC AACAG TCCGCCATCTC ACTGTGACATGT ATCGATTTTCTTAAGTG TCAGTGCGA 
AAGAGG ATG GGAAATTGCCT ACTGG C AAAATTGT C CCCATTCTTTCAAGTCTG ACTGTG AT TTAAGAAGT TCT TAG TGGTCAGCTAGAAGGGCCCAC 
CTGGCCATTCCTTCCTCTCCACCAGCAACAATGGGAGCATGTGATTCACAAGGGAATCACATTCAACTAAAAAGAGAAACCGGGGTATGCTGTTTGC 
AAGGAACGGTTGAAACTGGAACTCAATATGTCGTGTGGTGTGAAATAAATGTGTGTCT^ 
AAAGGCAGACTGGTTAAAGACATGGGGTGGGGAGGGCTGGAGGGACGAGTGGTAAGAAATGGCGACAGAGGAGATGAAGGTAATGTCATAATGAAAC
CCATC^CTGCTGTGTGCAACTAATAGATGCTAATAAAATAGGAAGTTTTAATGATTTAGGTAGCTTATTGCTTGCATTCACCTCACTGTTAAACTAT
CACTTCTGGGGGATCCACACAACGAGCGAGCGAGTAAACCAGAAGATGGCGTTGAAGATTAGTAATCATATCTTTTAAACAAGATAACCATGTGAAG 
TCTCAAAAGGTTTCTTGTAATGACTGTTGTTTAAACTTCTGAAAACAGAGGATGTAGATTGGCTGAGGAAAATGTTGAAACCGCCTAAGTCAAGGTA
GAAGACACGTGTGTCTAAGTGAAAAAAAGAAAAAAGAAAAAAAAAAAAAACCAAAAACCTCGGGTTGGCTGCTTCTGTCCTTAGTCTGTGCACGCT
TGAAGAAATGTAATTCCTCAGCAGCAAGGCTGTGCTATCTGAAGCTACAATCTCTGCTTTGCTCCGAGGTGTGTCTCTGGTGACCGGGATAGTTCCC
G AC AGA C AG AAGGTGTTCAAAGAATATTT TTG AATGAATGAAA CC C CAAAG GAAGAAGAGGGGAAAATGGGTGTGAC CAAAATTTT CTTTG AACGAA
ACTCTGTTGTTTACTACCAGGGCTCTGACAATGGAAAACTAATTGGGGTGAAAGAACGACATGGCATCCTGTTAATTTCTGAGAAAGCCTGTTGATG
TTAGGAAAAAAAAACATGCCGGTGGGCATCTCTGCACCAGTTTTCCTGTGGCCAAAATCAGATGTTTCTCCTAAAGTCCAGAACCCAGGATGGAAGA 
TTAAAAGAAAAACTGAGAAACATGTGAAATGAAAAAGTTGTCAAATCTTTA 
AGA AACTGGTAGTCTGCTTGCTTACCTGAAAACACCAAGATAA CATATAAG CTC C AGG CAT C CAAGCTG AGC TGGAGAAAGTC AGCGGC AAAAGCTC
ATGGAGTTTACATATGAAGGTCAAAGAAAACACGAAAATAAAGTAAAACCTTCAGTCAGCCTAGCTGTTCTATTTGGGGCATTGGTACCTCACCGGC 
AACTGCCTCCCACGAGGCTGAGGTTAAAATTATCATTTTAAGGTGAATTGACATCCGGAAGCGCGCTAACTACCTGAGTACTCAGGGATCCCCCATC 
TCTTTTATGTTGCCATGATTGAAACTTTGGGGACTGTGCTTGTCTGAGTCATCTCAATTCGTCGGTTTCATTCACCCAACATGTATAAGCGTTTCAA
ACA C AG T ATTT GGG CCACGGCTTATAAACTTGCC TTTCTATTTTT CTT TTTAGTGAGCGTGAT ATTCTCTAAAC GCT CAGAG AGAC AAGACTCCGCT 
TTGCCCAAGGATGCTCCCGACCTCTCTCAGTCTATCTCTTCTGTCACATGCTGTGAGAACCAAGTTCCCTGTGCTCCAGACTCTCCATCACTTCCCA
ACCTGGCCATGAGCAGTTAGTAGTTATCAGCTATGCTCAGTGCAGATTCCAGTATCCCCTTTGCATGCCTCCACCTTCACAGGAACGGGCCATACCG
ACTTGTCCCATCCGGTTGAGGATTTCTGAGTACATCAGAGTCCCCAGCCCCCTCCACAGGAGGAGCTGAAGAAAGCCAGGGTTTGTCTGAAGTGGGA 
CAGCCCTTGACC03GTGGGCTCTAGTCCGAAGCTCCTGTTCCTGCGGGACACCCAGGCACAAGGCAGAGGTGGGGGGCGGTCCTGGGTATGGGCAAG 
CCACGCCCTCTCAAGGCGGGGCCGAAGCGCCCGCCCTGCACTCCGCCTCCGGCTCTATAAAGTTCCTCTTTCTCACCTCACTTTCCTAGTTTCACCA
CTTGAACTTGGGACCCTTTGCTGCCCTCAGCTAAGAGTGCGGGTGAGGTAAGGCCTGTAGTCGGGCAGAAGGAGGAGTGTGAGGCTGGTGGCAGAGG
AAG CCTGGCTT CCAT CTC TGAGCC TG AGGG AG AAT GC TGAGAT AGCGGACCCAGGCTCCGCTCATCTACGCTGCCCTAGGACCTGTGCACTT CGGGT 
TTTGTATGAAGCTGTTTGGGTGGGAGTTCCAGAACATCCCCCACGGGCTGGGCGGGACGAGCTAATGGGACTGTGGTGTCATCAAAGGATCGCACTG 
GCCACAGCTTGTCCTCAGAGGGAC^GCCTCTGACTCTCTCTGCTCCAGTGGAAAGCTCCTTTCCAGCCCTGGTTCCTAAAGGACCCAAACTCATCTA 
GGGCTCCAGAGCGTGATTCCTAGGCCGGGCAGCCAAGAAGAGCTGAGAGCTCCAAACTTAGGGTGCTCAGAGCCCCTTTCCCCGCATGCCCCTTCTT 
CACTTCTCTGGCAAGAGTGCTAGTGTTGCTGTCCGCAGCACCCCTTATTCCCAGCCTCGGCrTCATTCCTGCCAGGGTTCACGCTGACATTCTGCAG 
GTTGGAATCTCCTGTTTCTTGGCTGCGCTGCTTGCCCCATAACCAGACTTCC^CTTGTTGCTTCCAGGACCCACGTGATGGTCTCTGGTTGGGTAGG 
CCTGGGGTTATTCCGAGGACAAAGTAAGGGTGTCATAGAAGAAAGTCAAGAGAGTAAGCTAGGTACCGCAAACCTGCATGGCAGGGACACAGGACCT 
GGACAAGGGCTAGTCCATGTGCCAGGTCCTTTTCGCCTGGGGCAGCCAGGGCAACCTAAACCCAGGAAGGGGCAAGTGTAGAAACAGTGAGGGAAAA 
GTGGGATG AAAGCTACTTGGAT CC AGC AC AGAGGG ACG AGTG AGCAAAGTGAGCG CCCCAG CG TGGCGCAAGACTTGGGAT CTGCAGAGAAGCTGTG 
TAGCTAGGAGCTTTCAACGGAGCGTGTTAATGTAAATGTAAATGAAGAAATTACCTAATTTTTTTAATAAAAGAAAGAACAGACAGGCAAAAAAA 
AGGAGGAGGAGGAGGAGGAGGAGGAGGATGGTGCGCGCCAAGGGATGCTCTCTATACCTTCGTCAAAGTACCTTCTCTTGGGGGACTTCGGAAGACT 
CTGGCACTGCACCCGAGCACCTTGGCAGCCTCAGAGACTCGGGGCCTCGTGGGCACTGCAAGAGTTTGGGACGGGGCTTCCTCCCGCCTCCAAAGTG 
ATGCGAAGGTAGTTGCAGGGAATGTGTGTCTCTCCTCAGCGCACAAGCCCAGGAGGAGGTCCGCACGCGTCATGAACTTGGAGACGGGCAGCCGGGG 
CTCAGAGTTCGGCATGAGCGCAGTGAGCTGCGGCAATGGGAAACTCCGACAGTGGTTGATCGACCAGATCGACAGCGGCAAGTACCCCGGGCTGGTG 
TGGGAGAACGAGGAGAAGAGCGTCTTCCGCATCCCGTGGAAACACGCGGGCAAGCAGGACTACAATCGTGAGGAGGACGCTGCCCTCTTCAAGGTTA 
GCAGCATTCAGGGATCCCTGGGCAGGGGTGGGGGTGGGGATGGGGAATCTGAAAGCTCTGAAT^^ 
GCTCCTGCAAGGAGGAGGCCAGAGCATCAAGCATTGGACCCTGCTTAGGCAAAGTCCCCAGGAGAA^ 

CAT ACA CAAGAAACCAGG TCCCAATACTGTTTGTGTGGAGGAAAGAACTTC CAGCTTCAGGGG CAT CT CTGGGGGACCG AGGTTC CGTTTG CATAG C 

CCATTCGCTGTTTCCTGCCACCACCACCGACTGCTAGGGCCACTCTCTGCTTCCCTGTCTCTCTGTGTTTTGTTATTTTTCTGAGTTTCTCTCTCTG 

GGTTTTGTTTCTTTGATTGGGCACCTCTACTGTCTGGTTCTAGTTCTAGAAGCTGCGATCTCTGATTTTCTTTCTTTGAGTAGCTTTGACTATTCCG 

AGTCTTTCTCTGGTATCCCCCTCCGACCCCGTGTGAGTCCCTTAGGACTGATGTCCCCAGAGAACTGGCTCACTGAACTGTGAAGCCCCCAGCCTCC 

ACCTGCCAGCAGGCCGAGGAAGGGGACTTCCTGCGGGAATTTGTTCAAAGTACCTCTGTGATTTTGTAGATGTCCTCTCTGGGGCCTGCCCCCTCCA 

CAGCTCTGTCCCCAGTCTTGCCCACACTTGATTCAGGCGCTGGGCGTGTACAGCCCATACTAGGGGTCTCAGGACCCCACTAACATCATGTTCCACA 

TTTCAGGCAACAGCAAATTTGAAACAGTAACCTTCCTTGCTGAAATGCAATCCATAGAATTCTTTTGACGCTCTGGGCTTGACTTTTCTTCTCATCG 

TTCTTAGGCTTGGGCATTGTTTAAAGGCAAGTTCCGAGAAGGGATCGACAAGCCAGATCCTCCTACTTGGAAGACAAGATTACGATGTGCT 

AAGAGCAATGACTTTGAGGAATTGGTCGAGAGGAGCCAGCTGGATATCTCTGACCCATACAAGGTGTACAGGATTGTTCCAGAGGGAGCCAAAAAAG 

GTAAGGGGTTTTCCCAGCCCAGGTGGCAGGATAAAGGCATTATGGCACTCAGAGAGCCCTTCTTCCTAGAGACAGTCACGTCCTACCTCTGCTGTAG 

GTTAAGCCCAGATGTCCTTTTGCCCATGTCCTCTCTGTTATAAGTGACAACCCTGTGGTGTTAGTATAGGATGACCTGGCAGACTTTAAGCCCCATG 

GGTGTGTGGGTTATGCACTrGAAGGCATTATTTTCAGTTACTCCATTCAATTAGGATCTGGA
ATGTTTACTTACCTAATATCCTCTAGTAAGCATTTTCAAGAGGAGAAAGCACATCCCACACCCCATACATATTCACACTTCTTGTAATAAAACTGCT
AGAGTTTCTGGTTTAAC ATGG CCTG CTAGGGTGGTTATGAATATT C AGATCTTGAGTTCCCT CTCTTCCAACTAGTCTACCTCAAGCAGTG CT CAGG 
AATC TG CAT TTGGTTCCAACCATACAGGATGCCTTAACTAGGTAC CATCTCACAACC AG AAACCAC TTGGTGG ATC AC AGGGATCCTGGGT GGTGTT 
TCCTTCCCTGGCTGTCACTCACAAGTCAGCAAATGTTT 

C AAACAAAACAAAACAAACC TAAAATAC C CG ATTGGTTAATAGGGCTATGC ATTCTAAG AATTAAGTGCAT AGGTA CTTTTATAAG ATTTAAGTCAG 
TTCCTTGTCTTACTCTGTGTTCTCTCTTCCTTTTCCCCAAACACA 
CTACCCCATGACAGCACCTTATGGCTCTCTGCCAGCCCAGGTATGTGGTAGACTCTT 
CACACAGAGAGGCAAACACAAATGAAAAGTGTAGGGCTGACTTCTTATTTGCTATGGCTAGTACACAC 

GTT T CAGTTC CAGTGTGGTGTCACTGTCCCTGACGCCACAG TTTTGTTGGGGAGT TTGATGTGT CCCAC CTGTGG AGAGAGGCTTC C ACTGATGGTC 
AGATCTTCTGGGAATCAGACCTTTTGTGGAAGTCAAAGGTTTTGGAAGTAGTACTTTATCATGTGAAACCG<^GAGCAGCTGACTTCTCTAGGCGTC 
C CTG ATGTG AAT TACAGTACTGTTTTATTCAC TTTGGTGG CTT AAAAAGGG CAG ATT TC ACTGCGGTATTCTTGGTGCCGTGTTCAGCC ATATGATG 
AAGCCTTACAAAAATCACAGCTTTATACAATGTCCTCATTGTGCTTTCAGACCCTCTATGGCTGTTTTTTACCTAGTGTGATAGACAGTCCATGTCA 
CTTTTTGGGCAAAATGACTTGGCTGCTGGACAAAAAAAGGGGTTCCCTGAGGAGTTTG^ 

TCTTAGCAATCCCTGTGCTCTGTCAGCAGGTTCATAACTACATGATGCCACCCCATGACAGGAGCTGGAGGGATTATGCCCCTGACCAGTCACACCC 
AGAAATCCCATATCAATGTCCTGTGACGTTTGGCCCACGAGGCCACCACTGGCAAGGCCCATCTTGTGAAAATGGTAAGGATTGTGCCAGGGCAGCA 
GACAGAAGAACAACCTGAGCTCGGGGTGTGGACAGCACCACAGGGCTTTTCCCTACCATTGAGATACCAGAGACACATCATATGAAGCTGCTACTGT 
TGTTGTTGTTGTTGTTGCTGCTGCTGCTGCTGGGGTGGTGGGGTGGTGGGGTGGTGGGGTGGTGGAGTGGTGGTGGTGGTGGTGGTTGTGGGGTGTT 
GGGGTAT GTTGCCT TGTCCTGTGAAATGTTG AAG TCCTTAGATCC ATG ATAGGCCTC AG TCTGTGTGGGGA CTTAA C TAGAAGACCCC AG AGATCAT 
TCCAAGTAGCTGAAAAGTGCCCCATTTTTAATACATAGAGAAAAACATGGATGACAACAAATTCTCAATGACAGTAATGTCAATTATAAAACTCGTC 
TATATTTTGTTTTAACTGAGTTATCCCTTATTTCCGATGGTGATTAAGTTGGGGGGTTTGTTGTATCCCACCTATCTCCCTAGTCTGTATCTTTCTA 
CTCTCCTGTAAAGTAGAGAGTTGTACCCAGTCCACCTCAGCAGGAAATCATTGCTAGTTCATGTCTCTTGAATAATAATGAGTCATCTATAGCTGTT 
CTTGGT ACT AAGGAAGGAAGGATCAGAGCGAAAGTAAT CCACAAAGTGT CT CTAC AAATG AGTGCC CTGCC CGAAAGACC C ACAGGGGTCC CCCC AT 
GCTAGCTGGGCTCTCACAGAAGAAACGCCCACTAACCAGACACAAAAAAATTTCACAAACTATGTTCAGTGAGACTTGGGTCCTTTAGTGTTTATTT 
AGGTGAGTGCACCAAGCTCCACCTCGGGTCCTTTTTTGGCTGTGTATTTTAAGGTAGAGTCTTGCTAAATTACCAAGGCTAGGATCTTCCTGCCTTC 
AACTCTTGAGT AGCTGGGACTACAATCTTGTT CTAACGGG CTGAACAT AAAACAAGTTT TTAGGACTT ACAAG TTCACTGTT TAAATATAAGT CTTG 
ACATGGGTCGCCGTGCGAGTAGTTCTTTTATATTGTTCTGGCAATACTTTCCTTGTGACAATTTCATCAACACCCTCACTCAGTCTGTGCATGCTTA 
CACTAATCTTGCTTTAGTGTGACATAACTTCTCTGCTGCCAGAGAACACGGTTCAGCCCCTCCCCCTAGCTAACAAACAGTGAGCAGAATAAATGAG 
GGTTGAATAATTAATTCATCTTTGAACTAGTCTTATAGAAGTTTGAACTCTGACCCTGCTGGTAACTTGCTATGTGGGCTGGTGCAAGTCCCTCTCC 
TTCTGGGCCTCAGTTTCCCTATAGATTTGGAGTGAGCCCCAGGTTTCCATCCAGAGCTGTACTGTGGCTCCTTCCTTCATCACCCTAATTTTTATCA 
CTGGATGTGGACTTTGGACTTTGTCCCATAATCACACGTTATGCTGCTAGCAGGTGCTTAGAGGCTGTCAGGCTTGGGTTGGAGGCCATGGCCTCTC 
CCAACTCAAGAGCCTCCCCGCACTCAGACTCGATACTTAGACATCATCTGATTTTTATTTGCAAATGCAGGTTGCCAGGTGACAGGAACCTTTTATG 
CTTGTGCCCCACCTGAGTCCCAGGCTCCTGGAATCCCCATTGAGCCAAGCATAAGGTCTGCTGAAGCCTTGGCGCTCTCAGGTGAGTGTGGCGCTTC 
CTGTAAAGCTCCGAGGGAGGGGGCATCTCTCCTCTACTGAGGTTGGGTGAGGATTTAGACTCTCGCCTTGCAGGCCCCGGGGTCTGGAGTAGGCATG 
GTCCAGGCTATGTGGACATCACGCTGAGTCAAATACACTATTAGAAATCTCCACAGCAGTACCAGCTAGCCAAATACTATTTGGACGATGTCTTTAA 
CCTTCTACATCATTACCTGCCCAGTTTTCCAGGAATGTGTAACCAGGCTCCTCCTCCAGCCGACATTCTCCATTCTCGCAGTGTGGAAAGGCTTTAT 
AGGCACAAAAGAATGCTGTTTGTCCTTTTAGGGTGTAGGGTTGGCCACAAACAGGTGGTCTGAGTTGCTTCCAAGGAACACTGGTTCTGAACCCTGG 
T CT CTG AGAAGTTC TTAT GC C CC CT AAAGGATCATATAGGTC TGACTCCCTCACAACTTTGA CAGAATTGCTG AGC ATGTGTGGATGTGATCT GATT 
TTAAAGTTCTGTTACTAAGGAAGCCTGCACTTGGAGATACTGACCAGCATTTTAAAAGCCCACACTCCGTGGAAGCAGACATCTTATGTCCATTTAG 
TCTTTAGATGATTTTTTTGGATGTTTTCAAATGGAATTATTAGAATTCTCATCATGCCCTCGGCTACCTTAAAAGCCTCTGACTGAAAACATCAACT 
GCATTTTGACAATTTTAGACACTTCCCTTGTTCTCGAGGGAGGAAGAAGTTTTAAAATCTAGTTCCTTCCAGCTCTGATGCTCAGGGAGACTTTGTG 
AGCCACTCAAGAACAGCCGAGGAGCACATCTGGGCATCAGGGTTTGTCACAGACACTAGAATGCTCTAGATCCTCTTCTGGAGCGCCAAAGACTTGT 
GTGGGTGCCCCAAGAGTAGGAAATAAACAGCTATTTATATCTCTGCAATCTTGTGATTTTGGTGACATTAAATGAAATGAAACCTGCCCTACCACTC 
ACCTCAGATGGCCAACGCCCCCTCTCTTTGGGTGCACCACTTGTGCTGTTCATAGCTGCAGCTATCGAAGACACCATGATGTGGGCTGTCAGAACTT 
GCCATTGAAGAATACGAGGCTTTTGTGGGTTTCTTCTTCTAGTTTGCATAATTAATTATCAACCCTGAGTGCACTTTTCAGAAAGCTATTCTTTCCA 
GGCATTGTTGGGGCTCCAACCACCAGCACGGGTATCTATCTCTGCCTGGGGAGCCCTTTGCACACCCAGCTTGCCCTTTCGGCCCGTGGGTGGTATT 
TTAAAGTGGCTTCTGAAATCAACAAAATCATGTGTCAATAAATTCCTGTCTTAAAGCTGTAGAAAACCTAGTTGTTGGGTTCTTTTCAGAGTTGAAC 
ACGAAGCTTAGAGGGATTTCAGGGGGTTTTACATTAACCACTGGCTTTTAGAGCAGTTCTCATCAATTTCTTCCCCTACTCCAAGAGAGCTGACTTA 
AAAATAAGAAAATAAAGGTATCATTTTCCAGAGCCCAGAAATTGTTATTTTAGTGCCTGTCTCTAACATATCTATGTGGGTTTTGTTGTTGTTTGGT 
TTTACTTAATGACATCATGGTAACACCTTAGGGAAGTTCCAGAGCTGAGGACACTATTTGCTTTTCTTCTAAGATGTTTCTGTATTTCTTTTACTAA 
T AGAAATCTGTCCCAGAGGT CAA CT CCAAAAT CAAAAT TG AGTTG CTGGAAAACGAATT C CAATTCGGT AGTATTATTT C AT ATTGTAGAC AAAATG 
CCACCACTGTTAACACCATCATCCGAAAAGCCCTCATAACAGGGGTGTGCTTTCTAATAAAATTTGGCTGAAAATTCAAGAAATATATACCTCTCCC 
C AAGAG AAGTAAAT GGC C AC AACAACATTTG AAAATG ATCGTGTTAGAG AG ATCAGTTTCTT TCCACAAGCTTCTCTTAGTA TTCTGTGCTTGAGGT 
CTAAGAATCTACAGGGAATAAGAGCAGCTAACATCTCCAAGACTTCCTTGGTCCTAGGATCTTTCACTTGTTCGTGGAGCATCTTGACACTCAAGTG 
TTCCACCTGCTGTCCTTCGTATCAGTCTAGTCACCGAGTTTTTGGGGCTCTGAGCAAGGTGGCACCTTTTTOU^ATCCATCAGCACTGACTCCAGAG 
TTTTGTTCACAGACTGCCGGCTGCATATCTGCCTGTATTACCGGGACATCCTCGTGAAAGAGCTGACCACGACGAGCCCTGAAGGCTGCCGGATCTC 
CCACGGACACACCTATGATGTTAGCAACCTGGACCAGGTCCTGTTTCCCTACCCGGACGACAATGGACAGAGGAAGAACATTGAGAAGTTGCTGAGC 
CACCTGGAGAGGGGACTGGTCCTCTGGATGGCTCCAGATGGGCTTTATGCCAAAAGACTCTGCCAGAGTAGGATCTACTGGGATGGGCCCCTGGCAC 
TGTGCAGCGATCGGCCCAACAAGCTAGAAAGAGACCAGACTTGCAAGCTCTTTGACACACAGCAGTTTC^ 

AGAATGG AGGTGGTGGTGGG TGCTGGCTATAAAGGTCT CAAATGGC AGTGTCTGC CTAC CC CAG AC AGAGGTCTT CCTCCTGAGAT CTGTGAGCT CA 
TGCAGAAATAGAATTCCTGCCTGATTCATGCCTAGCCTTTGTCTGTTGTGTACTCCCCTGATTAGCAGAGGGCCAGAAAGAGGATCCATATTTGCTG 
GCCAGGATAGACACTGGTGTGGGTTGATCTCTTAATTTATCATC^TTCTTTTCACTCT^ 
TCAGGCTGGCCTAGAACTCCTGCCTCGGGATTTTATCTGTACACCAGCACATCT 

CCCCCAGAGAGGTTTCACTGTGTTGGCTCTCTGCTCTCAGCAAGTTTATCTGCTGACACCTCAGCTCTTTAGGGGTTTCTAGAAAAATC»AGAAGGT 
TAGAAACACCATCTTAACATTTTTGGTTATGAGATGTGTCACCCTTATAAAATTGTCACACTTGCGAGGCAGGACCAAAGCTGAGAATGTAAGGAGC 
AGATTCAGCGAGGGGGAAATTCGGAGAAGAATTTGGTCTTATCTCTGGCATACGAAGGCTTCTAATCATATTTTCCAGAACCTGTACTTCTTATGGT 
ATGATTTTTGACTTTGGGTAGTTACGTTAGACCAGGGCTGGCTTATATGAAGTCATTAGTCTTGCCACTGGTGGGTACCACCCAGAGGCCTCCCCTA 
GAAACGTAGCCGAGGATGCCTTTAAGGAAGCAGGCTGCTCTGCTGACTGCTAATTGTCATTCCCTCACCCC^TCATATCCTAAGCTTGCTTTCTTCC 
CAATGCCCTTCCCAAGCAGCTC^CAGCTCTGGCTTTCCTGTGCTTTCCTTTTCTTGAACCGGGACTTGCCTTGCTCCTTCTCATCCATGCACACTGT 
G ATGCTT TAAAATG CGTG CAC ACTAGGATCT GGC T AA ATCCACTAATT AGC ATGC AT AT C AC CTTGCC TGT CT ATCTTT TTT TAAT TTG GTAAGAAC 
ATTACTTGAAATCTACTCTCGATATTTTTTTAAGCACATGGTCCATTCATTATTAGTCATGAGGATTGGTCCTTGAGCTTGTGTTCAGCAAACTCCT 
CATGTCTTAGTGACAGTTTATATTCTTTGACCCACAGCTTCCTAGCCACTTCCCACGTGTGGTGACCGCCATTTGGCGCTACTTTTATGAGCTCAAC
TATTTTGGAGTTCGCAATGCAATGGAGCGTCACGGTCTTTGTCTTTGTGTGCATGGGTAATTTCACTTGGCATGATGTTCTCCAGGCTCATTTGTGT
CATTACTAATGACAGGCCTTCCTTCTTTATAAA 

GGGTACTTGGGATGATGCCTTGTCTAAGCAGGTCTGACGGGTAACGCTGAGTGGGGGAGTATAGCTGTCTCTCCAAAACCAATTTGCTCTGCCTTGG 
TTACGTTAGACCAGGGCTGGCTTCTATGAAGTCATTAGCCTTGGGTCTGCACACAGAAGGGGACTGCTGAGTCCTCAATGTGCTGCTTCCTTCTGTT 
CATTTACAAAGAAGACGACAAGCCCTAGTGTGCTTGGTACTAGTGGTGTTTCCTCAGCCTTTTGAGCTCCAGTGGCCAA 

CAAATCCTCTTTCGTTTCCAAAAACAAACATGCTCCTTGCTAAATATCCTCCCCTGAGAGGTGGGACACAGAAAAGAGAATCATCTCAGGGTCCAAG 
GTGTCTCAGGCTCCAAGGCGTCTCTGGCTCCAAGGACTGATGAAGATCCTTGGAGCTGGGTGTGATCTAAGCCTTGGTCAGACAGCCTTTGCCTCTC 
TTAGAGTGAGTTCTAGTGTTTGTGGTCCAAGGCAAAGAAAAATAACTTCTGGAGATGGATAGGTACAC^^ 

GTC CTATGTGCCCTCCAGAAGGCTA CCAC TC ACTC CCTGCTTG ATTAAGGTAG CT CAGAATAGTGACAATTGCCAGCGAC AGGGCTTTGAGGTAC AC 
TAAGTCTTGGTGTGCCAAAGAGGATCCAATTTGGCAGGAAACAAATGCCAAAACAGAGCAGACACCT^ 

T AGTTATTAATGAGTAAGGGCTATT TCC CTTTGT TCC CTTCTTCTCTAGTGGGGC CTGT TGTTTTTGCTGTCCACAGTCCCCAAGCCCCACAGAGGC 
TCAGGG AAG ATG ACATTGCTGCAGAGAGGCC CAACAG CCT CAGAACCT AAGGATGCC CACCC AATGGT CCGGGAAGAGCAGG CTG AGGTGGGCTTTC 
TCAGAGAGTCGCATCCCTTGCTATTCTGACCTAGCCCCTCGTAGGTGTCTTCCCGGCCACAGTGCTATGCACTTTATACAGTGTGCCTAAACCCCTG 
GGTTTGATTTAAATTCTGGAAGAAGAAAGAGAAAGAAATGCAGGAAGTCAGTTCTGTGAGCT^^ 

T CTTTG ATGTCT AC CCATTCTGG ATTTGC ACCCC ACTAGGGACAGTCC CCATAGGCACAGTT GAG AAT TCATATCTGAT C AGGGCAGAGTCTTCATG 

CCTGCTCTGTGGAGGCAGCTTTTTAATGTCAGTTCTTTGATGCAGACAAGACCTGGGAACCTAGCTCT 

AGTGGATGTGGCTTTCTGCTTGTGCTGGGGGAGGAAGCCAAGGCCTTGCACATAC^ 

GGCAGGTTTCTTTTC^G<^TCTTGTCTGTTTC^TGTCTCTCAGGCAGGACTAGCCTCAGCATGACATCCTTGTCAGAGGGGCTTCATTGGTCCCCT 
TCTCCCTGTATCATCCTGTCCCCAAAGTGAGATTGAAGCCTACTCTGGTTCTCCAGTTTTGGAGTTTTAGACCTAGTGCCAAGTAGGACACAGCTGC 
CAA CAG CTGGT GAG AGAAAC AGATGCTCTTGGTGCCC AGA CACCACGTGGCCTCCATGGTTA GCTAGTGAGGTTAAAAAAATAAC CCTGGGCC ATCA 
GAACATTGTGACTCTTTACATTAAAATGTCTCCTTGGCCTGTGCTGATTGCTTGACTCAGCATGGCTACTTTTCTTTTTCTTCTTTGTCTTCTTCTC 
TTTGAC CTTGTGCATTTCTGTGAGTGTAGTG CTG C AGACC CAAGT TCTTAAGGTTGGGTCATGTT CCTTAAGAGTAATG AAGTAAAACCAGTGCCAA
GTCAGGAGATC ATATGTGAACTTGAC CATGT GAT TTTGTGTCTAGGGTCTGCTCT AAGGGCTGGACTTAGGGGAACAGAGCC CGGGCT CT CCC AAAG
CAGACTTCCACGTGACTCTGGCTTTCCGTTCACCCGCTTTACCAGGTGTCTGAACAGTTTGGTTTTTTTTTTTCTTTCTTTCTTGTGGGTTTTCAGA
GCTGCAAGTGTTTGCTCACCATGGCCGGCCAGCACCGAGATTCCAGGTGACTCTGTGCTTTGGTGAGGAGTTTCCAGACCCTCAGAGACAGAGGAAG
CTC ATCACAGCTCATGTGAGTACCTGGTTAC ATC ACC CGTAAATC ACAC ACTGTGGAGC TGT C CCTTTTAG AGAAGTGGC AAGTGACGAGTAAATGT
CAGCTCACCTGGGAAAATAGATGTAGACCTTAAAATAGTGCAGGAGGAAGCAGGCTCCAGTGAACACCACAGCTCAGGGAGGCACCCGCAACCTACT
TCCAGACAAATTCTGTCACCACCGAATCAGCAGGGCAGATGACTTGGACCCAAGGCTCTGTTTGTTCTGTATTCTTTATTGTTTCATACAGACAGTT
ACCTGCCCTTTTATAGGAATTTTCAATAGTTGGGACCAAGTACTGCCCTTCGACATCTCTGTTTCTTGTGTGGTTTTAAAGATGCTGTCCTTTCGAG 
TAGAGTAGCACTTTCTCCCTGGGAGGCTGCCTGTTATGTATTATGCTTCATCGTGCCTCCTAACTTCAGATAGTTCCCAGACCCTCGTTTTGTTGCT
GGACTTTAGGGAGTTATTTAACAGTTGGACAAGGGAGGTGGAGGAGGCTGAGTCTTCCCAGGAATCAGGTAGGTCGGTCTATCCTCACAGCTAGGGT
TTATTCGGATAATGTTCATCACTC^CTTAATAATTAAAAGGTAATTCTGAATACATGATGTTTTTTAATTAGAA

GAA AAGT ATGC AGT GTG GAGTAAAGGTTGTGTCCCAG ATAGC CAC AAT ATCTCAGTG CAAAT GGG AT ATT AGCTCTG AT GAT AT CTCTTAGTGGAG A

CTGAAGACTAGGCATACAGCGCAATGGAAGGCATTTGCTAGGCAGTGGTAAAGCCCTGGGTTCTAAACCCCGCCTAGGATGGGGGTTGGGCACTGAT
GTTG AAC AT CC AGC CTC CCTTCTCGGTTGCAAAAAGTAAAATCTAAGAAGCAACAAA CG GGCTGG AGAGATGGCTCAGTTGTTAAGAGCAC AGGCTG
TTCTTCCAGAGGTCCTGAGTTTAATTCCTAGAAACCACATGTGCCTTACAACCATCTGCAGTGAGCTCTAATGCCATCTTCTGGTGTGTTTGAAGAC 
TGCTACAGTGAACTCACATACATATAAATCTTAAAAAAATAAAAGGCAATGAAACTATGATCCTGGCCTTGAGCCTTTTCTCAGTTCTAACTGGTGG 
TTGATATCAAATGAGACTGCAGATGTGTGGATGAATCTAGCATAGATAAGCAGTATTTTTTTTTTTAAGGTAGTGAGTAAATTCTAGCATAGATCTC 
ATTTTAAGGACTTTGGGTGCAGTGGGGCTCCGCAAAAAGGGAGCAACAATAGTCATATA 

TGGAAAA CA TA CATGGT C AC CCATG AACTCTGCTGGT CTC CTT AT T AC AGACTT AATTC ATATGGG TGCTTAC AGAGGAAT C CTAC CAGACATC ACA 
TATCAAATAACAAAGAGGCTTGATTTATTGATGATTGGTTGTTACAGAGCACACAGCCTGACT 

TGCTTATAAACAAACTAGGTCCATCAGAGCCAGCGAGCTGCTGTCTTGTGGCTGGCCAGCTCTGTCTTCTACTTGTGGTTCAGAGTTCTGTCTATTT 
CACAGTCATCTGGTTCTTCAGGATGAGCCCTTCTGTCAGACTCATGAGCCTCACTTACCCAGCATGTTACTTAGCCTTTTAATTTGGTCATCTCATT 
CAATAATGTCCAGTTAACTCATTCGCTAAATATCAAATCCAAGAGGTGATTGGTTTCAAAATGCCATATTTATCTTCTATTATAGAATCMGAGTTC 
TTTTTCCAGGGTTTTTAATTCCAGGTATTATAAGAGCAAATGAAACTGGTTTTTCAAATGGCTCTGAATGTGAACTGCTTCACTGTGTTATGTTATC 
CTGTGCAGCTTGTAGGTTTTTACTTAGAGTCCTAGGGTCATTTCATGATGTCCCAATTGTATGGTGTTGAGAAGAATATTCTAGTGATGTCTTTTTT 
TCTTAAATGTCTTATTAAAGGTGGAACCTCTGCTAGCCAGACAACTGTATTACTTTGCTCAACAAAACACTGGACATTTCCTGAGGGGCTACGAGTT 
ACCTGAACACGTTACCACTCCAGATTACCACCGCTCCCTCCGTCATTCTTCCATCCAAGAGTGAGAAGAAATACTCTGACAGGGCAGCCGGCTGCTG 
CCCTTTCTCTTTGGAAGAGCTAAGAAGTGAGTGGGTTTCCACTTGAAGACAACAACAGGGCTTTGTGAGGAAAAACAGCTGTATCTGCTCAACAGAG 
GAGCTT C CC CC AGAAGAGTGCCTGTCATCCAGGT CTTGACAAGTGCCAGGACTTGGGTG ACTGTG CCCTGG CTT AT AACTGTG AAACTTGATCAGTG 
TTGTGTTTACATGTACTTGAATGCTGGCTTTAGCCTGGTATAGATGGACTTTTGCTTGAAGACTGAAAACCTGTGCCAGCATGAATCCCTGACAAGA 
GAAGACATACGTATTATTGGTCCATTTCTCAGGGAAGTAAAGTCTAGACTAGAGACTACGCTGTCCACTCACAGAGAAACGCATTCCTGGATAGAGG 
AGCTGTAGGTTGCCCAGATCATGTCCACTGAGTGAAGGGACTTGGTTCTTAGGTCCAGTCTAGGCTATCATGTCCTGATCCTGCACTCTCTCTGAGT 
AGACCCTACATCCTGTTTGCTCCCTCTTCTTGCCCTTCCCTAAAAGCCAGCTGGTGGACTCTTGTTGCCACATACTTGAGCCAAAAAACCATCAGGG 
ACATATAAGATGAGACATGTTTGCATCTCCTCTGATCTAACAACGGCTACAAAGTTTTTTGTTGTTGTTGTCTCCCCTTTCTGTTTTGCCTCTCATT 
CTTTAGAATTGTTG CTT TTT TGT AAAGAAAGGAAGAG AGAGAGAC CGAATATACTAGAG ATAGATTTTTAAAATTCTGTG CCAT TTC CCCTT CTATT 
TGTTTTGTTGGCCATCAGTGGTCCCCATGGCAACACTGGAAGGGCGGAGCCTTTGAGCAGGTTGGGAACTGTCCCGGACACCAGTGGTTCAGTGTAC 
AGCTGTAGACTCAGTGAGCTGTTTCTGCTTATTTTGTATATTAGATGCTTCCTTGTGCCAATAATAGTTTGACAGAGTCTCAAAGCTACCGGGCTTC 
TTTTCCAACAACACATTCTTTCACAGAATCCCAAAGAGTCTCACTTGCCTTCAC^ 
TGCCATGCTCTGGGATTTTTCTCCTACCACAACATCATTCCTGACTCTTTC 
TTTTTAATTTTTTTTTTAACATGCCACTCTTAATATTTCTTCTTAAGTC^ 

TCATTGTTTTATTTTTAAAAAGGAACCTTTGACAAATAATACACATTACAGAAAATGGAAGCATTTTTCATAGAG 

TTATGTTCG AATCCTCCTTGCTAGTATTAGCC CC AAG AGT TC AGT CTGT CC TTT AAGCATGTTTT AGT TTT CAATACATATATATGAAGT TGC CCTT 
TCTTAGAGAGAGAG AGAGAG ATATGTGATGA CACTCTT ATGT ACC C AACTAT AC AGATTTAC CTC ATCT TTTG AGTGGCATGTGG ACT TCCACTGCT 
ATGTGCCTAGGTGCAGTTTAGACATCTTCTTAGCTACCCATCCCAGCAGCCCACTGAGTCTCGTGAACTAAAGTCACTTATACATCTTATAGATGCA 
AAACATT CTGGGGTATATTATCCCAAGGGAAGATGAAGGC AAAAT CGGGAAGCACTCTTC CTGAGG CC C CAAGCCACAGCTGTCCTCCTGGAG TATC 
TTAACCATATCTGGCACTTCACCAAGGTCCCTTGCCTGGTCCTGTATGACCATGACTCTTCAGTCCATAAGCAGATGCTTTGTCTCAATTGCCATGT 
GAAGTCTTTTGGTTTTTGGTTTTTTTCCTTAATGTTAAGCTGTTTTAACAGTGAAGCTGAATTTTCTGGAAAATGCTGTATGGTTCGAGCCCCCAGT 
TCATGTTTCACTTTACAGCCATGCATGCTGATGCCTTCCATCTTGACAAATGGAATGATGGGTAAACACTCCTGACACAAAGAGGAACACCTTCCTC 
GTCCCTTGCTGAAACAAAGTG<ZAATGCACACAGTAGAATTTAGTTTGCTTGCACTGGAAGATGCCGTTGAAGAGGTAGGCTGAGGAACACTGTATCC 
TAGGGAGAAAATATGCAAGTCTCACGGGTGCTTTCTGTTGGCTTGTTTC^
AGCCCTCAGTCGTTGTCCTGGACAGAGTTGGTAGACCCCACCTGTAGAAATCCAGATGCACTCTGGGATTGGAGCACAGAGAGGCTTGCTCTATCGT
GAGAATGCCCAGCCTGCAGAGGCCCAGGTGCGGTGCCTGAGTGGCTGTATGCCAGACACAGGAACGCCAGCAGGCTGAGTGTGGTGACTGATATGTG 
AGGTGCCCTTGCCCTGCAGACAGGCTGAATGTGAGCCCCAGCAGCAGAGGATTCTGAGAGTCAGGGTGGTGTCATGTCCTACACAGGAAGAGCCTTC 
AATATTACTTTTTAAACGAACTTTCCCCATTGCTGATATTTTGATAAAAACCCACA(^GGTTTAATTACATGCCCCCTCATCCCTCTCCACACaGT 
CTTGCCCATCCCACTGGTGAATTGTTGTTGTTCTTGTTGTTGTTTTTTGT^ 

ATCTCCCTAAATCACATGGCTGCTGCATACATTGCAATGTTGCCCTACCACATGTATGTTTTTGCAAGTTGCTTCTTCACTCATTGAAGAAGCCTGC 

ACCGTAAGTGAAACT CACTT AGT CC TTGCTAATTGCTGCT CACTGTCACGTG AACGCAG AGAAGGCTGATTTT CCTGGA CCCATTTTCAGTGC TTTG 

AGGAACATGAGTCCAGAGTATATTGAAGTAGAAGACAGTGTTTGGCAAGCCAGTTCTTATTAGACACTGCTACAGGCTTCACAATCTTCAAGGTGGA 

CCCACCTCATGTGGAAGGGCATCATTGCCCCTCTGCTCCCATGCGCTGTGCACCCCCTTCACAGTGAAAGACCTGCTGGGGACCAGGACTCAGA 

CTCTGGCTTCGAGCTCATATCTTGGCTTCCTGTCATGTCTAAGCCCGACAGGAAAGTGTGGCACCATCCTGCCTGCGGCGTTGGATGGATAGGCCAT 

GGATTTTGCTGTAAACTTGCTTCACAAGGAAAGCTGTGTGTTTTAAAATATTTGATAAATAAAGGATTTCTAGAAAGATTGACTGAGGAGCATATTT 

AATACCATCCTCACAGCCCTTGAAACACTTCCTCTATCTGCTGGGTTCTTGGTGAGGCTGTCAGTGGCCTACTTCCTTCCTGGCTGCCTGTGAGTCC 

CTGAAATCAGATCTCTAAAGTGTTCAGCAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGGAGGTTCTAACTCATGGCTC 

TCACCATTCCTTCCAGCATACTTTTCTCAGATCACTATCCCCCACTGTTCTGGCTGTTTTGATCCTTAGTCCCAAAGGAGAGGAGGCAAAGCAAGAG 

AAAGTATGAAATCGTTTCTCAGTGGCACAGAGATGCCACTGAAAGGTGAAGTGCTTAGAAAGAAAGGATGTTCTATCATCATGAGAGAGAGCTGGCC 

AGGTGAGCAGGTGGGCAGAAGTCTTCCAGGGAAGAGCAGGGTACCT^ 

GTAGACTTGGTGTTTTTTTTTTTTTCTGTTTGTTGTGTTTTTTTTTCATTTTTAGAAAAAAAATTTAATATCTAGTGTGTGAGTTTACCTACTTGTA
GGTACATGCACATGCAGTACTTGTAGGGCC^GGGAGGGGGGTCTGATCCCCTGTAACTGTAGGTACAGGTGGTTGTTAGCCACATACGCGAGAACCT 
GGAACTCCAGTCCTCTGGAAGAGCAACAAGGACTATTATCCAAGTTGAGCCATCTCTCCAATTAGGGAAAATATTTTTTAACTTGTTTTAATGGTTT 
GTTTTATACTTTGATTTCCTCTGAGTTTGTTGTCATGAGGGTAGCATATAGTGAGCCTGCACTCTGATAGGAGAGATTCCGGGGGAGAGGCACAGGG 
TGTCGGTAGACTCTTGGGTATCCCTAGGAGAACCTTGGCTGAAATTCCTCTCTGAGAAGCCAAGTCAGAGAATGCCGTTCTCAGCTGTATCACACAC 

ACTTCCTGACATGGCTGTGAACTTCCGTATCTTCCAGTGTCATAGGAAGACTGTCAGCCCAGGATAGGACTGTGGCCTGTGAAAAGCTCAGAGATTC
AGAGATCTGAGAAAAGCATTTAGCAAGGATCTAAACAACATATTGAATTTACTAAATAAAAACCAATTGCTCTACTATTCCTACTTTGGGAGCTCTG

TGTGTCCCATAAGTATAGATGGGCTCAGGCCAGCATGCACAGCCAGGATTTCTCCAGAAAGTTCTACTACTTGAACCTAAGCTTGTTTGAAGAGATT

TTGCTGACTTTTGAGTTCTTACATGATATTCCTATTCCCACAGGGCTTCTGGTCTTAGACTTTTCTTTCCAGTTACAACTTTGGTTGTTGGATTTGG

TTTTCTATAGTTGGTAGTATTAAGACAAATACAGGGTTTTTCACATGGGCIAACACTCATCCAT^

CATGGCGTCTCCTACATGCCTTATTTCTGGGAAGGGACTGCAAGGTTGGTTGTAATATCTCATGTTTTCTAGCAGAAATGTCTTTTAATAAATGTCT

AGTTAGGAATTATAGGGCACCC^GGATGTTTCTTCTACAGCCTAAGCCAGGGGCCAGAGGATGTCTACACAGGTACCCTGCTCTTCCCTGGAAGAC 
TTCTGCCCACCTGCTCACCTGGCCAGCTCTCTCTCATGATGATAGAACATCCTTTCTTTCTAAGCACTTCACCTTTCAGTGGCATCTCTGTGCCACT
GAGAAACGATTTCATACTTTCTCTTGCTTTGCCTCCTCTCCTTTGGGACTAAGGATCAAAACAGCCAGAACAGTGGGGGATAGTGATCTGAGAAAAG
TATGCTGGAAGGAATGGTGAGAGCCATGAGTTAGAAAGAACCATAGCTGCAAGATGAGAGAACAACTCCATGGTTAACGGTGTGGATCAACACCTGA
AAATGATGGAGATGCTGAGATGGGTCTGATCAGAGCTGACTGTGGGCCAGAACTTACACACACACACACACACACACACACACACACACACGTGCGA

GAACCCCTCACACACACAGGATGGTGGGGCAGAGTTTAGGTACTAGAAGCTCCTTCAAGGGTGCATCTTCAATAATAATCTTCAATTATCTAAC^ 
TCTCACACTAGACCCCACCTATCAAAGGTTGTACTTAAGGTTGTATCACCTCCCAGTAGTGCTCATCAGCCACAAAAGCTATTAACATTGGTATTTA
CTGTTTGTGTCTATGGCATATCACTGGTCATTCCTTTTTTATTTTTTTAACTTTTTATTGATTGTTTATGAATTTCACATCATGTACCTCAGTCCCA

CTCATCTCCCATCTCTTCATATCCACCCTCCACCCTTGTCACCTACCCGCCTCCGAAAAAAAATTTTTGGAAGCTGCAGTGTGTCACAGTGTGTCCC 
ACAGTATACCCTTTTGTCCACACTTCCTTATATGCAAATGTTCATTGCAATGAGTCATTGGTCTGGTTCAAGGCCTCTGGCTTCTGTTACACTGTCA
ATACTGGATCCTCACCAGGGCTCTTCTCATATAGAATCTCCTCCTGTTGTCCTATGTCATGGAGATCTTGAAGCTTTGGGCCTGTAGTACCCTCTCC

TTCACATATACCAGCAGTTCATAGATGGCATAGAGCTAACTCAAAACCCTGGATCTGGGCCTGGGTGGTAGCTAAGTTGGTCAGCATGCCAGCTCTC
CTGCACCC^raCCACCAGGGCAAGCTCTCCAGGAGTACCCTGGCTAACTCACCCA^

CCGTTGAAGCTGACCCACACCAACACCAGCAGAGCCAACTCTATTCTGCTGCTCAGGCAAGGGACAGGACTCACTTTCTTTAGTGTTATAGCTAGTG
AGGAGCAGGGCCAGCTCTCCTGTTCTCATGGCCTTGTGGCCAGCTTCCCCTTCTACCTTAGGTGGAGAGGGGTAAGGTGGAAGAGGAGGGCATCTCC
TTCTTGCTCATGCCACCCCACAGTAGACAAGTTGTGGGGCCAGCTCTCCCATGCTCATACCCACAGGGCTGGCTCACTTGCaCCTCTGCTGCCAGGG

TTGACTGTACCGTGCTGCCCTGGTGAGGTGCAGGGCCCACTTTTCCAAGTCCTACAGCCAGTAATTGTCAGGGAACAATGGCCATTCTAAGAAACAC 

ACATCACTGGCAAAGTCAACCACTGTGACATCAACAGGATACAACCra^ 

TGGCTTGAAAGCAAGTCTGTAGTGCATTTTCTTAATTAGTGATTGAAGTGGGGAGGGAGTTAAGGTCCTGGGGTCCTGGGTGCTATGGAAATCAGGC 
TAATCAAGCCATGGAGAGAACAAGGCAGTAAGC^GCACTCCTCCACAGCCTCTGCTT<^GTTCCTGTCTCCAGGTTCCTGCCTTGAGTTCCTGCCCT 
GACTTCCCTGGAAAATGAACTACAAGCTATAAGCTGATATAAAACCTGTCTTTCCCAAGTTGTTTTATCTGTCATGTTTTATCACAGCAATAGAAAC
CCTAACTAAGACTTCATAAACTAAGAGTCAAAGGATAGAAATCAATCTACTAAGCAAATGGAAGCTAAAACAAAACTGTTAGAACCATCTTCATACC 
TGATAAAGTTGAGTTGAAACCTTCAAACCAAAAATAGCCAGAAGAGATAAAT<^GATCACTGTAACGTTACTGAAAGGACAATTCTTCAGGAGCAAA 
TAATAGAGACACTACCACAATACTCTTAGATCTGATAAACATTTTTAGCAAAATAACTGTATACAAAAGCAACATGCAAAAACCAAAAAACAAAAAC 
GCCATAGCTTTTTAAACATATAAAAACAAGAGATGACCTAAGAAAGAAGAAAATAGGATAACTACACAAAAATACTAGGGCTCATGTTCATTAAGAA
ACCTTAAAATGCACAAAAAACCGATCCAGGTAQ^CAATAGTGGAAGATGTCAATAATT^ 

GTTTTGCCAGATAGGGTTTCACCATGAAGTCTTGCCGGACTATAACTTGCTATTCGATCAGACTGGCCTCAAACTCACAAAGGTCCACAGGTTAGGG 

TTAGGGTTAAGTTCTCCTGCCTCTGCCTCTGGATGCTGAGATTAAAGACTGTGACACCATGTCTAGTCTCATCCTTAGATAGGTCTCTGGGATGGTC 

AGCTTTAATTGCC^CCTAATAAAGTCTAGGGTCAACATGACCTGAGAGGAGAGTCCAGATGAGTGACTGTCAACATGGAGTGTCAACAACTTTGGG 

CAAGCCTGTAGAGAATATCTTATGTAAGTTAATTGATGTAGGAAGACC^AGCCCACTGTAAGTGGTACC^TCCCCTTGGCAGGGTTCCTGATCTGTT 

TAAGAGTGGGAAAATGAGCTGAGAACCAGCAGGTATGTGTGCATTCATTTTTCTCTCCCCTCAACTGTAGTTTTAATTTGACTTGCTGTTTGAGTTG 

TTGTCTTGGCTCTCCCAAAATGATGGACTGCAACCTAGAATTTTAAGCTGAAATACAAACCCTTTTCTCTCTAAGTGGTTGTGTGTGAGAGAGGGAG

AGAGACAGAGACGGAGGCACACACACACTCAGAGAGGGGGGAGAGAGGGAGAGGGGGGGAGGGAGAG 

CAGGAACAAAAATGAAACTAGGAGTTTCCAAACTAAAAGTAAGCAAAAAAACTTCACAATTAAAA^ 

CAAACAGACTGGTCCATCTAAAAGATACAAAGTATACATTATTCTCCTTC^ 

TGGATCCTGTTCTCCTAACTGAGCTGCCTTGTCTTGCCTCAGTGGGAGAGGATGTGCAGTAACTTGATGTGCCAGGGTGGGGrGATATCCAGAGCGG 
CTCTCCCTATTCAGAGGAGGAATAGGGAAGGGGCTATGTGAGGAGGGGCCTGGGAGGAAGGGAGGCTGTGGTCAGAATGTAAAGTGAATGAATAAAT 
TAATTAATGAAAAATAAAACAAAACAAGACAACGCACATTATAAACCACAAAAC^ 
ATTATACCATAGTAGAATAAAACTAAAATTAAAAAGTAACAGCAACAAAACCAACACACATTCT^ 

GGGAAGAAAAAATTTCAATCCCTAGAGTCAGATGAAAAGAACTGCTCAACTTACTAGAACTTCTGTGGCACATCTAAGGCATTCTGCA^ 

ATGGTCATATGTGTTTATATTAGCAAAAAGTATCAGGTAGACCTCAAATACATAATTAAATAATGGGGGGGGGGAGAGGGAGAGAGAGAGAGAGACC 

AAATTTGTATGCAGAAGATTGCAAGAAAAATAAAGAGAAGGTCAGAGATCAATTAAATGGAGACTAAGAGAGCAATGCATACAGCCAGTCAGAGAGT 

TGGTATTTGAAGAGATAAACAAGTTTGATAATCCTTGCAGAGCAAAAAGGAGAAAACTCTCAGATGAAGAAAGTAAGAACAAAGCATGGGATCACTA 

GGGAATACTTTGAAAATTAATTTCAAAACCTCAGAAGTCCAGAGGAAATGGATACATTTCTAAACACATATGGTCTGCTAGAGTTAAATCAG^ 
TACAAGCAGCATAACTGGACTCAAAATAACCAAAATAACAACCAGTTGTTGAAGCAGTAACTAAAAATCTCCACACAAAGAAAAGTCTCAAACTAGA
TAGATTCACTGCTTAATTCTACCAACCTCTTAATGAAGATTTAACAGAATACTCTTAAATAGATTCACCAAATAAAAAGAGAAGGAACAAGACCAAA 
GTCATTACACAAAGCCAATAAAATCCTGATTCCAGAGCCAACGAAGGATACAACAACAAAAACTGAAAACTGTAGA 
CAAGCTCTTTAAGCGCTCACCTTTCTTACTCTCTAGCTCTCCTTCTC^ 

TTCCCCCTGCCTTTCTACAATAAAGCTCTAAAACAAAACAAAAAAAAAGCATTCACAACAAAATACTTG

TGCATCTTAATCGAGTTACTTTCATCTCAGGAAAGCATGATTAGTCTAACATATGCAAATCAATACATGTATAAGGACAGAAATCAGA^ 
CAAGAGATGCAGAAAAGGCATTTGAAAATTCTCAATATC^CTAGACATCCCGAAGAAACTAGGAATAGAAGAAACATTATAAAGACCTTGTAAAA 
TTACAGCTGACTAAATGGGGAAAATCCAAGAGGATTTCTTCTAAACTCTAGAACAAGACAAGGTCCT 
TTAAAGTCTTAGCTAGAGCGACAAAGGAGGTGGAAAAATAATGGGAATATAAATAAGGAAGAAGTCG^ 

CTATTTGGAGACAGCATGCATACATTTAAACTCTACTCAAGACTCCACCAAAAACCTCTTAGATATGATTAAATAGTTTCTGCAAAGTAACAGGATA 
CAAAACCATACAAAATCAGTAGCTTTTCTATGTAGAAATATGAAACCACCTAAGAAAGATAAGTTAAAAAAAAAAACTATTCACAAGAGCTTCAA 
AGAGTTAGATGTGAACCTAATAAAGAATATGGATGCCTCTAAAGTAAGGCCTTTAAGACACTGAAGAAAGAAATTAAAATTGATATTAGAAGATGC^ 
AATGTCTTCCATGCTTATGGAATGGCATAAGCTAATATTGCAAAACTAACTGGACCACTGAAAGGGAATATAGTTTAA 

CTAATGTTTTTGACAGAAATTCTAAAGTTCGTATTGAATCATGAAAGTCATTAAAAAGCTAAAGCAGTCCTGAGAAGAAAACAATGGTGAAGGGATC 
GAACTATCTGACTTCAAATTGTACCACAGCACCACAGTGACAAAG 

TTAGAGCCCACACAACTGTAGCCAAATAGTTTTTGATAAAGGGGTCAAAAACATACATATAT^ 

TAGGAGACCCAGATAGCCACATGTCAAAGAATGGAATCGGATTAGTGTGTATCGTACTGCACAAAAATAAATTTGTAATTGATCAAAATCCTTAATA

T AAGGT CTG AAACAC TG AAACTT CT AGAGGTAATT CACTC AAG AC AGAAGNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNim^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNIOTNIOTNNNNNNNIJNNNNITONNNNNNN^ 

NNN NNN NNN NNNNNNNNNNNNNN NNNNNNNNNNNNNNNNN NNNNNN NNNNN NNN NNN NNN1^NNNNNNNNNNNNNNNNNI<7NN2JNNNNNNNTTAAAA C 
TGTCCCCAGACTCAGAAAACAATCACCAACAAAGCCTGGCAAGGACAGGGGTGGAGCAGGGGCTGACTCCTTGTCAGTGTTAGGGCAAGTGTGAACC
CATATGGCCTCTATGGAGATCAGTATGGAGGTTTCTCAAAAACACTAAAAATAGAACTACCAATGACCCAGTCTTTTAAATCCTGGAATAGACCAAA 
GGATCCTAAGTGCAATCAAGAGATGTTTGCCCATCCATGTTTATTGCCGACCTATTCACAAAGGGATACCCTGAGTGAGACAAAGAGGAGACTCAAA 
GGAATAAAGAAGAGGTAACTACATGGTAGACATAGAAAAGATTTAATGGGTTGAACAAGTTATGAGTGGTCAGGGAATGGGCCAAAGTTCATGGACA 

AGAGCTCCCTTCTCTACAGGTCTAATCTCAGAGATTTGTCCTAGGGATGGACCACAGCGGTGGCTGCCTACATCTGCCTAGGTCTCATTGGCCCTCC 
ATCCTTATTCCCTGCGTTTTATGTAACACTGTGACCAAGGAGATGTCCACACACTCATTAAAGACTTCACATAGAAACAAGGATCAGAGTTGAGTGG 
CCCAACTTTGGAGACCTGGGGAAGGTTTTAGAGCTCAAGTCTTTAGTCCAAGAGAATGGATGGATAGACGCATGCTGACTAAGGAGGAGATAGATGG 
CTACCCTTTTGTTCTTACATTGTTCCTGAGATTCGCAGAATGGC^TCTCACCCCAGAGGTAACTGAGTCCAACAGGCAGGGAGCCCCCATCATTTGA 
TATTTTCCAGATTTGCTGTATGCCACCAAGAGAGCACACATAGTGAGAAAGGACCGTGTAGCCCAGGGCACCGGCTTTGGAAAGTGCGTGATGGGAA 
TGGATGGAAAACATTGCTGTGGGTGAACTTTGTTGTTCCGGGTCTTTATTCTGGTGATGCTGTTGTCTTTGTTCAGACATCTTAAACTCAGTTGTTA 
TGTCAGTATTAGTAATTCCAATTTCTTCTTTAATTTCCATGCAAGATAGGGCTTGCCTGTAAAGTGCACTGACAGAGGCAGCGTTCTATATTTAGTG 
AGGCAGATGCTATCTGCAAATTACAGGGAAATGATTGCATCAAGGTCCCTGGTTGGTGAAATAAACCACAATAAAGAAATGAGGAGCCAGAGAGGGG 
CAATGTTCTAGAAGTCACTAAGAACCAAAGTTGCTTGACTCTGACTGGTAATGATTTCTTGGTCCCCAAATGCCCTCCCATGCCCAAGCAGCATGGC 
CTTCATGCCCTTGGGGTTAGAGGAAGAATAATCACATTGAACTATTTCCTCTACACCCTGAGAAAATGATTATCATCTTCACCGGACAGTTAAGTGC 
CTAAACCTCAAACCACCAGGGCAACTGTGTGATTTTAGGAAGGACGAGAAACAGGGCATCTATTTTGTGACTTACCTAAACCCAGGAGCTATCAGAG
GGCTTCCTGACACTTTTAAAAGAAAAGAAAGCGTAACTACAAGGCAAGCATAGGGCCTTCCTGGGCCATGATGGAGCCGAGGGTTCAAGTTTGTAGT
GACCGTGCCATTTTCTTCCCATCCTGAATACAGAATTCACTCTGGGATTTTCATCATAGACAAACACAATTCAGGTCACCAGAGTAAACTGCTTCTG 
TAATGGTAAATTTCCCAGCAGGTCTTAAAATAAATAGATAAATAAATAAATAAACAAACAAACAAACAAATACACACATACATACATAACATACACA
CATACATACACATATATTTTATAATAGTCTGTGCAACCTAATGTTATTTTTCAGCTCACATTTATTTAATTATTAGGAATCCAGGTTACCTTAATTT
TTTTGTGCTTGCTACACACTTTCATGCATACCAGGT 

MOUSE SEQUENCE - mRNA 

GGGACCCTTTGCTGCCCTCAGCTAAGAGTGCGGGTGAGCGCACAAGCCCAGGAGGAGGTCCGCACGCGTCATGAACTTGGAGACGGGCAGCCGGGGC 

TCAGAGTTCGGCATGAGCGCAGTGAGCTGCGGCAATGGGAAACTCCGACAGTGGTTGATCGACCAGATCGACAGCGGCAAGTACCCCGGGCTGGTGT 

GGGAGAACGAGGAGAAGAGCGTCTTCCGCATCCCGTGGAAACACGCGGGCAAGCAGGACTACAATCGTGAGGAGGACGCTGCCCTCTTCAAGGCTTG 

GGCATTGTTTAAAGGCAAGTTCCGAGAAGGGATCGACAAGCCAGATCCTCCTACTTGGAAGACAAGATTACGATGTGCTCTGAACAAGAGCAATGAC 

TTTGAGGAATTGGTCGAGAGGAGCCAGCTGGATATCTCTGACCCATACAAGGTGTACAGGATTGTTCCAGAGGGAGCCAAAAAAGGAGCAAAGCAGC

TCACTTTGGATGACACACAGATGGCCATGGGCCACCCCTACCCCATGACAGCACCTTATGGCTCTCTGCCAGCCCAGCAGGTTCATAACTACATGAT 

GCCACCCCATGACAGGAGCTGGAGGGATTATGCCCCTGACCAGTCACACCCAGAAATCCCATATCAATGTCCTGTGACGTTTGGCCCACGAGGCCAC 

CACTGGCAAGGCCCATCTTGTGAAAATGGTTGCCAGGTGACAGGAACCTTTTATGCTTGTGCCCCACCTGAGTCCCAGGCTCCTGGAATCCCCATTG

AGCCAAGCATAAGGTCTGCTGAAGCCTTGGCGCTCTCAGACTGCCGGCTGCATATCTGCCTGTATTACCGGGACATCCTCGTGAAAGAGCTGACCAC 

GACGAGCCCTGAAGGCTGCCGGATCTCCCACGGACACACCTATGATGTTAGCAACCTGGACCAGGTCCTGTTTCCCTACCCGGACGACAATGGACAG 

AGGAAGAACATTGAGAAGTTGCTGAGCCACCTGGAGAGGGGACTGGTCCTCTGGATGGCTCCAGATGGGCTTTATGCCAAAAGACTCTGCCAGAGTA

GGATCTACTGGGATGGGCCCCTGGCACTGTGCAGCGATCGGCCCAACAAGCTAGAAAGAGACCAGACTTGCAAGCTCTTTGACACACAGCAGTTTCT

ATCAGAGCTGCAAGTGTTTGCTCACCATGGCCGGCCAGCACCGAGATTCCAGGTGACTCTGTGCTTTGGTGAGGAGTTTCCAGACCCTCAGAGACAG

AGGAAGCTCATCACAGCTCATGTGGAACCTCTGCTAGCCAGACAACTGTATTACTTTGCTCAACAAAACACTGGACATTTCCTGAGGGGCTACGAGT

TACCTGAACACGTTACCACTCCAGATTACCACCGCTCCCTCCGTCATTCTTCCATCCAAGAGTGAGAAGAAATACTCTGACAGGGCAGCCGGCTGCT
GCCCTTTCTCTTTGGAAGAGCTAAGAAGTGAGTGGGTTTCCACTTGAAGACAACAACAGGGCTTTGTGAGGAAAAACAGCTGTATCTGCTCAACAGA
GGAGCTTCCCCCAGAAGAGTGCCTGTCATCCAGGTCTTGACTUIGTGCCAGGACTTGGGTG^ 

GTTGTGTTTACATGTACTTGAATGCTGGCTTTAGCCTGGTATAGATGGACTTTTGCTTGAAGACTGAAAACCTGTGCCAGCATGAATCCCTGACAAG 
AGAAGACATACGTATTATTGGTCCATTTCTCAGGGAAGTAAAGTCTAGACTAGAGACTACGCTGTCCACTCACAGAGAAACGCATTCCTGGATAGAG 
GAGCTGTAGGTTGCCCAGATCATGTCCACTGAGTGAAGGGACTTGGTTCTTAGGTCCAGTCTAGGCTATCATGTCCTGATCCTGCACTCTCTCTGAG 
TAGACCCTACATCCTGTTTGCTCCCTCTTCTTGCCCTTCCCTAAAAGCCAGCTGGTGGACTCTTGTTGCCAC^TACTTGAGCCAAAAAACCATCAGG 
GACATATAAGATGAGACATGTTTGCATCTCCTCTGATCTAACAACGGCTACAAAGTTTTTTGTTGTTGTTGTCTCCCCTTTCTGTTTTGCCTCTCAT 
TCTTTAGAATTGTTGCTTTTTTGTAAAGAAAGGAAGAGAGAGAGACCGAATATACTAGAGATAGATTTTTAAAATTCTGTGCCATTTCCCCTTCTAT
TTGTTTTGTTGGCCATCAGTGGTCCCCATGGCAACACTGGAAGGGCGGAGCCTTTGAGCAGGTTGGGAACTGTCCCGGACACCAGTGGTTCAGTGTA 
CAGCTGTAGACTCAGTGAGCTGTTTCTGCTTATTTTGTATATTAGATGCTTCCTTGTGCCAATAATAGTTTGACAGAGTCTCAAAGCTACCGGGCTT 
CTTTTCCAACAACACATTCTTTCACAGAATCCCAAAGAGTCTCACTTGCCTTCAC^CCGTCTGATCCTTTATTGTTACTGTTTTGTTTAGGTGGCT 
TTGCCATGCTCTGGGATTTTTCTCCTACCACAACATCATTCCTGACTCTTTCGGGCTCAGAAAGAGTTAAGCGCTCTTGCACACTTGGCACTTACTT
CTTTTTAATTTTTTTTTTAACATGCCACTCTTAATATTTCTTCTTAAGTCTTTCAGCAGCATACATTCAATAAGCAATGGACCATTGGGTGTCTCAT 
TTCATTGTTTTATTTTTAAAAAGGAACCTTTGACAAATAATACACATTACAGAAAATGGAAGCATTTTTCA 

TTTATGTTCGAATCCTCCTTGCTAGTATTAGCCCCAAGAGTTCAGTCTGTCCTTTAAGCATGTTTTAGTTTTCAATACATATATATGAAGTTGCCCT 
TTCTTAGAGAGAGAGAGAGAGATATGTGATGACACTCTTATGTACCCAACTATACAGATTTACCTCATCTTTTGAGTGGCATGTGGACTTCCACTGC 
TATGTGCCTAGGTGCAGTTTAGACATCTTCTTAGCTACCCATCCCAGCAGCCCACT^ 

AAAACATTCTGGGGTATATTATCCCAAGGGAAGATGAAGGCAAAATCGGGAAGCACTCTTCCTGAGGCCCCAAGCCACAGCTGTCCTCCTGGAGTAT 

CTTAACCATATCTGGCACTTCACCAAGGTCCCTTGCCTGGTCCTGTATGACCATGACTCTTCAGTCCATAAGCAGATGCTTTGTCTCAATTGCCATG 

TGAAGTCTTTTGGTTTTTGGTTTTTTTCCTTAATGTTAAGCTGTTTTAACAGTGAAGCTGAATTTTCTGGAAAATGCTGTATGGTTCGAGCCCCCAG 

TTCATGTTTCACTTTACAGCCATGCATGCTGATGCCTTCCATCTTGACAAATGGAATGATGGGTAAACACTCCTGACACAAAGAGGAACACCTTCCT

CGTCCCTTGCTGAAACAAAGTGCAATGCACACAGTAGAATTTAGTTTGCTTGCACTGGAAGATGCCGTTGAAGAGGTAGGCTGAGGAACACTGTATC

CTAGGGAGAAAATATGCAAGTCTCACGGGTGCTTTCTGTTGGCTTGTTTC

AAGCCCTCAGTCGTTGTCCTGGACAGAGTTGGTAGACC 

TGAGAATGCCCAGCCTGCAGAGGCCCAGGTGCGGTGCCTGAGTGGCTGTATGCCAGACACAGGAACGCCAGCAGGCTGAGTGTGGTGACTGATATGT
GAGGTGCCCTTGCCCTGCAGACAGGCTGAATGTGAGCCCCAGCAGCAGAGGATTCTGAGAGTCAGGGTGGTGTCATGTCCTACACAGGAAGAGCCTT 
CAATATTACTTTTTAAACGAACTTTCCCATTGCTGATATTTTGATAAAA^ 

CTTGCCCATCCCACTGGTGAATTGTTGTTGTTCTTGTTGTTGTTTTTTGTTATTGGTGAAAATTGTGAAATTGTTAAGGTCTCTGTTTTTGTTATTT 
ATCTCCCTAAATCACATGGCTGCTGCATACATTGCAATGTTGCCCTACCACATGTATGTTTTTGCAAGTTGCTTCTTCACTCATTGAAGAAGCCTGC
ACCGTAAGTGAAACTCACTTAGTCCTTGCTAATTGCTGCTCACTGTCACGTGAACGCAGAGAAGGCTGATTTTCCTGGACCCATTTTCAGTGCTTTG 
AGGAACATGAGTCCAGAGTATATTGAAGTAGAAGACAGTGTTTGGCAAGCCAGTTCTTATTAGACACTGCTACAGGCTTCACAATCTTCAAGGTGGA 
CCCACCTCATGTGGAAGGGCATCATTGCCCCTCTGCTCCCATGCGCTGTGCACCCCCTTCACAGTGAAAGACCTGCTGGGGACCAGGACTCAGAGAT 
CTCTGGCTTCGAGCTCATATCTTGGCTTCCTGTCATGTCTAAGCCCGACAGGAAAGTGTGGCACCATCCTGCCTGCGGCGTTGGATGGATAGGCCAT 
GGATTTTGCTGTAAACTTGCTTCACAA 

MOUSE SEQUENCE - CODING 

ATGAACTTGGAGACGGGCAGCCGGGGCTCAGAGTTCGGCATGAGCGCAGTGAGCTGCGGCAATGGGAAACTCCGACAGTGGTTGATCGACCAGATCG
ACAGCGGCAAGTACCCCGGGCTGGTGTGGGAGAACGAGGAGAAGAGCGTCTTCCGCATCCCGTGGAAACACGCGGGCAAGCAGGACTACAATCGTGA
GGAGGACGCTGCCCTCTTCAAGGCTTGGGCATTGTTTAAAGGCAAGTTCCGAGAAGGGATCGACAAGCCAGATCCTCCTACTTGGAAGACAAGATTA
CGATGTGCTCTGAACAAGAGCAATGACTTTGAGGAATTGGTCGAGAGGAGCCAGCTGGATATCTCTGACCCATACAAGGTGTACAGGATTGTTCCAG 
AGGGAGCCAAAAAAGGAGCAAAGCAGCTCACTTTGGATGACACACAGATGGCCATGGGCCACCCCTACCCCATGACAGCACCTTATGGCTCTCTGCC
AGCCCAGCAGGTTCATAACTACATGATGCCACCCCATGACAGGAGCTGGAGGGATTATGCCCCTGACCAGTCACACCCAGAAATCCCATATCAATGT
CCTGTGACGTTTGGCCCACGAGGCCACCACTGGCAAGGCCCATCTTGTGAAAATGGTTGCCAGGTGACAGGAACCTTTTATGCTTGTGCCCCACCTG 
AGTCCCAGGCTCCTGGAATCCCCATTGAGCCAAGCATAAGGTCTGCTGAAGCCTTGGCGCTCTCAGACTGCCGGCTGCATATCTGCCTGTATTACCG
GGACATCCTCGTGAAAGAGCTGACCACGACGAGCCCTGAAGGCTGCCGGATCTCCCACGGACACACCTATGATGTTAGCAACCTGGACCAGGTCCTG 
TTTCCCTACCCGGACGACAATGGACAGAGGAAGAACATTGAGAAGTTGCTGAGCCACCTGGAGAGGGGACTGGTCCTCTGGATGGCTCCAGATGGGC
TTTATGCCAAAAGACTCTGCCAGAGTAGGATCTACTGGGATGGGCCCCTGGCACTGTGCAGCGATCGGCCCAACAAGCTAGAAAGAGACCAGACTTG 
CAAGCTCTTTGACACACAGCAGTTTCTATCAGAGCTGCAAGTGTTTGCTCACCATGGCCGGCCAGCACCGAGATTCCAGGTGACTCTGTGCTTTGGT 
GAGGAGTTTCCAGACCCTCAGAGACAGAGGAAGCTCATCACAGCTCATGTGGAACCTCTGCTAGCCAGACAACTGTATTACTTTGCTCAACAAAACA 
CTGGACATTTCCTGAGGGGCTACGAGTTACCTGAACACGTTACCACTCCAGATTACCACCGCTCCCTCCGTCATTCTTCCATCCAAGAGTGA 

HUMAN SEQUENCE - GENOMIC 

TCCTGAGATCTTATCTTTGATATAAGTTAGCTCCATTTACACTTAGCAGATAATACCAAATTAAAACAGAGACTGAATTCAGTGAGCAGAGAAAACA 

GCCTAGCCCATGGATTTGTTCTCAATGCACTGCCTCCTGGCTTTGGTGATTTTGCAAAGAACTGAAAGAAAAAGCACAACATTCAGAATAGAAAGGG 

AATTTCTGAAGGGAGTTTCATAACTGAAATTTCAGTTTGAACTGGATGAGGGACGCTGTTTAGTTCAGTGACCACTGAGGACCTCCGGCAGGCTTCG 

GGGTGTACAAAGACAGCCAGGATTCAGCCCGCCTCAGTGGAGGGCCCCACAGCACAAAGGGACCAAGAAGGGATAGCAAGAGGACAGCCAGAGTGCC

CACAAGCTCGGCAGGAAGGAGGCTCACATCGCCGCGGAAGTCTCGTGATGGTTCTGGTAGAATCGTCCTCTTGAAACCTTCTGCTGCCATTATTGGG 

TTCTGTCCAATATGCCTTACGCAAAACACTGACAATAGCATACACCTTTCCAGAATGCCGGCACGCGCTGTGCAATGCCCAAACTGTCCCTGTGAGT

TGAATGCCCCTGGCAGGATGCGTGTCCACTCCCGCAGGGCCGTGGGTGCCTCCCTTGTCTCACTCCTCTGCTTCAACCCTTCCCAGAACTCTGAAGG 

TTTGAGATGAAATGACTGAGCCACACCTAGTGCAGGAATGAGCACCAGGGCCTCTCTGCAACGGCTGCTTAAGGGCCACAGGATAAGCATAGTGTGG 

CACCCCCAGTTATTCCCTGTTTCGCTCATTGGACCACCCTGATAGGCTTCAGGGAGGCAGGAACCTCCTCTAAGGCAGCAGGAGAGGGGTGGATGGG 

GCAGCCAACAGGTGCATCTCATGCCTCCGTGTTCTCCATAGGGAGGGAGGTGGGCTAAGAGGCGAGTGGGGCAGAGGCCTGGGCCAGCGTGAAGTGT 

CAGGAGTAAGGGGGGAGAAGACTGGAGfGGGGCAAATCCAGGAGGCAGCTGAGGCTGCACTGTGCAAATACAAGCCCGGTTACGTCTGTGTAGCTTAG 

CTTCCCCAGCCCACCTGAGCCAGAGCTGGGTCTGCCCGTGCTCCCGCTTCTATTGACATCCAGCAACTCCAAATGACCTCTCCCATATCAATGGGCT 

CATCTCCGAAGCCGAATCTTATAGCCTGCTTCTCCTCCCCACCAGGCTATAGTGGGAAATCATTGAGGAAACTATGGTTGAATGCATCGCACGGCCT

CTCTGGTGTCTGAGAGAAGCTTCATAGCCAGCAGGCTGCCCCTGGTGGGCAGGrGCCGCATGGAGGTACCACGCCTCCCGTGCAGCGAGTAAGAAGG 

GCAAGGAGAGCTTGTCAGGGGAGTGGGCCAGGCTATCAC^TCAGATGGGCCAGGCCTCGCATCGCTCTCTGCATCCTAAGCAACGTCT 

TGCGTGGGTTTGTGGTACTGCAATGCCACAGTACCTTGGGGAGCCCCCAGGCAAGCCAAGAAAGTTCTGGGAGGCCACAGCCCTTCACTCAAGTTCA 

AAGGTTCAAAGCAGGGATTCTGTGTCACACTTGCGAAGGTGCCCACCCCAACAGAAGCTGTGGCTCTGCAGGACACGTGCGGGGTAGACCTCTAAGG 

CCTTGTCCCAGTGGAAATGCCCTCCCACTTGGCCTGGGTAGCTGCTCACGCTTCTGAAACGGGAAAAATCAACTGCATTCCTGAATGGGAGCCTCAG 

AGCATTAGAAAAGATAATGTTCTGATTTTTTAAAGCTAAACGTATATGGAAGGAAGCTGAAAGAGGACTTTTGATGAGTCACCTAATGTTGATGTTG 

AAATAAACCTAATGACCCTCTGGCTAAAATCTGTTCTCATTTGGGAAGCACCTGAACCCTCATTCTCCCGCCACTGCTCTGACCTCTGGAAAGCAAG 

GAGCAAAGGCACACTCAGCAGCTCCCGCTCCAAAATGCTCTCCCGAGTCCTCCCAAGCAGCCAGAGGACGACTCCACACCCAGAACCCAGAGTGGCC 

TGGGAACCAGGCCTCAGGTTCCTGGCCCATTTGTCATCGCCCAGGGTCCTC^ 

TCTTCCATCAAAGTTCTGCTCAAATGTCACTGCATGTTACGTCTTCCTCACACACTCCCACTAGCCTTACTGCATCATGCTTGGTTTGCTGAGCTAT 
GTGTGGGGTCTTTCCTTGCCAACCCTTCTTTCCCTGAGAGGTGCAATGACCTCATATTGAGGCTGTCCACTCCCCCACTCCCGACCCCATCCACCAA 
GCCTGGGAGGCACTGGCTTAATGCATCATGGTTAAAAGAAATGGATGGTTGAATTGCACAGAGAAGACACTGTGCCCCAGGAGATGTGCCTTGCACA 
AGATCCCTGCAGCACATGCTGGGAGGCTCTGCCCGGGTAGCACTTAGGAAAGAAGGATGTATGCCCTGGGCTGCTGGGAGCACTGTGGGAACTCTGG
GATCCCCCTCCTCCTAAGGGCTGTGCTACCCTGATTTGGAAACAATACAGAGGCTTCTCTCCTGCAAGAAAACATGCCCCCTGCCCACCCAGGATCC 
ACCTGGCTGGGGAATCTGCCTAGCTGGGGACCAGGTGGGCATGCTAAGGAGGAATGGGGCTGTGTCTGAAAGAAAGAGATGCCAGCAGGATCCAGGA 
GAGCAGTCACGGTCCTGGGGTCTGAGAGTGCTGGGCCAAGGAGGTGTGGGGAGCATAGATTCATATAAGAGAGAGTGGTTCAGTCTGTGGCCATGCT 
CCTGGGACACAGCACTGAGCACCCAGCAAGGCCCCAAGAGATGCTGAAACACACAGCTAGGATGGCTCCCAGGAACGTGAGACAGCATTAACCACAG 
GAAGCCAGGCAGAAGTGCTGCAGCCAGGGCGTGCTGTGGGAGAAGGGCTGGGAGGATGCAGCTCGCTCGCACACTGCGTCAGAAACCTTCACATAGT
GCTTGTCCTCAGTACGAGCCAAGGTGGGGGTGAGGAACGGCCCACTTACCAAAACTCTCAGAGACCCCTTGGGGAATTTGTGCTTCTTGTTTCTGCA 
AGTTTCCAGGAGACACAGAAAGGGTGCCACTAGACTCTAAGCTGTGGGAGGCCTCTGGGAGGCTCCTTGGGTGGAGCTTAGCAGCAAGAAGGCATGT
TGACCTTGGTCATCAGGAGGTGTGGAGCTGCTGCTATACAATGGGGCAGGACCCCGGCAACCCATCC^ 

GATGGAAAATGGTATGTGGAGGTGCAGCACCCCTGGACCAGGAAGGGCACCGTGACCAGTTCAAAGCCCTCAAGGATGAAGGCTTGGGTCTGGCCAC 
TGAGCTGTAGTTGAAAGGTGGGAGCATCTAGAAAGGACAGAGCAGGAGGCGCTCACGAGTCTTCACTGGCGTTCCAGCTGCAGGGGCTGAGGCTGGA 
ATCTGTTCCACTAACTTTCCTCTTCTAAGTTTTCTCCAGGATAAAGTTGCTATTTGAATTTCAGATAAACAACAAGTAGTTTCTTTAGTGTAAGTAC 
GTCCCAAATATTGCATGGAATACACCTATACTAAAACAAGTTATTCAATGTTTATCAGAAATGCCAATTTAACTGGGAGTTTTGCATTTTTATTTGC
TAAAACTGGCAACCCTCCTCCAGGGATGGAGGCCCACTGGAACCAGGAG^ 

GTGTAGACTTGGGTGGATGTTGCTGTGCACTGCTTGGATTCCCACTTGGGAGTGAGAGGTTCATTCCCTCGGGTACTGAGACTGCTCCTGGGAGGTG 
GTTCTCAGCAGGCAGCCTTCTGTGGCGATGGCTCCAGCTGGAGAAAACGAAC^ 
GCTCAGTCAGGGCTGCGGGGGCCCTATCCCTTCACTCCACGCTGGAACAGGCTCG 
TGGCTTGGCGTTGCATCACGGGCACCTCTCGGCX^CAGTTCTGCTCCCTTCCCCT 

CATCTCAGAGTCTGCCACCGTCCCTGTTAAAGGCTGAAGGAGGACTAGGATCACTGTCAACAAAAGCATGACCCAAGAGGCAAGACACAGTCATGTT
TATAGCATATTACTATGAGCCACTCACTACTTCTAGATGTCACGGTTCAAAGGTGATTGAAAGAAACTCTGTGTTTCCCAATGCCTGCTGGATAAAT
AACTCAGCATGGATGGTGAGACAGCATCTCTAACTTCGCCTGTCTCCCCGGTAATCGTGTTCCTCCTCCTCACTTCTTTATCAGGGTTCTTGGTAAT 
AGTAGTCCCCCCTTATTCTCAGGGGATAGATATGTTCCAAAACCCCTAGCAGATGCCTGAAATAGAAGACAGTACTGAGCCCCTTATAGACTATGTT 
TTTTTCTGTACGTACATAGCTAGGATAAAGTGTAACTTATAAATTAGGCACAGTGAGAGATTAACAACACTAATAATAACACCAAGCAATGATGACC 
ATACGCTGTAATGAAAGCTGTGGATGTTGTTGCTCTCTCCAACATCTCACT^^ 

TGGGCCAGCAGCATATACAGTGGGGTCTGCTGGACAAAGCAGTGGTTCACATCCCAGAAGGGATGAAGCGGGACCCTGCGCGATGTCACCACCCTGC 
AAAGAATGGCATGCAACTTCAAACACAAAGTGTTGATTTCTGGAATTT^ 

GGAAAGGGAAACGGTGGATACAGGGGACTCCTGTGTCATTCCCCATCCTGGAAACCCTCCAGCAACCCCTGACTCCCCGACCGCCCCACCCCCTGCC 
GAGCACGTCTACTCAGCCCCATAACTGCTTGTCCTTCCCTCCTCTGCCACCCATGCACCTGCCCGTCTGAGAGCTCTCTACCTCACCCCGAGGCCTC 
CCCGGCCTCCTGGCCATTGTTCTCTCTCGGGCGGTACCCACACTATGGCCAGATAATTCCTCCTTTACTGTAGTTCTTACCTTATTACGGGGGAATA 
TGAGCCAAAGCCATGTAACTCGCTGCCGGGCAGGAACTAACAGAGCCAGCACATGCTTCTCCCGTACCCTTCTCTCCATCAGGTGGCCAGGAATGTT 
CCGGATCCCCGTGGCTTTGCCATGGATATCTCATTTGAACAAGAAGTAACCCCCCCCCCAGAATCTCAGGGGGTCTCGTTACCACAGCATAACCTAA 
CCTTTCTAGAAAGATACACTCCTCTATCCAGCATGCCTAGAGTCTTATGTGTGACAAGTGACGGTATGCTGTTTCACAAAATTTCGCTGAAAGAGCC
GTTGATGAACTTTAGAATAACCCAACATTAGGCGCTGTTACAAATGGGTGGAATTAGACAAAGCCGTTTGCCTTTCACCTCTTGCTTCAGTTTTCAG 
AAAATTGCCCAAAGGTATTGCAAACCATTTGCGTTTTCATTGGCTTGGTTAAG 

TTCTGTAGAGAGCATGACACGTCAGAGAGGTGGGAAGTGTTGCCAGGACTCTTCTTAGAATGCTGGCAAGAAGAGCAGGCGAAAAGGGAAGAGCTGT 
TTATAAACAGGGAAAGGAGACATCACCTCGTATCCTTGGCTTCTCATTACGTCATCGATTACAATCACAAGTCTCATTTCCTACAAGCCTGAAAGCG 
GGTTCTTCTCTGCATGTAGGAAAAAGCCCCGCTTACTAACTCTGCCCTCAAAGAGATCACAAGCTGGTCCTGAACGCCCTGTGCCCTGGGAATGCTT 
ACTGTCGAAGCTCHXSCAAATTTTCTCATTGAAGGCACTCAGGGCATTC 

GTTTATGCTCTGTATTCTATTACCAAAGAATTTATCACTTTTCAAATAACATTCAATTATTTTAAACTGAGTGTGCAGTGAGAGCATTTATAATATA 
ATCTCATTTCCAGAGTGTTCCATCCCAACTCCTCTCTGGTATAAGTAGCrTAGAAGGTTAGATTTTTTTTTTCCAGAGGATTAAACATTCGAGAAGA 
GTGGCTAAGAGGCAGCACCAGGTTTCTCTCATTTCCTCAAAGAGAATTTAGCCTTGTGAAGAAGAATAAAGTGAAGGTCAATGGTTAGATCCATGGC 
TTAAAAATAAAATGCAGCTTTCCTTACTCTGAATTTACCATAACCAAAAAGGCCACGAGGCAGCCATTAATGATCCTAAGTTGCAACAGAAAGAAAA
AAACACCTACTTAGGGTTAAACGAGGATTCAGTGGTCACTTGCCGCCGGCCTCCACTTCACAGGGGCGGAGGATGAGGCCAGGGAAGACTTTCTTAC 
CCACACTGACCTGACCAGAGATGAAACTCCCCCTCCTATGACAGCCTGCACTC^GAGGCTGATTCTGAAATAGTCATGCCAAATGCAGCCAATCCCT 
GTGTACAAGGAGAAAAAATAGTTCCTCAGAGCTGTGTCCTTGGGCCAGCCATCTGGACCTCTCACAATCTCTTTCGTTCTCCCTACACTTGGGCTGA 
ATCCCGCCTTCCTTGCCTGCCAGGTTTGGTTTGGGGGCCAAATGAGTGAAAGTCCGTGAGATACCCCTAAGTATTGAAGTGCGTAGTAAATGATAAT 
AGCCCTGATTTGTTGAACATGATGATGTGCCAGGCATTGCGCAAAGCATCATTTCATCATCACAACAAACACTGCTGGGAGGCAGGGACACCCCCAC 
CCTCCTTGTACACCCAGAAGTAGGACTTTGGTTAAATGGCTCTCCCAAGGCCTCACAGCCTTAAAAACTCAAGCACTTATTTAATCCCCAATGATAC
TACCTTTTGGCCATGCAATGTGTCATGGCCAGTAATTAGATTTGCAAACACATATAACCAAAGAGGATTAGAAGAGACATATATTCTATATAAATTA
CATGAAAATAGCACTTTCAGTATTTTCCTGATTTTTTCTGGTAATTAAAAATGATTGCCCTTTATATTTCTTTATGGATACAGTTGCATTGTTTTTA 
GCCAATTGTAGCCCCCCCCTTTAAAGAGTGAATTAGTGGGAAAATAGTGTTATCCAAGATTTTAAGAATCTAGTGTGTGTGTTTGTGTGTGTAGGAT 
GGGTTTTTCTGAAATATTTTACTTGCATGTGAATGTATAATTTTAAAGATTGGCTTTTAAAATACTGAATTGAGGTAAA 

CTCTTTACGATAAATGTGAAATTGAAAATGGAAATGCTCCATTCAGATGGTGTGCCGACTACAACCTCAACATACGGATAATTCACTAAATAAG 
CTTCCTCGTGGAATATCACGGTCAGCCTTTGTTTTGGGTA 

ACTGATAAATATCCAGCCTTTTACTAAGGAATACCGATTTTCTCAAGTCTCAGTGTGTGGAGGGGGTGGGAGTTTGGTGGGGAAGCTTTTCAATTTG 
AGAACCTTTGCCGTTTTTTGTTTCAATCGTGACTAAGATATAAAAGTCCTTGGTCACCACTGCCAGCTGCTACCTGTTCACTGCCTCCCTGACGGGT 
AATGAGGCCATCCGGAGTTTTAGTGATTAAAATGTTTTATTACTTGTATTCACATCATCATTAAGAAAAATAACTTCTAGTGGATCAATATGGCAAA 
TGAGTAAACCAGAAGATGGCGTTGAAGATTAGCAATTACATTTAAAAACAA 

GAACTAAAAGCAGAGGATGTGGATGAGTGAGGGGAGGGTATGTTGAAAATGACTAAACCCGGTGCAGAGGACACGTATACGAAAGTGCAAAAAGCTC
TCATTTGCCATTTTTTGTCCTCAGCAAACTCCTCTCTGGrATGTTTGTGAACGTTTGAGGGAAAGGTAACACTGATTAAAATATAATGCAACAAGAA
GGCAACGATCTTTGCTTTGTTCACTGATGTGTTCCAAGTGCCTAGGACGGTTCCTGACAAGTAGCAGGTGCTCAAAATACATTTGTCAAGTGAATAA 
ATCTGTGAAAGGAAAGGAAGAAAAACAGGTTGTGACCAGAACTTTTTGTGAACAAAACCCTGTTGTTTATTACATAGACTTTGGTAATGGAAAACTT
ACTTGGGTGAAATAAAAAATGGCATCCTGTTAATTTCTGAGAACGTCTGTTGGTTTTAGGAAAAGCATACCTATCCATACCTCACTACCTGTTTTGC
TCTAGCCTAAACATAGATTCTTCTTTGGTTAGGAGACATTGACCCAATATATAAGATTAAAACAAAAACTGAGAAATAAGTGAAATCAAAAGCTTTT
CAAAAGTTGCAGGTTGACCTACGGTGGTGGTGGTCTAAAATCATATAGAAACCAGAGATCTGATGGTTTACCTGAAAATGCGAAGTAAATGCACATA
CTCAGCATCTCAGACGATCGAAAGCTCAACGGGTGAAAGCTCAGGGGGTTAATTTT^ 
CCACTTAAGTATTATTCCTATTTTGGGCTTTTATTTATTTATTTTGA 

CCGGGTTCAAGCGATTCTCGTGGCTCGTCCTCTCAAGTAGCTGGGGCCACGCCCGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCGTGT 
TGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGCGATCCACCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCTTGAGCCACAGCGCCTGGCCTA 
TTTTGGGCTTTTATACCCCACTGGTAAACTGCTTTCCTCCAGGTTGAGGTTAAAACGACATCATTTTAAGGTGAACTGAAGTCTGGAAGTGATTAAG 
CACTTGGATCCTTAGGGAGCCTCTCCCCGCCCCCATCTCTTTCATGCTAAGATAATTAAAACTTCGGGGCCGGGGCATTGTCTGAGTCACTTCAATT 
CACCAGCCTAATAGATGCAAAAGGATGTAAGCATGTCAGACACGCAGAGACAGTATTTGAATCAAGCTTAATAGCTCAAGGGAGCTGGGCCATTTCC
TATTTTCTTTTTAGTGAGTGCGATGTTCTCTAAACACCGCGGAGAGGCAGGGTTCCCGGTGATGGCCTTGCCGAGGGTGCTCCCGCAACCTCCACCT 
CCAGTTCTCTTTGGACCATTCCTCCGTCTTCCGTTACACGCTCTGCAAAGCGAAGTCCCCTTCGCACCAGATTCCCGCTACTACACGCCCCCCATTT 
CCCGCCCTGGCCACATCGCTGCAGTTTAGTGATTGACTGGCCTCCTGAGGTCCTGGCGCAAAGGCGAGATTCGCATTTCGCACCTCGCCCTTCGCGG 
GAAACGGCCCCAGTGACAGTCCCCGAAGCGGCGCGCGCCCGGCTGGAGGTGCGCTCTCCGGGCGCGGCGCGCGGAGGGTCGCCAAGGGCGCGGGAAC 
CCCACCCCGGCCGCGGCAGCCCCCAGCCTTCACGCCGGCCCTGAGGCTCGCCCGCCCGGCCGGCCCCGGCTCTCGGCTTGCAAAGTCCCTCTCCCCA
GTCCAACCCCCGGCCCCCACAGGCCTCGGCGCCCCGCCCCGCCCCAGGCCCCGCCCCAGAGAGTTCTATAAAGTTCCTCTTTCCCACCTCGCACTCT
CAGTTTCACCGCTCGATCTTGGGACCCACCGCTGCCCTCAGCTCCGAGTCCAGGGCGAGGTAAGGGCTGGAGTCGGGCAGGAGGAGGGGTGTGAGGC
TGATACCAGAGAGGACCCGGAGCGCGAACCAGAGGTTCGACCTCCAGGGCAGCGCAGGGTACCCCGGCTTCGGAGCGGGAAGGGAGCGCGCCCCGTC 
CTGGAGCTCCGACTCCCACCCCATCTGCGCTGAGCCGGAGGCGCGGGTTTGGGCTCCAAGGCC^ 

CCCTCCGCGCTCCTGCGACGGGGTCGCCACAAGCTGGACGGGATGAGCTAACCGGACTGTCGGGGCCCCAGGAGTGGCTGAGGCGGGGCCGTCCAAG 
GCACCCACACAAGACGGCACAACTGCCTGCGAGAAACAGGCCCGGCCCTGTGGACCCCAATCCTAGGCTCCTTCCCCTGCTCTTCGTTCCTAAG 
CCCAAGGTCACGGCGGCCTCCGGCGCGGTGCTCACCCGCTGGCGCAGGAGGAGGAGGAGCTCCACATTTGGGTCGCTCCGAGCCTTGCGTGCGGTGG 
CIOTNIOTNNNNNNIWNNNNNNNCTCC^ 

CGTTTGCAGAGCCCACTTAGTGCGCGCTAGCTGGGCAGGGATAGGGGTCCTATTCGGGGCGAAGGGTCTGGATGCGAGCAGAGAAAGCGGAGGGTGG 

AGGAACCCGGGGCTGCGCCCCTGGAACGCCCGGCCGCAGGCGAGGTCCTCCGCGCGTGGAGGCCGCCAGGGGAGTGGAAACTGACAGAGTCGC 

AGGGGCGAGAAGCGGGTTGGGAGTGAGCGAAGGCAAGCGAGAGCTGCGAGTGAGTGCGGAAGGAGGGCCAGGAGGGGTGGCGGCTGGGTGGGGAGAG 

AGGGTGCAAGACGAGCGGCGCGTGTCGGGAGCCTTTGGGCTGCGGGTGCGTTACAGGAGAGCAGGCGGGTAGGAGCCTTCGCGGGGGCCGAGCTCGG 

AAGGOSGACGGCTGTGCCCGCCCAGGGGATGCGCCCGGGCCGGCCGCGAAGGTGCC^^ 

GCTTCXSCAGCCTCAAAGACTCCGGGGCCTCGTGGTCACTGGCGCAGGGGATCGGGGCGGGGTGCCCGGAGTGCGGTGCCTCGTGGCTGAAGG^ 

TCTTCTCCCCGCAGTGCAGAGCAGAGCGGGCGGAGGACCCCGGGCGCGGGCGCGGACGGCACGCGGGGCATGAACCTGGAGGGCGGCGGCCGAGGCG 

GAGAGTTCGGCATGAGCGCGGTGAGCTGCGGCAACGGGAAGCTCCGCCAGTGGCTGATCGACCAGATCGACAGCGGCAAGTACCCCGGGCTGGTGTG

GGAGAACGAGGAGAAGAGCATCTTCCGCATCCCCTGGAAGCACGCGGGCAAGCAGGACTACAACCGCGAGGAGGACGCCGCGCTCTTCAAGGTCTCC 

GGCCTCGGGAGCCGGCGGGGGCGCGCCGGGGAGGGCCCAGAGACAGAGCCCGGGGTCCCCGGCGCCGCCTCCGAGGCGAGCCCAGGGGACCGCGCGG 

GGCGGACGGGCGGGCGGCGGAGGCATCAGGTGGCGTCGCCGGAGCCGCAGGAGGAGGAAAGGAGGCCTCGGCTCTCAGCGGGACCGCGGGGGCCGGG 

AGCCGGGTCCTGGGCGCGTGGAGGCTGCAGGGAAACCGCTGAAGGCCCGGCCGGGCCCGGGGAAGGGCGGCCAAAGGCTTGAGGGGTTTTGCGCGTT 

CGTCCGTGCGTTCTCGTTTCCACGCAAGCCTCCCGCCCTTCCTCCGGGCTCCCGTCTGCCGCCTCCGTCCGTGGGTCCCCCTCGCCCTCTCCGTGCG 

TCCGCGCCTGTGCCGGCGGCTGTTTTCGTCTCTCACCGCGTCTCTGTTTCTCTTTTCGCTGCTTTTCTCTCTGAGTCTCTCTCTCTCCATGTTTTTC 

CTGAGGTCAGCCTCTCTTCTCGCTCCTGCTAGCTCTCTGCGGGTACTCCCACCTCTGTCTTTCTCTTTGTGTGTCTCTGTCTCTCTCTTTCCCCCAT 

CGCAGTGGAACTCAGGGCCTCTGTCTAGAGCTGTCTCCCTTGCCCTTTGCGCGAGTGCACACACGTGTGTCGTTGTTACGATTGTTCTCCCCTAAGG 

CAGTTTACCCAGAGAACTACGTGTCTGGGCCCAGCCCCCACCTGTGGGCAGAGCAGGGGAAGGGGACTTCCTCCGGGAATTTGGTCTCAATTTGCTC 

TCAGAGTGCCTCAGCTGTGCTGCCATCCAGATGTCTCCTGTGGGTGACAGCTCACACCACAGCTGTCCTTAGTCCTAGGCAAGCTCACTCAGACACT 

GGGTGGGTAGAGCCCCCTGAAGGAACCTCCGCATCTCACTCTACCGGTTATAAATACCCAGAAAATGTGTCTTCAACTTGGCAGTGATAGGGTCCAA 

GATGGAAAATCATTTTCCTAGAGCATAGCTGGGGTCTTTCAGTTTACGTTCTGAGCAACGGTGTAAATCTGAAGGACCTATGCGCCATTCTTTCTTT 

TAGAAATACAATTTCAAGATTCCTTATAAAATATCTGTATTTTGGGTTTAGCATGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAAGATGAGGT

GGAAGGATGCCTATGCAACATAGTGAGACCCTGCCTCTGCAAAAAATAATAATTAAAAAAATTAGCCAGGCATGGTGGTGAACACCTGTAGGCCGAG 

GGCCCAGCTACTCTAGAGGCTGAGATAGGAGGACCCTAGAACTCAGGAGTTCGATGCTGCGGTGAGCTATCATCGTGCCACTGTACTCTAGCCTGGG 

CAGCAGAGCAGGACTCTGATTCCAAAAAGAAAGCTAATAAACCAGACATGTATTTTGACTTTTCGTTCTCTTCATTCTTTCCCACCAGGCTTGGGCA 

CTGTTTAAAGGAAAGTTCCGAGAAGGCATCGACAAGCCGGACCCTCCCACCTGGAAGACGCGCCTGCGGTGCGCTTTGAACAAGAGCAATGACTTTG 

AGGAACTGGTTGAGCGGAGCCAGCTGGACATCTCAGACCCGTACAAAGTGTACAGGATTGTTCCTGAGGGAGCCAAAAAAGGTAGGGGCTCTCCTGA 

ATTTGGGTCACCTAAC^GAGGC!AGCCAGATCCTTGAGGCACCTTAACTTCATTCTGAGCATCACTTTCTAGCTTTCCTTTTGTATTGCCTGCCTGCC 

TGCCTTCTGCCTCACAGTGAAGCCCAGCCTCCTTGCCTGAGTTATGTGGGTCACAAGTTGAAATTCCTGTGGGGTTCAGCATAGGAGAATTAATCAC 

ACTGTATGCCTCAATGTATATGGGGGGGGGTGCATTGAATATGTGTTTTCCAGTTAGTCTTTCAAAAAAAAAAAGAAAAAAGAAAACTGGATAACTT 

GTTAAAATTTACTTATCTAATAGTCTTTAGTGAGTGACTATTTTGCAGAAAAGGAGTAATGAGAGGAATATCTCATCACTATGCATGGTAATAAAAC

TGTCAACTTTTAAAGTGTAACATGGTCAGCTGAGGTGTTTATTAAAAGTC^GATTCCTGGACCCCTCTTTTO^GATTCTGACCAGTGTGTCTGGAA 

GAGAGCTCAGGAATCTGCATCTGGAACAAATCCTTC^GGTGACCTCACTTGGGAAACATCTGGCTGGCGGGGTCCTTTATGTCACAAGGCACATTCG 

CATGCCATCTGTCATGTATTTCTTTACCTGTCCTGTACAACTCAGACCTTGTAGAAAAAAATCAGTTTAATAGCATGAAAAAATAACTTCCCAAGGG 

AGTTGCTGAAAAATACCTTATTTGTCAACACCGTGTTATGCATTCTAAGAATTAAATGCTCAGGTATTTTTACAAGATTTGACATTTAGTTAGGTCA 

GTTCCTGTTTTTACGTTGTGCCATTTCCCTTTTCCCCAAACATGTAGGAGCCAAGCAGCTCACCCTGGAGGACCCGCAGATGTCCATGAGCCACCCC 

TACACCATGACAACGCCTTACCCTTCGCTCCCAGCCCAGGTATGGTGGAGGGCACTGGGCTCCCTGAGGGCGAGGCTGTGTGGGCCAGCTGCCCACA 

TGGCCAGAGAACCACAGCAGCCCAGACAGCAGAACTTGCCATTTGCTATGGCTGCTCCAACAGCCCAGAAAAACCCCAGGTCACTGAACGAATGTCT 

CACTTTCCACACGGTGCTGCCATTGGTGTGGATTTTAAGTTGGGGAGGGTCGGGCGTGTCCGCCTGTTGGAATATGCTTCTCAGGTCTTCTGGGAAA 

CAGATGTTTTGTGGAAGTGGAAGATTTTGGAAGTAGTGCCTTATCATGTGAAACCACAGGGCAGCTGATCTCTTCAGGCTTTCTTGATGTGAATGAC

AGCTTTGTTTCATCCACTTTGGTGGGTAAAAGAAGGCAAATTCCCCTGTGGTACTTTTGGTGCCAGGTTTAGCCATATGACGAAGCTTTACATAAAA 

CAGTACAAGTATCTCCATTGTCCTTTATGATCCTCCATGAGTGTTTTCACTTAGTCrGATGAAGGGTTCACTCCAGTCTTTTCGGATGATAAAATGC

TTCGGCTGTCAGTCTAATAAGGGATTCCCTGAGGAGTTTGGAGGCTGTAAGAGCACCCCCCGTCTCAATGCCAGTGCTTCTTATCTCAGCCTCTCCT 

GCACTCCTTTACCCCCGTCTCGATGCCAGTGCTTCTTATCTCAGCCTCTCCTGCACTCCTTTACCCCCGTCTCAATGCCAGTGCTTCCTATCTCAGC 

CTCTCCTGCACTCCTTTACCCCCGTCTCGATGCCAGTGCTTCTTATCTCAGCCTCTCCTGCACTCCTTTACCCCCGTCTCAATGCCAGTGCTTCCTA 

TCTCAGCCTCTCCTGCACTCCTTTACCCCCGTCTCAATGCCAGTGCTTCCTATCTCAGCCTCTCCTGCACTCCTTTACCCCCGTCTCGATGCCAGTG

CTTCTTATCTCAGCCTCTCCTGCACTCCTTTACCCCCGTCTCGATGCCAGTGCTTCTTATCTCAGCCTCTCCTGCACTCCTTTACCCCCGTCTCAAT 

GCCAGTGCTTCCTATCTCAGCCTCCCCGTCTCAATGCCAGTGCTTCTTATCTCAGCCTCTCCTGCACTCCTTTAGCAGGTTCACAACTACATGATGC 

CACCCCTCGACCGAAGCTGGAGGGACTACGTCCCGGATCAGCCACACCCGGAAATCCCGTACCAATGTCCCATGACGTTTGGACCCCGCGGCCACCA 

CTGGCAAGGCCCAGCTTGTGAAAATGGTAAGGAGGATACCAGTGCAGGAAATAGAAGAGCTAACTGCTAATGTGGCCATGGGCCATGGCGAATCCTG 

GTCTGTCCTGGGCAGCACCAAAGCTCTTTCCCCTTCTTAGAGGCTGCGTGTGCAGCTCGGGGAGAGGGGGGTTTTCTCACTCCTGTGGGATGGTGGC 

ATCCCACAGCCAAGTTTACTCTCAGGATCCATGCTAGGCACTGCCCTTCGTGGGATCT^ 

CAGCCCCCCACTTTTTATATAGAGAGGAGATTGTGGAAGCTCATAAATAATAAGTAGGGATGCTGGGAGATTATTGAGCTGGTATATTTCTCTCGAA
TAGTGGTGTGTTCATTGTTGCTATTGTTGGAATTAGAGGTGATACTAAAAAAGGAACGGAAAAATACCATGGCAAACACATA 

TGCCTACCTACCACTGGGTCCCAGAAAGACTTGCTGATTGTGGATTAGATGTGGTCCCCTGCCCTCGGGAGTTCCCCAGACATCAAAATGAGCTCCC 
TCAGCAGAAGCTCTCATTAGCTGGGTGCAGGGAGAATCTAGCAATGCGGTAACTCAGGCCTTCAGGACTTTTGAGTTAGTTACTTAAATGGAAGTCC
TGAGACAGGTCAACACACTGAACCCTTTGGATGTTTTAGGAATATTTGTTCCTTTGCAATTTGGCCTTTTTTTCCCCAGTCATGTTC 
TGGCTGCAGGTAGTGAAGAGGTGrACCACTCTTTAATGTTTGTACT^ 

GAGGGATTTGCAGCAAGGGCTGGTTTCCAATATGGTAGTGGAGGAATTGGCTAACATAAAACTAGCTCCTGAAATTCAGACTTGCGTTTACTGCTCT
GGCTCTATGGAATTAGTAACCATGGGAAAGAGGCAATGCTTAGAATTGTGTAAAGTGATGGAGTAGGTAGAGACCTGTCTTGAAGCATCGGTAACAG 
TAAATGAAGCAGCAGCAAGATCAGGTGTTCTGTGCCACATTCTTTTTCCATTCAT 

CCCAACCCACAAGAGACAAGCAAAATAAAAATGGACTAAATAATTTGCTAGTTTTTAATTGGGCTCTTTGGCACAGACTCAAGTAGACCTGAACTCT 
CAGCTCTGTTGTGAACCCGCTTTGTGAACAAGGACAAAGCATGTCATCTGTCTCTGAGCCTCAGTCTCCCTGTTAGTCAGTTACAGAGAGAGCGCTC 
CAAGTTTCCTTCCAAAGCTGTGCACTGGCTCGTTTCT

TGCTAGTTGCTGGTGCTGGGAGTGGGTGGGAAGGTGATTGGGCGCCAGCCCCTTCCTTCCCAGGCTTCACACACACACCCCAGGAAGCCCCGCGGTG 
CGT CGG ACT CTCTGTCT AGACATCATCTG ATT TTTATTTGCAAATGCAGGT TGC CAGGTGACAGGAAC CTT TT ATGCTTGTGCCC C AC CTGAGTCCC 
AGGCTCCCGGAGTCCCCACAGAGCCAAGCATAAGGTCTGCCGAAGCCTTGGCGTTCTCAGGTGAGTGCAGGGTTTGCTCCTGGAGGCACCGCAGGAG 
GCCAGCCTCTGCTGCCAGCTCTGTCATCTTTGGGCAGATTC&^ 

CATCCTGTGTGTGAGGTGACCAGGGTGATTTAGGAGCACCATTAGAAAACCTGACATCACTGCTTGTGGTTCTGCTGACCGTTTCAGCCACTGGCTT 
GAATGGAGTCATTTTGGCTTCTTC^CTGGCACCTC^ 

CGATGTGGAATAATTGCTTGGAAGTGTAGAAGGGCTTCTCATTTTGAGAAGCTGATCATCCTTCCAGGTTGAGCC^CAAATAAGTCCTCCTCCTCTA 
C TC CCTGGGGAC ATTAGTTCTGG TCCCTCATCTCTAAAACATTGATGTGCC TAAGAGTAATACA CATTTTGGT CTT CCTCTGAACTTTAATATAGCT 
TGC AAACAAATATGGATTCAATC TGATTT TT AAAGTTTTATTT CTAAAAAAAAAAAAAAAAATCC CTGCACCATGGAGATCTT ACCTACTATAAAG A 
AGGCAC CTCTAGGCTTGGCAAGC ACACGTGCTATATG TATATTTAT TTTTC AGATAA TAT TTTGGATTGTTTT TAAATGGGATTTGTTTTT AT ATT A 
AAACCAAAT AGCTT AAGGTTTGGAAT T CT CATCTTGCCCT CTGGCATCTTT AAAAAT CAGTGATGAAAAAT ACT AACTAAATTCTG AAGGTTTC AGG 
GAG GCG AGAGTTGTGGC ACTTTTGCTGCTCAGAGGGG AGCTGGAGTTTGACCTA CCAGCTCTTTCTAGTTGTG AATGAGGCTTGC ACCTT TTTTTCT 
GAGGTGCTCGTGAGTAACTGAGGATGCCCTTTGGGAGGAGGTGCTTCTGAGCAGGAAGGCT^ 

GAACACCTAAGACTTGTGTGGGTGCCTGAAGAGTAGGAAATAAACAGCTATTTATATCTCGGCAACCTCGTGATTTCTGATGACATTA 
AAACCTGCCCCGAGAATCACCTCGAATGGCCAACACCCACTCTCTTTGGGGCGCACTGTCTGACTCGCTTTCAGACAGTGTGTTGAAGCAGAGAATT 
GAGACAGTGATGTGGGTTAAGTCTCAAAACCTGCTGTTGAGGAATAAGATAGTTTTGTGGGTTTCTTTTTTTGGTATGCATGATACAGATTAATTAT 
CAGCCATGAGCACATTTATTAGATAACTGTGATTCCCATTGATTTTGGGGTTCCATACAGTTACCAGTGGCAGCTGCCTCCTGTCTGTGCCACTCCC 
CTGCGGTCTGGCTGAGGAGCTTGTGCAAACCCAGCTTGGTTCTTTCAACTCATGGGCGATATTTTAAAGAGGCTTCTGAAATCAGCAAAACGTTTGT 
AAATGGATTCATCTTGTTTAAAGTTTCAGAGAGCGCAGTTTTAGAGTTCTTTTCAAACTTAAACACAAAGCCCAGTGGGAATTTCAGCAGACTTTAC 
ATG AAC TTT TAG AGTATTTTTCTCATTAATTTCTTTCTTT TCACAAGAATT AAT TTTGTGATAAAAAC ACC ACATAT CCAGTGTGG AAAACTAATTT 
AAAAAT CAGAAAAACAAAAATGGAAAATAAAAACATCAC1ATTCTACCACCCAGAAATCACCATTAAGGCCTCAGTGCT 

GTGAGTGTGTTTATATATAAATTATATAGACATAAATAATTAAGCACCTAATTCTATATATAATTTATATATGTACACACACACACACACACACACA 
TTTGCTTTTTACCTAAATGATGTCATGTTATTATTTTGGGGACGTTTCAGGTATGTGGACCGACTTGCTTCTTTCTGAAACACCGTTATATTTTTTG 
CTAGTATAAATTAAACTTAGGGTTCATTTCCTCCAAAGTAAAAATTGATTTGCTTAAAGATGAATTTTCATTCAGTGACACATGACTTAAATGACAT 
CAC C ACT ATCATCATCATAATTC ACAAAT AAG ACTTAGCAAGTTCTTAATAGTG ATGCTCTAT AAT AGGAATT C AAC TTACAT TTC CAGAAAATTAA 
TGAAAGATACACTTGTTCCTTAAAAAAAGAGTCGGTGGCCACATACGTTTGTAATATGCTGGTTTAGACAGATTAAATGGGTTTCTTTCCACGGGAC 
TTCTCCAAGTGTTGACCGAGCTCATGCTGTCTGTGGATCTCTGGGGGAGACAAGGGCACGCTGTGTTTCCCAGGCCTCCTTGACGCGGGAACCTTTT 
GTTCCCCCACGGGACATGTGGGACACTAGAGTTCCACCACAGGTGCTTGGCTCTGTGGAGTCGTTGGCCTCGAGGTGGTGTCCTTGGCCCCCAGCAC 
TGACTCCGGAGCTCTGTGTTTGCAGACTGCCGGCTGCACATCTGCCTGTACTACCGGGAAATCCTCGTGAAGGAGCTGACCACGTCCAGCCCCGAGG 
GCTGCCGGATCTCCCATGGACATACGTATGACGCCAGCAACCTGGACCAGGTCCTGTTCCCCTACCCAGAGGACAATGGCCAGAGGAAAAACATTGA 
GAAGCTGCTGAGCCACCTGGAGAGGGGCGTGGTCCTCTGGATGGCCCCCGACGGGCTCTATGCGAAAAGACTGTGCCAGAGCAGGATCTACTGGGAC 
GGGCCCCTGGCGCTGTGCAACGACCGGCCCAACAAACTGGAGAGAGACCAGACCTGCAAGCTCTTTGACACACAGCAGTTCTTGTCAGGTAAGGCAC
CTCGTTATCTGTTAGAATGGAGGTGGTGATGGCCTGCCTGCCACAGGGGTCAGAAACCACAGGGTCCCTCCCACCCCAGGCTGAGGTCTTCCTCCTG 
TTGACTTCGGCGCCCACTGGGCTTGGGGCTTGACTCXTAGTAGAGATCTTCTGTCTGGCTCTGTTGTGGGCAGAGAGTCCGGATCAGATGGTCCAGGT 
GGACCATGGCTCAGGCTGTTCTCTTGCAATTCTGGTTTTCAGCAGTCGCCTCTTAAAACAGTTCCATTTCATTGCACTATGATCTCGACAAATACGT 
CTGCTGATGTGGCGGCTCTTTTCAGGTTCCCAGAAAAATCATAAAGGTTGGAAGCATATTGTTAACATTCTTGGTTGTCAGGTGTGTCACCCTCATG 
C AG ATG CTGGC CCACGAGTG AGACGGAGGGTGGAGTGGAC TGT ACTGGTTT TGC ACAGGAG AAG TTATAGGAG AGTGCT ATAAGTTGGCT CTGTG AT 
ATGAACATTTCTAAGCATATTTTTCAGAACCAGTGAGTTTTGTTTTTCTACCTTGGATTTCATGACAATTCAGCGAACCTCTGGCTGGCTTCTGTGA 
AGCCCCTCGCCCTGTCATGCTGGGCTTCCTCACTGGCTTTGTGGGGCTGTCCAGGTGGAGGGAGCCTCCCGCGTGGGAGGGAGCTGCTCTACCCGCC 
ACTCGCCACCCCACCGCCAACTCTTCCTCTGCCTTTTCCCAGTCCTGTCCCTTCCTGAGCCCCTTCAGAAACTCCTGTGTCTTTGATTTCTCATGTG 
CCTCCCCTTTTCTTGAATGTGCTTCTTCATTCAACTTATTTAAAATAAAAATATGACACGTCTGTGTGCTCTGGCACTGTTGACATGTGCTCGTACT 
TCTGGGTGTAAGTGACCCAAGTTTTTGTAAGCTCATCCAGATTTTCTTTTGGTGCCCAAAGCAAATATGTCCCCAGATAACTAAGCTTCAGGCCAGG 
TGTGGTGGCCCATGCCTGTAATCCCAGCACTTTAGGAGGCTGAGGTGGGCAGATCACTTGAGGCCAGGAGTTCAAGACCAGCCTGGCCAACATGGCA 
AAACCCCGTCTCTACTAAAGATACAAAAATTAGCTGGGCGTGATGGTGGGCGCCTGTAATCCCAGCTACTCAGGAGGATGAGGCACAAGAATCGCTT 
GAACCTGTGGGAGGCGGAGGTTGCAGTGAGCTGAGATCGCACCACGGTGCTGCAGTTTGCAGGACAGAGCGAGACTCCATTGCCCCCCGCACCACAA 
AAAAATAACTAAGCTCACACAGAGGCTGACCATAGGAAGAGGGAGAACTGTGCTGGTTCCCCGAGGCGGCGGCAAGTGAGGAAGTCACCTGGGGCTG 
GACACCTCTCATTCTAAGGGCAGAACCATCCCCAAGCCCTGGCCAGGGCGGTCCCATCCTTCTTGGGGTTGGGCATGGTGTTCATGGGGGACCAAGG 
AAATTGAAGGGCAACTGGGGGGACCCCCCCTCCCCTGTCAGCATCTCCTGTGTCTGCCTTGGGATCTGGAAGAGCTCACATGGGCCAGGAAGGCTAA 
GGCCCACTGGGCCTGGATTTCTGAAGACTCTGGACCTTGGTGCCAGTGGATTCAGAAGATACCACAAGGTGAGGGCTTTCTAATGAAAGTGTCACAA 
GGAACTGGCACTGGCTTTTCTGAGTGCCTCTTGCTGGGCGTTTTTAGGGAGGTAGAGCCCCCGTGGTGACTAAGCTGGAAGGTGCACATTGAGTCAC 
AGGTGCACTGCGTGAGAGACAGCACAGGCAGGGGGTGGACGCCTGTGAGTGTCCTGGGGCTGTAGACTTGCACGACTAGAACTTACTATTAATCTGT 
GAGCAAGAGCTGGTCTTGGCTTTCATTCCTTCCTCTGTAACCAAGGGCT^^ 

TCAAGGAGGATGCTGTGAACGTAAAAACTGCAGCTGTGCCAACCAGCTTCTGCATAATTAAGGATTCCCACCAAACACTCTCATGTTATCTAGGGTT 

G GAGCC ATG CTTTCTCAG AG AATTG TGC CAACTCGCCATTCTGATTAGCCTGTG TAGGTGTAGTCTCAGAT CA CGGCAGTGTGAATGT ATTTT ACAG 

ATTCTGACTAAGTCATTTGGGTTTGATTTGAATTCTGGAAAAAAAAAAAAGCAGGAAGTCAAATAGTCCTGTAAGTTAGCTAGAAACTTCTGTTCAG 

TTGAAGAGAACAGTGGAGATCTTTGATATCTTCCTATTCAGGTCTGCACAGCACTAGGGACAGCCCCCAGGGCCCGGCCCGAGGGTGTGTATCTGAT 

AAGGACGCGTCTGTTCTGCAGAAGCTGTAGCGGCTCCTGTGTCAACTCGTTTCTTTTGCGGGCGTGACATTTTATTGTAGCTACAGGCAGAAGATTT 

GTCTTGTGTACAGGGGAGGGAGCATGGGCTAAAGTCAGGGGATGGGCACTTGTCTTTCCAACAATGTTCCAGTCCGTTTTGTATCT 

GTGGTGGCTTCAGGATGTACATGTGTATATCCACAAGCGGGAGGCCAGGGGGGCTGCCCCACTCCTGTTCTCAAGTCAAGTTACTGTTCCATCCCTG 

GAGACAGGCAAGAAGTCTACTCAGAGTTACACAGTTCAGGCATAGTGACAGTG^ 

T AC AT ATTTTCTGTGGTGCCTGAGT T ACGTGGATGTCCGC AGT AGC AC AGATAC TGG ATT ATGTGGGCT CTGTAGTGAGGGG ATG AGGTTT TAAAAT 

GATCCTGGGATGTCGAAACATTCTGATTTTTTAAATGAAAACTTGTCCATC 

TATCTCCCGACTTCACTGTGGTCTACTACCTTTTGAATCTTCGTGGGTCCAATGCTC 

GAATCGAGGGAGACCGAGGGCCCTGGGGGGAAAGCACCCAAAGGAATGCATACATGCTATTTTGTATCTGAGATGTTCACATCAAGAGCCCCACTCA 
GCGGAGTAAGAGTGCTCATTCCTCTTGCAGTGTTCAGAATCACAGTAAGCTCTTGCTTCCTGTTTAACCTGGTGTGTTCGGTGATGAGGGTTTCTGA 
ACATGGTCTCTTTCTTGTGGGCTTCTACAGAGCTGCAAGCGTTTGCTCACCACGGCCGCTCCCTGCCAAGATTCCAGGTGACTCTATGCTTTGGAGA 
GGAGTTTCCAGACCCTCAGAGGCAAAGAAAGCTCATCACAGCTCACGTGAGTCCTCAGTTACACTCCTACCATAGTGGCTTCCTGTTCTTTGTAAAG 
GCCAGAGTTTCATTTAGAAAAGTCCCAAATGAAAAGTAAATGTCAAATGACCTGGAAAAATAAGCGTAACCCTTAAACTAGTGAGGGAGGAAGCATG 
GTCCAACGAGCAGCACAGTCTGAGGACTCACGTGCTCCTCCCAGACTTGAGATCTGCTCATCAAAGAACCAGGGAGGACAGCCTCAAAGGAGGGTGC 
CTCTTTCCCCATCTTTTTTATTTTTCAGGAAAGTTGTCGTATTTCCATTTTATAGGGATTGAAAAAGATTGATGGTTAAAGTTGCCCTTTAAAATTC 
TCCAGGTTAA6ATTGCTGTAAGAATGCTATCTAGCTAGTGGATCTTCATTCAATGGAAAAGCTTTTCCCAAATGAGAAACTATCCATTCCCTGGAGG
C ATTTT GT AGGTCTCTG CAGCTGTGTCC TAGC AC CTCTTTTATTT CTGG AATTTTAG AAATTATTTAATTATTGGG CAT ATG ATATTTGAAAGAGGC 
TGAATCTTTCAAGGAATTCAAGCAAJVTCAG&CCCTCTCGTAAT^ 

CCCTTAAGACTTTTGAATTTGAAAACTTCACATTGCTACATACTTGCAAGCATTATTAACG^ 

CCCCCT AAC AT CAAAGTTCTC AT CATGTTCTAGCTCTGATGCTATT TCTCATTGAGAACTGAAATGAAACATG GTT TAATCTTGAACATACAAC CC C 
CCTTCTTGATT TTAAAAACAAAGACCAGCC AAC CAACC AGT AT ATAATC CCAT AGACT CAGGAGTTTTC TCATGAGTTCT CTC C AGTGATTGAC TTA 
ATAATTGATATGATTGTTGAAATAATAACTGATACATTGTTAACAACAGGAATTGCTAAATC^ 

GAAGTTGATGATCGGCCGGGCACAGTGGCTCACGCCTGTAATCCCGGCACTTTGGGAGGCTGAGGCGGGCAGATCATGTGGTCAGGAGTTCGAGACC 

AGCCTGACCAACATGGTGAAACCCCCTCTCTACTAAAAATACAAAAATTAGC^GGCGTGGTGGTGCGCACCTGTAATCCCAGCTACTCGGGAGGCT 

GAAGCAAGGGAGTCACTTGAACCCAGGAGTCAGAGCTTGCAGTGAGCCGAGATTGCACCACTGCACTCCAGCCTGGGTGACAGAGTGAGACTCTGTC 

TCAAAAAAAAAACAAAAAAAGAAGTTGATGATCAGCTCAGCTATCACATCAATCCAGTTTTAAAGTTGACATGGATGCAAGTCAC 

CAGGGAAAGATGGATCCCCAGATGACCCCCTCATGCTGAAGACGAGGCTGCGTGGTCCATAAAATGAATAAACACACGTGTACACCTATGAGTTCCA 

CTTTTAAAAATTATTAATTTAATTCATTTCAGGTACTTAGGAGTTTAGCTTAAGTCGTCATATATAAAATACAGTCTCTAATATCATGATTGATGGA 

GAGCATTCAGCTTGCCTTAGATGCTGTAAATTCAGGAAAAGCTGAACATTAAGGCGTTTTTGTTTTTTGGTGAGTGTGATCCACGCTAATAACCAGA 

CAGTAT AAATTTGAGGCCAGTTAGC TTCTCTTTTT CCACAATAGT AGTTTTGTT GTTGTTTAACTTTACTTTT T ATTTCTTTT ACCTCTGGTT GAG A 

ACT ATA CAG ATT AC CTAGAAATC AAATC ATTCAGGATGAGCTCCTTGTT TCTTT CAAACTGC AGTTGTGGAAAAAC AAAAT CATTGGCCTAAATT AC 

CCTGCAGATTCCCTCAGACTTCCTTTTAATTTGATCACTTTACTTGATTTGATGCAGTTTTAGTTAAATGTACATTTTAAGTGGCAGTCGATTTGAA 

AGTAACATCTTTAACCCAGAATTAAGTCACTTTGGCTGTTTTTTTACATGTTGATCTATGGAAGGACTAACCAAAAAATTGCTTCTTTAGATGGCTC 

CAA ATATGAAATGTTTTGATGTGTT CTAGGATGT AACTTTGGG CTTTACGT T ACTGT CTC CT T AG AATCTG AGTGCTGTTTAATAGTG AGC CAGTTG 

CAGGATATCTCAGTAATGTCTTTTTAAAATCTCTTATTAAAGGTAGAACCTCTGCTAGCCAGACAACTATATTATTTTGCTCAACAAAACAGTGGAC 

ATTTCCTGAGGGGCTACGATTTACCAGAACACATCAGCAATCCAGAAGATTACCACAGATCTATCCGCCATTCCTCTATTCAAGAATGAAAAATGTC 

TCTCAGCTCACTGTGACCTCCGCCTCCTGGGTTCAAGAGACTCTCCTGCCTCAGCCTCCCTGGTAGCTGGGATTACAGGTGTGAGCCACTGCACCCA 

CCCAAGACAAGTGATTTTCATTGTAAATATTTGACTTTAGTGAAAGCGTCCAATTGACTGCCCTCTTACTGTTTTGAGGAACTCAGAAGTGGAGATT 

T CA GTT C AGCGGTTGAGGAG AATTG CGG CGAGAC AAG CATGGAAAATC AGT GAC ATCTG ATTGGC AGATGAGCTTATTTC AAAAGG AAGGGTG GCTT 

TGCATTTCTTGTGTTCTATAGACTGCCATCATTGATGATCACTGTGAAAATTGACCAAGTGATGTGTTTACATTTACTGAAATGCGCTCTTTAATTT 

GTTGTAGATTAGGTCTTGCTGGAAGACAGAGAAAACTTGCCTTTCAGTATTGACACTGACTAGAGTGATGACTGCTTGTAGGTATGTCTGTGCCATT 

TCTCAGGGAAGTAAGATGTAAATTGAAGAAGCCTCACACGTAAAAGAAATGTATTAACGTATGTAGGAGCTGCAGTTCTTGTGGAAGACACTTGCTG 

AGTGAAGGAAATGAATCTTTGACTGAAGCCGTGCCTGTAGCCTTGGGGAGGCCCATCCCCCACCTGCCAGCGGTTTCCTGGTGTGGGTCCCTCTGCC 

CCACCCTCCTTCCCATTGGCTTTCTCTCCTTGGCCTTTCCTGGAAGCCAGTTAGTAAACTTCCTATTTTCTTGAGTCAAAAAACATGAGCGCTACTC 

TTGGATGGGACATTTTTGTCTGTCCTACAATCTAGTAATGTCTAAGTAATGGTTAAGTTTTCTTGTTTCTGCATCTTTTTGACCCTCATTCTTTAGA 

GATGCTAAAATTCTTCGCATAAAGAAGAAGAAATTAAGGAACATAAATCTTAATACTTGAACTGTTGCCCTTCTGTCCAAGTACTTAACTATCTGTT 

CCCTTCCTCTGTGCCACGCTCCTCTGTTTGTTTGGCTGTCCAGCGATCAGCCATGGCGACACTAAAGGAGGAGGAGCCGGGGACTCCCAGGCTGGAG 

AGCACTGCCAGGACCCACCACTGGAAGCAGGATGGAGCTGACTACGGAACTGCACACTCAGTGGGCTGTTTCTGCTTATTTCATCTGTTCTATGCTT 

CCTCGTGCCAATTATAGTTTGACAGGGCCTTAAAATTACTTGGCTTTTTCCAAATGCTTCTATTTATAGAATCCCAAAGACCTCCACTTGCTTAAGT 

ATACCTATCACTTACATTTTTGTGGTTTTGAGAAAGTACAGCAGTAGACTGGGGCGTCACCTCCAGGCCGTTTCTCATACTACAGGATATTTACTAT 

TACTCCCAGGATCAGCAGAAGATTGCGTAGCTCTCAAATGTGTGTTCCTGCTTTTCTAATGGATATTTTAAATTCATTCAACAAGCACCTAGTAAGT 

GCCTGCTGTATCCCTACATTACACAGTTCAGCCTTTATCAAGCTTAGTGAGCAGTGAGCACTGAAACATTATTTTTTAATGTTTAAAAAGTTTCTAA 

TATTAAAGTCAGAATATTAATACAATTAATATTAATATTAACTACAGAAAAGACAAACAGTAGAGAACAGCAAAAAAATAAAAAGGATCTCCTTTTT 

TCCCAGCCCAAATT CTC CT CT CT AAAAGT GTC CACAAGAAGGGGTGTTTAT TCT T CC AACAC ATTTCACTT TT CTGT AAATATACATAAACTT AAAA 

AGAAAACCTCATGGAGTCATCTTGCACACACTTTCATGCAGTGCTCTTTGTAGCTAACAGTGAAGATTTACCTCGTTCTGCTCAGAGGCCTTGCTGT 

GGAGCT CCACTGCC ATGT AC CCAGT AGGGTT TGA C ATTTC ATT AG CCATGCAAC ATGGATATGTATTGGGCAGCAGA CTGTGTTTCGT GAACTGCAG 

TGATGTATACATCTTATAGATGCAAAGTATTTTGGGGTATATTATCCTAAGGGAAGATAAAGATGATATTAAGAACTGCTGTTTCACGGGGCCCTTA 

CCTGTGACCCTCTTTGCTGAAGAATATTTAACCCCACACAGCACTTCAAAGAAGCTGTCTTGGAAGTCTGTCTCAGGAGCACCCTGTCTTCTTAATT 

CTCCAAGCGGATGCTCCATTTCAATTGCTTTGTGACTTCTTCTTCTTTGTTTTTTTAAATATTATGCTGCTTTAACAGTGGAGCTGAATTTTCTGGA 

AAATGCTTCTTGGCTGGGGCCACTACCTCCTTTCCTATCTTTACATCTATGTGTATGTTGACTTTTTAAAATTCTGAGTGATCCAGGGTATGACCTA 

GGGAATGAACTAGCTATGAAATACTCAGGGTTAGGAATCCTAGCACTTGTCTCAGGACTCTGAAAAGGAACGGCTTCCTCATTCCTTGTCTTGATAA 

AGTGGA ATTGGCAA ACT AGAATT TAGTTTGT ACT CAGTGGACAGTG CTGTTGAAG ATTTG AGG ACTTGTTAAAGAGCACTGGGTCATATGG AAAAAA 

TGTATGTGTCTCCCAGGTGCATTTCTTGGTTTATGTCTTGTTCTTGAGATTTTGTATATTTAGGAAAACCTCAAGCAGTAATTAATATCTCCTGGAA 

CACTATAGAGAACCAAGTGACCGACTCATTTACAACTGAAACCTAGGAAGCCCCTGAGTCCTGAGCGAAAACAGGAGAGTTAGTCGCCCTACAGAAA 

ACC CAGCT AGACTATTGGGTATGAACTAAAAAG AGACTGTGC C ATGGTGAGAAAAATGT AAAATCCT AC!AGTG AAATG AGCAGCCCTTACAGT AT TG 

TTACCACCAAGGGCAGGTAGGTATTAGTGTTTGAAAAAGCTGGTCTTTGAGCGAGGGCATAAATACAGCTAGCCCCAGGGGTGGAACAACTCTGGGA 

GTCTTGGGTACTCGCACCTCTTGGCTTTGTTGATGCTCCGCCAGGAAGGCCACTTGTGTGTGCGTGTCAGTTACTTTTTTAGTAACAATTCAGATCC 

AGTGTAAACTTCCGTTCATTGCTCTCCAGTCACATGCCCCCACTTCCCCACAGGTGAAAGTTTTTCTGAAAGTGTTGGGATTGG 

TTGTATTACGTATCTCCCGAAGTCCTCTGTGGCCAGCTGCATCTGTCTGAATGGTGCGTGAAGGCTCTCAGACCTTACACACCATTTTGTAAGTTAT 

GTTTTACATGCCCCGTTTTTGAGACTGATCTCGATGCAGGTGGATCTCCTTGAGATCCTGATAGCCTGTTACAGGAATGAAGTAAAGGTCAGTTTTT 

TTTTGTATTGATTTTCACAGCTTTGAGGAACATGCATAAGAAATGTAGCTGAAGTAGAGGGGACGTGAGAGAAGGGCCAGGCCGGCAGGCCAACCCT 

CCTCCAATGGAAATTCCCGTGTTGCTTCAAACTGAGACAGATGGGACTTAACAGGCAATGGGGTCCACTTCCCCCTCTTCAGCATCCCCCGTACCCC 

ACTTTCTGCTGAAAGAACTGCCAGCAGGTAGGACCCCAGAGGCCCCCAAATGAAAGCTTGAATTTCCCCTACTGGCTCTGCGTTTTGCTGAGATCTG 

TAGGAAAGG ATG CTTCACAAACT GAGGTAGAT AATGC T ATGCTGT CGT TGGTAT ACATC ATG AATTTT TATGT AAATTG CTCTGCAAAGCAAATTG A 

TATGTTTGATAAATTTATGTTTTTAGGTAAATAAAAACTTTTAAAAATTTGTTATGGAATGTGTGTGTGTGTTATTAAACATGCATCCTGATAGCTT 

TTAAAGCACTTTCTCCTGCGGTTACGTTCCTT<^GAGAGGCTGCTGTACGTGTCTCTGAAATCAGATCTCCAAGTGTTCACCTTGTTAAAGAGAC^A 

ACGCTGGCCAAGTGCCGGGCTGCTCCTCCCACACAGAGGAGTGTCTGTGAAGAGTAATCATCCTGCTCTCCCGTTAGCATCTGCCTCCTCTTCCTTT 

ATCTGTACTTTAAAAGGTTTAGGGCTTTCAAGGAAGTTGTTTTTA^ 

AAGAAGGGATATAGGTTAACTAACACTCCAAATGTGAAAATCAAATGGAAAACTGGATAAAGAGAACAGGATTTAGATGGAACCCCTGGGGATCCGT 
ACGTCTGCTGTTGACCTCAGCTCC^AACTCTTCCTCTCTCTGAGCAGCCAGGCAAGGGAGTCTCCTTTCTCAGCTTCATCGCTTCTACTGATTACAT 
GGCTCAGCTCCTATGCACCTACTAACTTCCAGTCCAGCA^ 

TACATAGAGCTGTCAGAAAAGTTTGTAGTTTGGATCTAGAACCCCCCCTAGAATAGTAGATTAGCATAATTTTGATTTTGCTAAATGGTAAAATTTC 
TTTTCTAAACCCTCACAAGTTCAGTTGAGAAACTTTCCTGAAAACAAAGATCAATAAATCTTAGCATAGTGATAAGAC^U^GGAGACAGCAGCTCCA 
GTGTTTTAAAAGGGGACAAAAATGAGAGAAGGTAGTAAAATCTGTTCCCAGATTCCACTGCTGCCTTCGCTGTTCCCAGGAAGGAAGGATTTATGTC 
CTGATCTCAGGAGACAGGCTTC^GCAGAGTGCTTCCCTGTGAGTTCTGTGTTCCTAACTTAACAATTCTCAATATTTTGGGGAGAAATATTTTGGTT
TCCTTC AGT CC C CC CTTGAAGCTTTACTTTCAG AAAGTTTC ACATGTT AAAAGCTAAGTTGGTAGTTTTGG AGAGATTTGGGTT AGAAG TTTTAAGA 
TAAGAGATTTGGCAAAGGAGGAGGAGAACATAGTTTGGAACAAGG&GAAAGGAACCAATTTGGGT^ 

CAGTCCTGAAAATAGGT C AGTTCCGTTAAATCAT TGCATC CC ATTTCAGAGGTGGTGTAAGTGGGCTT C CATAAGAGTT ACGTCTTCCATATGGTGT 

AGGCACACAGGTACTTTATCAGAGACATTTGTATGGAAATAAAAAGAAAACAAAGGTCTATGTTTCAAGCAGTCTATAAGCTAGTTTCTGGAGAGCA 

ACCAGCTGAGATTTCTAGATTTGGGTTCAAATAATTTTAAGTTGGAGTGGACAGGCAGTGGCAATCTGACAGATTTTTCTGGATTATAGTTTG 

AGACGTTCCAGTGAAGTTTTTGTCCATAAATAAATTGTGATGACTTCTCCAAAGTTTATATCAAGTTGTTCAGCTTTCATTTGCAGGGC 

AGG CACGGT TTT AATTC CTAGTG ATTTCAAGT CAGAAAGGTGGGAGAAAGTTGGGAATGTTAGTC TGAAGAGCCATAGCAGAT ATT AGAGGAAACT A 

AAA CTT CAGGATTC AGT C CAGATAATTTTTAAAAACT CTAAAACAATGG ACAGGGCTAGAAT CTAATAACAGCTATGCTATAGCTGTT CTGAAACAT 

TTTTTCTCTCTACAGTCACCTCCCTGTTTACCAAAGATAATCACAATAAGTCCAGTTTAC^ 

TCATTTACATAAGTACAGCAAGAACAGTGATTTAACATATAAGCTCTTTTTA^ 

AACACTAAAACCCTCTT CAGACTAGGAAGCCGTG C CAC AG ACTTAC CAACAGATTTTATCTG AGATACCTGGGTAAATTTTTTT CTTGAGGTCTCC A 
AAATGTTTTGAGGTTCCTGGGCTTGTCAGAAAATGACATTATTTACTTACTGCAAGGCTAGAAGTC 

GTGCCAGGCCAGTTCTTTCAAGGGTTTTATTGGCTCCTTAAAGTTAACGTTCATTCCTTAAAGCAGTCTGGTCACATCTGAAA 
TCATAGCCTTGATGATATAACCTGTGTTTCCAATTTTTCCAGTCCGAAGAGA 
ATAGAATTTCTCACATACAGTTTCTAAATCCTGGAGGGCTCAGGTAAAGAAAAAGATAAATC 
ATTACTGTAAAGCTGTAGATTACTTAAGAGGGAAAAAAAGTTTTCTCAACTATGGAAAGAAAAC^ 

TGTTGCAAAAAATCATTTTAGTCCTTCCTTGGTTTAGTTTCGTGTAATTCTTGTTTTGCTTGATTTGGGGTTACTAATTTTATGAGTCCAGTTTTTT 
TTTC ATT AG CATTC TGGG AATTTTTAATGACTCC AATAATAT GGT CTG AAAGTTGTC AGGAACCTGTATTCAAG AGTACTTCTTTCAG TCCTTTCT A 
TGAACCTTC TTG AAGAC ATAACACTTTAGGATCTG CAAAG AG CTTTTAGAAAAAGTATC ATAATAAAG C AATT AAT AGCAGATAACAAGACTTAAAA 
TGGACATAGTTAAAGACACAATTGACATGGAAATTTO^ 
ACCAAAACATATCAGATTTTTAGGAATCTCATACAATTTTAGAACACATATTAATAACATATCCATATGATATAGTTTGGATCTGAGTCCCC^
AATCTCATGTTGAAATGTAATCCCCAATGCCAGAGGAGAGGCCTGGTAGAAGGTGACTGGATCATTGGGGTGGTCCCTCATGAGTGGTTTAGCACCA
TCCTCTCAGTACTGCTCTCATGATGGTGAGTGAGTGCTCTCAAGATCTGGTTATTTAAAAGTGTGTAGCACCTCCCCC^CTTCCTCCTTCTGCTCCT
GCCATATAAGATATGCTTGGTTCCCCTTCACCATCCACTATGATTGTATGTTTACTGAGGCCTTCCCAGAAGCCAAGCAGATGCTGCCATGCTTCCT
GTACAGCTGGCAGAACTGTGAGCTAATTAAACCTCTTTTCTTTATAAATTACCCAGTCTCAGGTATTTATTTATGAAGTGCAAGAATGGAGTAATAT
AGAAAATTGGTACCAGTAGTGGGATATTGTGATAAAGATACCTGAAAATGTGGAAGTGGGTTTGGAACTGGATAACAGACAGAGATTGGAAGAGTGT
GGAGGCTCAGAAGAAGACAGGAAGATAAGGGAAAATTTGGAACTCCCGAAGGACTTGCTGAATGATTGTGACCAAAATGCTGATAGTGATATAGACA
ATGAAGGCCAGGCTGAGGAGGTCTCAGATGGAAATGAGGAACTTATTGGGAACTGGAGTAATGGTCGCTTTCGTTACGCATTAGCAAAGAGCCTCTA
CTCTAAAGATTTGTGGAACTTTGAACTTGGGAGCAATGATTTAGGGTATTTGGCAGAAGAAACTTCTAAGCAGTGAAGTGTTTAAGATGTGACCTGG 
CTGCTTCTAACAGCTTATGCTTATATGTGTGAACAAAGAAATAACCTGAAACTGGAACTTATATTTAAAATGGAAGCAGAGCATAAAAGTTTGAAAA
ATTTGCAGCCTGGTCATGTGATAAAGAAAGGAAAATTTCACTAGCACCTCTATCTAGGGTCTCAGGTTCTCATCTGACA

CAACCCCATGTGTCCATAGATAGAAGAATAAAAAAAGACAGTCTGTAAGAGAGGAAATCAAAAGCTGTTCATGGTAGTAG 
TACCACAAAAAGTCAAGACTCATATAAATAATTAAAAACAAGGCATTTACACTAATTTGTATAAATGTTTCTCTCCCAAGCTAAGGTATTTA
GAAATCAATTTGGTGGTGGGTCTAAAGAACTTTATTTAACTCTGTGTCAAATCCAATCTCTGCTTGAATGCTGCTTAATTAATTCCCTGAATAACAT
TTCAAAAACATGGTAAGATTAACATCTTCCATGGGCTGAGAAAGGCTTAGAGAATTAGGCTGAAGAGTGTTTTGTCAATCTTGCAATGACTTACAGG
C ATTTG AAAAGTGAACTTCTCTAAAAGAAATTTATTT TAAATATA GCCAGT CTAATTTATTT C AAAGTG ATGT AAACCAATAAGCCAAAATAAAAA C 
ACC AGAGTCACT AT ATC CAAGAGCC AATTTATACAAATAATTT TC TCCTATTTAAATTTGGAAAT GAAGGAAAAAAGAAAAAACTCTTACCCTCCCC 
TCTCAACTGAGCACTAAAAATAGAGATGTGAAGAGCTGACGTTGGTAAGAAACCTTACCCTTTTCTGCTGGTTTCTTTCAGATCTCTCAGAGCTCCA 
TCCATTGGCCCTAGTGTGAGCAAAGTGTGCCTTTGTGGTTGTCAAATTGAAGGAAAGAAAATATGATTTATTTTCTCAATCCTCATATTTTCT 
TGAGACACTTTCCTGAAAAAAAAAAGATTAACAAGAGAAAAGCCAGAAGAAGTTTATTGATGAATATTGTACCCATCACCTAGAAGAAGGCCTCAGT 
TCAAATGTATTTCTCTCCCAGGGC^GAGTAGATGAGGAATCTTTCTTAAGTAGTATTTTAACAAAAAACAATAAATCTTAGCATAGAGACAAA 
AGGAGGGGGCAGTTCCAGTCTTTTAAAAACTGGGAAAATGTGGGAAGATAGT^ 

GGGCCAATTTCTAGTAAGGAAGGATTTATGTCCTGACATCAGGCAAATAGAGGTGGGGCAGAGTGTTTCCTTGCATTTTCAGTGTGTTTAACTTAAT 
AATCCTCAATATTTTTGGAAGAAGTATTTTGGTTTCCTTCATAACGAAATAGAAATAATGGACAAATAAACAATCTTTCTTCTTCTTGGGGCTCCAT 
TTGCTCAACAAATACAGGTGGACTCAAGTCAGTATACATGCATAGGC!AAGTTTTT^ 

CAAAGGGTTTTG AG C CCTTGAGATTGCT CTGAAGT AATTT T ATGT ATCTCT AAGGTACCAGTGTT TAT TCT CGT AAACTGTT CTGTAACTGCAGCAC 
AAGTTTGGCTGTATTGGATTTGATTTCCTAAAAACAATGATCACCTGACAAACACGCTTGTTGCAGTGTGCCACCAGTACTCCCCTAGGCCAGAAAG 
ACACGTTGGGTCCTGTCTCATCTCTCTGCAAAGGAGCTCTGGACAGGCTATACCTAAATTAACACAGGCAGACATATATTCTCTGTTTTAGTGTAGC 
ATTATGAATTATATGGTGTTTGAAGGCAACCCGGCCTTTTCAGTGACCCCTCCTCCTGCCACACTGGTAGTGTGTTGGCAAAATGTCTGTCCTTAGG 
GAGTCCTCAAAGACTCCTTCCTTCTCTTCCCTCTGCCCTCTGCCTCTGCTCCCCTGGCTAGTCTGCGCCCACAGGTCTTGGCTGTGGCACACTCCCC 
TTCTGCAGGCATTGCTTCCCTTATTCCACCCTCTGTTTGGGTCTCC^AAGTGTGATATTTGTCTTGAGAAACACTTTGCCCCACTATGTCATTCCCC 
TTTCCTTACTCTCCAAGCAGCTGGGT<^CTGGTGCTCAGGACTCAAACTGGCTGGGAATGTGAGAAAAGCTCCTTTTCCCTCCTGGCCTCCTACTTG 
AGACTAAAGCAAAATGCTGAGGGTTGAGGAAGGGAGAGCGAGTGCTCGCTGAGAAAGGAAGAAATGAGTGTTTAGACTTCAGCCACTCCCTGATAAA 
GGGGAACGATCTTCTGCCTCCTGGGAAAAACTTTGATGAGACAGCAAACTTGAGAAAAGTTTGTGGGAAGGAGCAGTGAGAACCGCAGGTTACGAAG 
GGGCCAGGTGGGCAGGACTGAAGGATAACTCCATGGTAACCGGCGTGGATCGATGACTATGGGGCCAAAGAGTGGGCATGCCTGGGTGTGTCTGAGC 
AGAAACTGCATTGAGGACAAGATCCCCTGCCCACCAGCACAGGCTCTGGCTCTGTGGGAGCTATGCCAAACTTAGCCCCAGTTCAAGGGAAGGAGAG 
GGAGGAGAGGGAGGAGAGGGAGGAGAGGGATCCAAAGTGAACTGAACTTGAATTTGCATCACCCTAGTGGAATATGAGCTTAAAATTAAC 
TTGTTAAAGAAAATATGAACTACATGTCTTAAAAACTTTGGCTCTGGACTTAG 

TTGTAATTGTCTGTTTATTAATTTGTCTTCTGTACTATAAATTCCTTGAGGAAGGGAACTGTGCTTTCC^GTAAACACC<^GTACTGGTGCAGCACC 
TGGCTT ATAGT C AG CAC CATCGAAT CCTC AGGAAATG ACACAGTTGGCG AACCTAGATGTCTACCATGT CC AAC AGTGTCCAATAGGATG CTGGCC A 
AGTGATGAGCCACCTATGGCATGAAATGCTATAGAACCACTTACAAATTGTGCAGTATACAAGTA 

TAAAGTAAAAAGAAACAGGTTTCAAAACAATGTGATATCTTTTTAAAATTTCATGGAGCTCATACATAGAATTTTTTAAAAAGAACATTCAACAGTG 
TAAATTATACCCCTGAGATGTGGGGTCATGGAAATTTTTTATTTTCTTGTGTGC^ 

TTACTAGAAAGGGAGAAAAGCACAAATGTAATCTTTCTAAGTTGATACGTCCCTTTATATTTTTTGGTGGGCTTCCAGTTTCTCAAGCTGCCTTGTG 
GGGTCCCCAGAGCTCTGGGGGTGGGACCAGCTCTGAGGTGAAGGGACCAGGGGCTCAGGGTCTC^^ 

TCTTTTCCTACCATTTCCCCTTGATATGCTTTCCCCCACCTCATCTTTCCCAAGATGCTTCTTTGTCCTCTGAGCTATGACACTGAGGATGAACGCT 
CTCTCAGCTTTCTGGCTGCAGCTGCCTCCATGCAGGGACCCCCAGCCTGTGAACTTCCCTCTGGCACCCCCACTCCCCCATGCGGAAACTCCCTCGG 
CCACCCGTTCTCATCCCCAACCCAAGCGGGACCTCCCTCCCCCACCAGCACCCCCATGCGGGACCTCTCTCCCCCACCAGCACCCCCATGTGGNNNN 
NNNN^NNNNNNNNNNNNNNNNNN^ 

NNNNNNNNN^NNNNI^NNNNNNIWNmNNNNimNNCCCCCrrCCCACCCCCATGTGGGGATCTCCCTCCATCCCTAGCACCCCC^ 
CTTTCCTCACCTCCAACCCCCATGTGAGAACCTCCCTCTCACTCTCACCCCCATTCTGGGACCCTCCTGAGTCTCTGCCCTTCCTCAGTCATCACAG
TCTCTGTACAGCTAGGGACTGACTCTTTGCCTTTGGCC 

AGAACATTGATTCTTCTTTTTTATTTGTGGAAAATATGCATAACATAAAATTTACCATCTTAACCATGTTTAGGTATACAGTT 
AATTCACATTGCTGTGCAGCCATATCCAGCATCTGTCTGCAGGACTTTCTCATCTTCCC^ 

TTCCCTGGCCCCTTTGCCCCCAGCCCCC^CCATTCTACrrTCTATCTCTATGAAACTGACTACTCTGGATACCTTGTATGCGTGGAATCCTGCAGTA 
TTTGTCCTTCTGTGTCTGGCTTATTTCACTGAGCTTGATGTCCTCCAGGTTCGTTCATC 

AAAT CATAT TCCATTGC ATGGGTGGACCATATTTTGTTCATTTATCTGTCC TGAACA CTTGAGTT GCT TCCGTGTTTTAGCT ACTGTG AAT AACGCT 
GTTGTGGACATGGGTGGACAAATGGGTCTTCAAGACCCTGCTTTCAATTCTTTGATGTGTATACCCAGAAGTGGAACTGCTGGAGCGATGAAAATTC 
TATAATT AATATTTT AAAGGT ACCACTGT ACTGTTTAC CACAGTGGCTGCAT CATTTTAC ATTCC CACC AGCAATGCAC AAGG ATTCCAG TTTCCCC 
ACATCCTCACCAACACTTGTTATTTTCTGAAGAACATTGATTTTTTTAAAAAGGATT^^ 

C C C T CC CTG CTTTC C AG GGATAG CCAGCTGT AATCGCACAGCTTCTGTGTGGGTGCAGGAGG CCAGCAAGC ACTGAGAC AGACGCT ACGGGCAGGGC 
ATTTCCGAGAAGGAAATATTTTCATTCTCCATTTTTAAGGTGAGAAACTGAGGCCCAGGAGTGTAAAGCCAGTACCCTAATATACAACTACTAAAAG 
AGGAAG C CAGG ATT C AAATC CAGGC CTTTGTGGTT TC ATAGCCTTATCT CTTAATTC TGATGCCAGCCTGCCT TCC CTT CCACCATTG ACGTACCAA 
GAGT CAGAG AGGTGGCA CGGTGT AC CTGTAT CCAC ACC ACTGGTG AGCAGC CCTCCAGCATCACACCCAGACTTTCTAAGATGCTTGGTATCCCACA 
GATGGGAGCCAATTTCTTTAGGTGGGGAAAGAGAATGGGAGCAGAGAGAACAGCCTGTTCCACAAA 

G CATAAAGG TC TAATTAA CATTT ATTAGGTG CTAAGTGCACTG CC ATGTGT CAACTC CTGTAATC CTT GAAGT AAT CCTGGGAGAC AGGC ATTTCAT 
T TC T CACCT TAC AG ATGGGGGCACAG AG C AC AGGGAGG ATCTTGAGAGGGT CCC ATG GT C AG CAAGTGTTGGTGAAAGG ATATCATCCTGGCTGGTC 
CTAC CC AGGTG CTC CAT CTT C AT CCTAT AAAACTCT CTGTGCTTT AGTTT CTTCTATGAAACAGG AC ACC TATTTC ATAGGGTAGTT GTCGGGATTA 
GTGATAAAGTGCTAAGAGTAATAGCTGGGGCACAGTTAATCCTATGTAAATATTGGCGAATTGTCTTAGTCTGTTGCCTGTTGCTATAATGGAATAC 
CAT AGAGTGGGTAATTT ACAAAG AAAAGTAT GTATTATTATGATTCTGGAG GCTGGGAAGTC CAAGAG C ATGG CACTGG TACCTGGT AAGGACCTCC 
TTGCTGC 

HUMAN SEQUENCE - mRNA 

CGCTCGATCTTGGGACCCACCGCTGCCCTCAGCTCCGAGTCCAGGGCGAGTGCAGAGCACAGCGGGCGGAGGACCCCGGGCGCGGGCGCGGACGGCA 
CGCGGGGCATGAACCTGGAGGGCGGCGGCCGAGGCGGAGAGTTCGGCATGAGCGCGGTGAGCTGCGGCAACGGGAAGCTCCGCCAGTGGCTGATCGA 
CCAGATCGACAGCGGCAAGTACCCCGGGCTGGTGTGGGAGAACGAGGAGAAGAGCATCTTCCGCATCCCCTGGAAGCACGCGGGCAAGCAGGACTAC 
AACCGCGAGGAGGACGCCGCGCTCTTCAAGGCTTGGGCACTGTTTAAAGGA 

CGCGCCTGCGG TGCGCTTTGAA CAAGAG CAATGACTTTGAGGAACTGG TTG AGCGGAGC CAG CTGGACATCTC AGACCCGTAC AAAGTGTAC AGGAT 
TGTTCCTGAGGGAGCCAAAAAAGGAGCCAAGCAGCTCACCCTGGAGGACCCGCAGATGTCCATGAGCCACCCCTACACCATGACAACGCCTTACCCT 
TCGCTCCCAGCCCAGCAGGTTCACAACTACATGATGCCACCCCTCGACCGAAGCTGGAGGGACTACGTCCCGGATCAGCCACACCCGGAAATCCCGT 
ACCAATGTCCCATGACGTTTGGACCCCGCGGCCACCACTGGCAAGGCCCAGCTTGTGAAAATGGTTGCCAGGTGACAGGAACCTTTTATGCTTGTGC 
CCCACCTGAGTCCCAGGCTCCCGGAGTCCCCACAGAGCCAAGCATAAGGTCTGCCGAAGCCTTGGCGTTCTCAGACTGCCGGCTGCACATCTGCCTG 
TACTACCGGGAAATCCTCGTGAAGGAGCTGACCACGTCCAGCCCCGAGGGCTGCCGGATCTCCCATGGACATACGTATGACGCCAGCAACCTGGACC
AGGTCCTGTTCCCCTACCCAGAGGACAATGGCCACAGGAAAAACATTGAGAACCTGCTGAGCCACCTGGAGAGGGGCGTGGTCCTCTGGATGGCCCC 
CGACGG GCT CT ATG CGAAAAG ACTG TGCC AG AGCACGATCTACTGGGA CGGGCC CCTGG CGCTGTGCAACGAC CGGCCC AACAAACTGGAGAGAGAC 
CAGACCTGCAAGCTCTTTGACACACAGCAGTTCTTGTCAGAGCTGCAAGCGTTTGCTCACCACGGCCGCTCCCTGCCAAGATTCCAGGTGACTCTAT 
GCT T TGGAG AG GAG TTT CCAG AC CC TCAG AGGCAAAG AAAGCT CATCAC AG CTC ACGTAGAACCT CTGCTAGCC AG ACAACT ATATTATTTTGCTCA 
ACAAAACAGTGGACATTTCCTGAGGGGCTACGATTTACCAGAACACATCAGCAATCCAGAAGATTACCACAGATCTATCCGCCATTCCTCTATTCAA
GAATGAAAAATGTCAAGATGAGTGGTTTTCTTTTTCCTTTTTTTTTTTTTTTTTTTTGATACGGAGATACGGGGTCTTGCTCTGTCTCCCAGGCTGG 
AGTGCAGTGACACAATCTCAGCTCACTGTGACCTCCGCCTCCTGGGTTCAAGAGACTCTCCTGCCTCAGCCTCCCTGGTAGCTGGGATTACAGGTGT 
G AG CCACTG CAC CC ACC CAAG AC AAGTG ATTTTC ATT GTAAATAT TTG ACTTTAGTGAAAGCGTC C AATTG ACTGCCCTCTTACTGTTTTG AGG AAC 
T CAG AAGTGGAG AT TT CAG TTC AGCGGTTG AGG AGAATTGCGGCG AG ACAAGCATGG AAAATCAGTG ACA TCTGAT TGGCA GATGAG CTTATTTC AA 
AAGGAAGGGTGGCTTTGCATTTTCTTGTGTTCTGTAGACTGCCATCATTGATGATCACTGTGAAAATTGACCAAGTGATGTGTTTACATTTACTGAA 
ATGCGCTCTTTAATTTGTTGTAGATTAGGTCTTGCTGGAAGACAGAGAAAACTTGCCTTTCAGTATTGACACTGACTAGAGTGATGACTGCTTGTAG 
GTATGT CTGTG CCATTT CTC AGG GAAGT AAGATGTAAATTGAAGAAGC CTC ACA CGT AAAAG AAATGTATT AATGT ATGTAGG AGCTGCAGTTC TTG 
TGGAAGACACTTGCTGAGTGAAGGAAATGAATCTTTGACTGAAGCCGTGCCTGTAGCCTTGGGGAGGCCCATCCCCCACCTGCCAGCGGTTTCCTGG 
TGTGGGTCCCTCTGCCCCACCCTCCTTCCCATTGGCTTTCTCTCCTTGGCCTTTCCTGGAAGCCAGTTAGTAAACTTCCTATTTTCTTGAGTCAAAA 
AACATGAGCGCTACTCTTGGATGGGACATTTTTGTCTGTCCTACAATCTAGTAATGTCTAAGTAATGGTTAAGTTTTCTTGTTTCTGCATCTTTTTG 
ACC CTC ATT CT TTAGAG ATG CTAAAATT CTT CGC ATAAAGAAGAAG AAATT AAGGAACATAAATCTTAATACTTGAACTGTTG CCCTTCTGTCCAAG 
TACTTAACTATCTGTTCCCTTCCTCTGTGCCACGCTCCTCTGTTTGTTTGGCTGTCCAGCGATCAGCCATGGCGACACTAAAGGAGGAGGAGCCGGG 
GACTCCCAGGCTGGAGAGCACTGCCAGGACCCACCACTGGAAGCAGGATGGAGCTGACTACGGAACTGCACACTCAGTGGGCTGTTTCTGCTTATTT 
CATCTGTTCTATGCTTCCTCGTGCCAATTATAGTTTGACAGGGCCTTAAAATTACTTGGCTTTTTCCAAATGCTTCTATTTATAGAAATCCCAAAGA 
CCTCCACTTGCTTAAGTATACCTATCACTTACATTTTTGTGGTTTTGAGAAAGTACAGCAGTAGACTGGGGCGTCACCTCCAGGCCGTTTCTCATAC 
TACAGGATATTTACTATTACTCCCAGGATTCAGCAGAAGATTGCGTTAGCTCTCAAATGTGTGTTCCTGCTTTTCTAATGGATATTTTAAATTCATT 
CAACAAGCACCTAGTAAGTGCCTGCTGTATCCCTACATTACACAGTTCAGCCTTTATCAAGCTTAGTGAGCAGTGAGCACTGAAACATTATTTTTTA 
ATGTTTAAAAAGTTTCTAATATTAAAGTCAGAATATTAATACAATTAATATTAATATTAACTACAGAAAAGACAAACAGTAGAGAACAGCAAAAAAA 
TAAAAAGGATCTCCTTTTTTCCCAGCCCAAATTCTCCTCTCTAAAAGTGTCC 

AATATACATAAACTTAAAAAGAAAACCTCATGGAGTCATCTTGCACACACTTTTCATGCAGTGCTCTTTGTAGCTAAACAGTGAAGATTTACCTCGT 
T CTG CTCAG AGG CCTTGCTG TGGAG CTC CACTGCCATGTA CC CAGTAGGGT TTG ACATTTCATTAGCC ATGCAACATGGATATGTATTGGG CAG CAG 
ACTGTGTTTCGTGAACTGCAGTGATGTATACATCTTATAGATGCAAAGTATTTTGGGGTATATTATCCTAAGGGAAGATAAAGATGATATTAAGAAC 
TGCTGTTTCACGGGGCCCTTACCTGTGACCCTCTTTGCTGAAGAATATTTAACCCCACACAGCACTTCAAAGAAGCTGTCTTGGAAGTCTGTCTCAG
GAGCACCCTGTCTTCTTAATTCTCCAAGCGGATGCTCCATTTCAATTGCT^ 

AGTGGAGCTGAATTTTCTGGAAAATGCTTCTTGGCTGGGGCCACTACCTCCTTTCCTATCTTTACATCTATGTGTATGTTGACTTTTTAAAATTCTG 

AGTGAT CCAGGGTATGACCT AGGGAATGAACT AGCTATGG AAATAACT CAGGGT T AGGAATC CTAGCACTTGT CTC AGG ACTCTG AAAAGGAACGG C 

TTCCTCATTCCTTGTCTTGATAAAGTGGAATTGGCAAACTAGAATTTAGTTTGTACTCAGTGGACAGTGCTGTTGAAGATTTGAGGACTTGTTAAAG 

AGCACTGGGTCATATGGAAAAAATGTATGTGTCTCCCCAGGTGCATTTTCTTGGTTTATGTCTTGTTCTTGAGATTTTGTATATTTAGGAA 

AAG CAG TAATT AAT ATCTCCTGGAA CACTATAGAGAACCAAGTGACCGACT CATTTACAACT GAAACCTAGGAAGCCCCTGAGTCCTG AGCGAAAAC 

AGGAGAGTT AGT CG CCCTAC AGAAAACC C AGCTAG ACTAT TGGGTATGAACTAAAAAGAGACTGTGCCATGGTGAGAAAAATGTAAAAT CC T AC AGT 

GGAATGAGCAGCCCTTACAGTGTTGTTACCACCAAGGGCAGGTAGGTATTAGTGTTTGAAAAAGCTGGTCTTTGAGCGAGGGCATAAATACAGCTAG

CCCCAGGGGTGGAACAACTGTGGGAGTCTTGGGTACTCGCACCTCTTGGCTTTGTTGATGCTCCGCCAGGAAGGCCACTTGTGTGTGCGTGTCAGTT 

ACTTTTTTAGTAACAATTCAGATCCAGTGTAAACTTCCGTTCATTGCTCTCCAGTCACATGCCCCCACTTCCCCACAGGTGAAAGTTTTTCTGAAGT 
GTTGGG ATTGGTTAAGGTCTTTATTTGTATTACGTATCTC CCC AAGTC CTC TGT GGC CAG CTGCIATCTGTCTG AATGGTGCGTGAAGGCTCTC AGAC
C TT ACACAC CATTTTGT AAGTTATG TTTTAC ATGCCCCGTTTTTGAGACTG ATCTCG ATGCAGGTGGATCTCCTTG AGATC CTG ATAGCCTGTTAC A 
GGAATGAAGTAAAGGTCAGTTTTTTTTGTATTGATTTTCACAGCTTTGAG^ 

GGCCAGGCCGG CAGGCCAACCCTCCTCCAATGGAAATTCCCGTGTTGCTTCAAACTGAGACAGATGGGACTTAACAGGCAATGGGGTCCACTTCCCC 
CTCTTCAGCATCCCCCGTACC 

HUMAN SEQUENCE - CODING 

ATGAACCTGGAGGGCGGCGGCCGAGGCGGAGAGTTCGGCATGAGCGCGGTGAGCTGCGGCAACGGGAAGCTCCGCCAGTGGCTGATCGACCAGATCG
ACAGCGGCAAGTACCCCGGGCTGGTGTGGGAGAACGAGGAGAAGAGCATCTTC^ 

GGAGGACGCCGCGCTCTTCAAGGCTTGGGC^CTGTTTAAAGGAAAGTTCCGAGAAGGCATCGACAAGCCGGACCCTCCCACCTGGAAGACGCGCC 
CGSGTGCGCTTTGAACAAGAGCAATGACTrTGAGGAACTGGTTGAGCG^ 

AGGGAGCCAAAAAAGGAGCCAAGCAGCTCACCCTGGAGGACCCGCAGATGTCCATGAGCCACCCCTACACCATGACAACGCCTTACCCTTCGCTCCC 
AGCCCAGCAGGTTCACAACTACATGATGCCACCCCTCGACCGAAGCTGGAGGGACTACGTCCCGGATCAGCCACACCCGGAAATCCCGTACCAATGT
CCCATGACGTTTGGACCCCGCGGCCACCACTGGCAAGGCCCAGCTTGTGAAAATGGTTGCCAGGTGACAGGAACCTTTTATGCTTGTGCCCCACCTG 
AGT C CC AGG CT CCCGGAGTCCCCACAGAGCCAAGCAT AAGGTCTGC CGAAG CCT TGGCGTTCTCAGACTGCCGGCTGC AC ATCTGCCTGTACTACCG 
GGAAATCCTCGTGAAGGAGCTGACCACGTCCAGCCCCGAGGGCTGCCGGATCTCCCATGGACATACGTATGACGCCAGCAACCTGGACCAGGTCCTG 
TTCCCCTACCCAGAGGACAATGGCCACAGGAAAAACATTGAGAACCTGCTGAGCCACCTGGAGAGGGGCGTGGTCCTCTGGATGGCCCCCGACGGGC 
TCT ATG CGAAAAGACTG TGCCAGAG CACG ATCTACTG GGACGGGC CCCTGG CGCTGTGC AACGACCGGCCCAA CAAACTGGAGAGAG ACC AGACCTG 
CAAGCTCTTTGACACACAGCAGTTCTTGTCAGAGCTGCAAGCGTTTGCTCACCACGGCCGCTCCCTGCCAAGATTCCAGGTGACTCTATGCTTTGGA 
GAGGAGTTTCCAGACCCTCAGAGGCAAAGAAAGCTCATCACAGCTCACGTAGAACCTCTGCTAGCCAGACAACTATATTATTTTGCTCAACAAAACA 
GTGGACATTTCCTGAGGGGCTACGATTTACCAGAACACATCAGCAATCCAGAAGATTACCACAGATCTATCCGCCATTCCTCTATTCAAGAATGA 
TABLE X



MOUSE NOMENCLATURE 
ICSGNM Notch1
Celera mCG!B747 

HUMAN NOMENCLATURE 
HGNC NOTCH1
Celera hCG1780817 

MOUSE SEQUENCE - GENOMIC 

GTCTGGAGGTTTTATTTGTCCCTGAGGGGGTGCCATTTTGGTTTGGGGGTATGATATGAAGAACCTGTGTCTGAGAGCGGAGTTGTAGAGAGTGGGC 
AGCAGCCAGCATCCTGGCATAGTCCCCTTGCTACAGGGCAGGAACTGAAGAGAGAGAAAGCACACACAGTCTCCAGTCTGCTCTGCTTCCCGCGTCC 
ACTGTGCTATTGATGTTTGAAAGTGGAAGGAACCGTGAACTCTTTTATTACCCAGATGATGCATATTCATGACAGAAGCTGGGAGAGAAAAGCAGAC 
CAGCCTCCTAAATGAGAGTTTCA<^GAGTCTATGTCGACATCCCCCCACAGGGCTGGGGGGAGGGGGCTCTGCTGCGCCTGTAGGCTCACTTCGAGT 
GTAG ACC CAGATTTGAAATCTGC CTGGT TTG AAATTCAGGCTGGTCTTGTCTCTAGCCTGGCCCCAATTAA CAGTCATTATTATTTCTTTTTTATTG 
AGGTGAGGTTCACTTAGTATGTGAAAGTAACCCCTTTAAAGTGAGCGCCTCCATGGCATGCTCCAGAGCTTACGTCATCCCAGAAACCTCTGCCCAG 
CAAGCAGCCACTGCACCTCCATTCTAGAGCCTGATTCCAGTTGCACCCACTGCCCAAGGTCGGGTAGCCTGTGGCCATGAGAGATGAGGG CGGGGAG 
CTGACTGCCCTACATGTCTGTCAACAGCTGCAGCGGAGCCTTTGTGGGCC^CGATGCCAGGACTCCAATCCTTGCCTCAGCACACCGTGTAAGAAT 
GCTGGAACGTGCCACGTTGTGGACCATGGTGGCACTGTGGATTATGCCTGCAGCTGTCCCCTGGGTTTCTCTGGGCCCCTCTGCCTGACACCTCTGG 
ACAACGCCTGCCTGGCCAACCCCTGCCGCAATGGGGGCACCTGTGACCTGCTCACTCTCACAGAGTACAAGTGCCGCTGCCCACCAGGGTGGTCAGG 
TAAGGGCTATAGGAGGTTGTGAACTTCACGGTTGGGACCCTTAGGCCTGGGCCCAAGGACCCTATCTTTGTTGACTTGACATACTCTGGTACACTAA 
ACTTTATCCTCTCCTGGATCAGAGTTAAGCTGGGATCTACTATCCAGAAAAATCTTGGGGATTCCTTGGGAACTCTGGTCCTTTTCAACCCCAAACT 
ATCTGATTTTTGGCTGCCCTGTTCCAGGACCCCAACCTCTCACTGTTTCTGTGCGTAAAGAATGAGTCCTCATGTAGCCCGGGTCTTACCATCTCCT 
TGTCAGGGTGCCTCAGGTTTCCATTGCTTCTCTATGTGGGGTCCAGGAGACAGCTAGGTACAAATTTTAGACTGCCATTCCTACTTTCTGTCCTTGA 
GCAGTCACTTAGTCTCTTATGACTTACAGTCCTCTGTATGCAGAAAGATGGCCACATTCTGCAACAGATACTGAGCATCATGAAGATAACACTTTAG 
ATCTATCCCTGGTTGGTACCTACATGATGTCCAAATCTGCCCTGGGTTCACAGGGAGGCCCTGGCAACAGCTGACCCTCTTCTCTGCCTCCCAGGAA 
AATCATGTCAGCAGGCTGACCCCTGTGCCTCCAACCCCTGTGCCAATGGTGGCCAGTGCCTGCCCTTTGAGTCTTCATACATCTGTCGCTGCCCGCC 
TGGCTTCCATGGCCCCACCTGCAGGCAAGATGTTAATGAGTGCAGCCAGAACCCTGGGCTGTGCCGCCATGGAGGCACCTGCCACAATGAGATCGGC 
TCCTATCGCTGTGCCTGCCGTGCCACCCATACTGGTCCCCACTGTGAACTGCCCTATGTGCCCTGCAGCCCCTCACCCTGCCAGAATGGAGGCACCT 
GCCGTCCTACAGGGGACACCACCCACGAGTGTGCCTGCTTGCCAGGTAGGTTCCATCTCAGTATGCTGGCATCTAGCCTGGTTGTCCATGTCCATGC 
GTGTT C CACTG TCAAGCC CGAGGGATGC AGAAAGAGTT AATACCCTTTGAAGGG AGCCCAAAAAGAAGT TT AG A GA A GGGCAG CAG C CATGGAGCCT 
TAGGTGTGGTATAAGAGCAGGTCAGGATAGGGATACCAGAAGTGAGGGGACACTCTCAGTGACACCACACACGTCTATTTCTGTCTGGGCCATCTAC 
TGAGATCCAGTTAGTTTGGTGCATAACCTCCAAGCCTTCTTTCTGCTGATAAGAGTGCCTTCCCTTGGTATTTTCTTCATGCTGGGCCTCCCCTGAC 
CCATAGCCAACTAATACCCACCATGTCAATACCCTGAGCCTCTTAGGGCAGGGAGCCAGCCTGCCTTTACAGATGCTCATGAACTGTTTGGAATCCA 
CATGTGACTCTCTGTATGTTACGGGAGCACCCTGTCATGGGCAGTCCCTGGGGTTTCCCCTTTCTCTGGGGATGGTAATTGAATAACAGGGTCTTGA 
GGGCCTTAGGGGCCTCACGTTTTTCATTACTTCCTCTGCAGTGTTCTTCCCACTGTGCTCAATGTCTCTGCAGGGGTGGGGCCTGCAGGCACAGCCC 
TAGTCCTTTCTGTTGCTGCTCTAACGATGGTAACTCGGTGATCTAAGGCAGCTTGGAGCTGCCGCCAACCCACTTTTGGCAAGGGTGGGGTTGGTCA 
ACATACCAGGCCGTTGAGAGGAAGGAGGAGTGGGCTCAGGAAATAGGAATAGATTTGGTTGCTGGGGGTGACCTCGGACCACTCCCCTTGCAACCCC 
GGGGCCACCCTACACACTGAAAAAGGGAAGTTCCCTTCTGCCCCAAGCTTGCAGAGCCAGCCTCAAGACTGCGTGCAGACACTGGCTTATACATTTC 
CTGAAAGTCATTGGGCAAGCTCGGAGGTAGCCGACAGCTGCCTACACGCTTCCCAGCCTCCCTGGGGGGGCTGGGACAGGATCCTCACGCTCAGGAG 
GGCCCCAGAGCTGAGTTCAGCTCTGTCTTGGTGGCCACCTAAGACCTCAGCTGTTAGAACCTGTGTTGAGAGGCTAGACTGGGTCTGTCATGCAGAC
TGAGAGAAAGCCTTGTTCAAGGAGCTTGTGAATGGAGCTGCCACCAATCCACCTTAGGCAGGGTGGGGTGGGTCAACTCAGAGTGCTGGAAGATGGG
CAGGAGGGAAGCAAGGACTGGGTGGGGTCCCTTTACCTTAGCCGGTGGGTCATCCCTGAAGACGGAACACACACTACCCTGTGCTGTGTGGAGACAT 
GCTGTTTCAGAGTAGGGAGAAGTGAAAATTGTCCACCCAAATTGCTGTTGAAGGGAGTCCGGTCTGGGCACAGTTATACCCACCATTTGGGCTCCCA 
GGTAGCTATTTACCAGGTTTCTGAAGAATGTATGAACTAGGGTGGACATAAAGAACTAAGATGGGAACAGAACTTAAGCCTGAGAGCTGACTGGGAA 
TTAAAGCGACTCTGGCTCCTGTATATCTATCCTCTCACTCTCTCACAAGCCCGGTTTCCCTGGAGGAGGCTGTATAGGCAGAGAAGGCTTTGGCCAG 
GCTCCACATAGGACCTATGATCCCTTCTCCTCGCCCCCAGTGAAATCCTTCTACACCTGCTCAGGGCCCCTCCAGGCTCGCTTCTTCTTGGAAGCGT 
GCCCTGTGCTAGAATAAATCTCTGCACAGGCAGGGAGACAAGCTATTTGGTCTTGCTGACCCCTGGGCGGGCACTAGGCTCCAAGTTGGTTCCTCCC 
TCACCCAGATAAGCCTTTTCCAAGCCTCCAGCTTCTTGCTGTGGTGCAAGATGGTGGGGTGGAGGGGCATCTTTGAGGAGGAGGCAGACAGGTGGGT 
TCTCCMGGGCAGGAGGGGTTCAAAGGCATCCCCAGAGATGGTGTTAATTAGTAACCGGCTGCAAGTTTGGCTGTTGCCAGACTCGCCTTTCTGGGT 
TGATCATTGACCCGGCAGGAGTTGAGGAGATTAATAGCTAACTAGCTGCTGAGCAGGCCCTCAGGAAGCCACACTGCAGGCC^CACCTTGGGGCCCA 
GCCCCAGGCTAGATGGGCTTGTAGAAGAGAGATGGTTGCCCTGGCTGTGCCCTGAGGAAATCACAGGTTCTGACTGAGCGGGCCTGCCTGGATCCCA 
CCACTCTGGTTTCTAACTTAGTGCTCCCAAGCCCCACTTTTAGCTCCAGCAGTTTTAGGTATTCGTGGCTGCCCCACTGATTCAGTGTGGACAACCA 
TAGAGTGCAATCCTCTGAAACTCTGTTAGTGGTTTTGTGGCAGAACCTCCAAGGACACTATGTTCCTAGATCCCTGTCTGTTGAGTCAGCTCTTTAC 
ATAGCGACTTTCTAGAGAAGAGCTGTCACCTGTGGACACGGGAGTAGGGATGCCACGGAAGGACAGACAGATGCCTTGTCCAGAGCAGAACATTGCA
GGTCCACCCAGAATCTGAAGATACCGGAGCCCACCCGAGGCTCCTCAGCAGACTTCTGAGGTCAGGCTAACCTGCTGGCTGGCCCCTGTCCCACAGG
TTTTGCTGGACAGAACTGTGAAGAAAATGTGGATGACTGTCCAGGAAACAACTGCAAGAATGGGGGTGCCTGTGTGGACGGCGTGAATACCTACAAT
TGCCGCTGCCCACCGGAGTGGACGGGTATGTATATGGGACTTGTAGGCAGCAGCCTTCCACTAACAAAACTGCACACCGCGGAGTGGTAGGAAGGTC 
TGAACTCTGTTCCACGCTATGAAAGCCCACACTGCAAGACCCAGGGTGGCTGGTAGCATGGGAGCAGCAGCTTCTGTGCTTCCAGTTGGAGTCAGCA 
CAGCTCAGCATCAGATGGTTCTTGTGAGCTATTAGCAGCTAACATCCAGAGTCAGGCCCCCAGCACTGAGCCATACAGGAGCTAGCATTCTGGGCAA 
GCTACCAGGGATGAATACAAGGTTGTCCTCAGCCCCACAGCCCAGAGCACGGAGCTCATCTTGTAGCTGAGGTTGATGTTCCTGTGACCCCTCATAG 
GTCAGTACTGTACAGAGGATGTGGACGAATGTCAGCTCATGCCCAATGCCTGCCAGAATGGCGGAACCTGCCACAACACACACGGCGGCTACAACTG 
TGTGTGTGTCAATGGGTGGACTGGCGAGGACTGCAGTGAGAACATTGATGACTGTGCCAGTGCCGCCTGTTTCCAGGGTGCCACTTGCCACGACCGT 
GTGGCTTCCTTCTACTGTGAATGTCCGCATGGGCGCACAGGTAAGTCGGTGCAGGAATGTCCTGGGAGAAGGGAAACTTGGTTGGCTCACCACCCCT 
GGTGGAAACACCTGGGGAGCCATGTATGAGATATATATACAGATATATGTGTGTATATACATATATATATATATATATATATATATATATATATATA 
TATATCAGATATAGATACAGCTCAGATGTAGATACAGGGCTGGAAATGCAGCACAGTTGGTAGAGTGGCTGCCTAGCATGCACAGGATACTGGACTC 
AATCCCTAGGACCACATAAACTGTGGGTAGTGGCACGTGCCTGTCAAGCCAACTCCCAGGGAAGAGTGAATCCAAAGGTTAAGGCCATCTTTAGCTA 
TATAGCAAACTTGAGTTCAGCCTGGAATATGTGAGACCCTGTTTTAGGAAAGGGATGGGGAGAGAGAAAGCTCTGTCTCTTTGGCACCTCCAGCAAT 
GACCCAGTAAGCCTTGCTGGACAGAGGCATCCTGGTGATCTGAACTTTGTGGGATGCTTGAGTGAGTCAGAGAGATTCATTCCAGGGTGTTGGGTGA 
GGATGGCCCTGCTGCAACAACCTGTTAATCCCAGGATAGGGAAAAGCAGCACCCACTCACCGCAGCTGCCTTGTCTAGGTCTGCTGTGCCACCTCAA 
CGATGCGTGCATCAGCAACCCCTGCAACGAGGGCTCCAACTGTGACACCAACCCTGTCAACGGCAAAGCCATCTGCACCTGCCCCTCGGGGTACACA 
GGGCCAGCCTGCAGCCAGGACGTGGATGAGTGTGCTCTGGGTAGGTACAGATGCAGTGGCACAGGCAGCAGGAACGGGCTGGCTGTCACATATATGG 
GATGTGATCTGTCCTGGAGAACCACCATTTTGATGGCCTGGGCTGGGGCATTGCGGGTGGTCGGGAGGCTGCAAGGCCAGATGAGCCCTCATCCGTG

CCTCTTTCACCAGGTGCCAACCCTTGTGAGCACGCAGGCAAATGCCTCAACACACTGGGTTCTTTTGAGTGCCAGTGTCTACAGGGCTACACGGGAC 

CCCGCTGTGAGATTGATGTTAATGAGTGCATCTCCAACCCATGTCAGAATGATGCCACTTGCCTGGACCAGATTGGGGAGTTCCAATGCATATGTAT 

GCCAGGTATGTCTTATGCCATTTCCCACCATGGGGGCTGCTAAGAGCAGGCAGCAGGGGAATGAGTCAAGGCTGCATAGCTTCAAGTAACATTTCTG 

TCATGGTTGGTAGCATCTGGGAAGCCGAGGCCAGGATAAC^GGACAAAGTGGAGAATGTACCCAGAGTTGGGTGCATAGACTCAGTAGATATAGACA 

TGAACCACAGTGGGCACACTAGGCCCGTATGAGGGAGGGTTATTTCTCTCCCAAGTCTTCAGCTGGGGCTCTAGAAGGACCTACCGAAGCCGTCCTC 

GGTCTCACCATCTCCCGTCCACAGGTTATGAAGGTGTATACTGTGAAATCAACACGGATGAGTGCGCCAGCAGCCCCTGTCTGCACAATG 

CATGGACAAGATCAATGAGTTCCAATGTCAGTGCCCCAAAGGTGAGGCCACCTACTTACCACAGGACCCGTATCCTCATGATCTCTTCCCAGACTCC 

TTTGAGGGAGGACTTGAGTTGACTAGGGCGTTTTCAGGCTTTCCTCAAAAGGTAAAGGCCC^CGTGAGCAGTGGACGCTGTAGGGGACCTGGCACCG 

GAAACCCTTTACTAAAGGAGGGAAGGATGGGCAGGAGCTGCAGCGCTCCTAAGATAGGAGAGTCCACCTGCTTGGCTCTGTCCTCTTCCTGATGGTC 

ATGGGAGTAAGGACATAGTGTGGGGATCTGACCTCTATGCCTGGGAAGGACTTGGAGGAACTGAAGAGGGGACCAAAGTCCTGGGAATCTGTCTCTT 

GCCTACTCCAGGGCTGGACCAGGCCTTGGGCCTTCTGAGATGTAAATAAGCCATCCTAGACTTGGTCCAGCTAGGGGAGTCCCATTCACAGCCTAGT 

GCAGCTGTAACCATAGCAACCAGAGTCTATTTAGGAAAAAGGCCCCTCTGTTGGGCCAAGGAGGCAGCCACTGAGCAGTGCTAAACAAGGAGCTGAT 

TAGAGGGGCAGGGTGCAAGGGGCAGGATATAAGGGCTGGTTGAAGCCTGGAGGTTAGTGGCCCCTGTTATCAAAGTACCCTCTAGGGAGCCCATGTG 

AGCAGACATTCAACTCTAAGGTTTGTGGGTAGCAACCAAGGGGTTAGGATACTGTCAGATATAACCATCTCGTCTTCCCTGGCCACAGGCTTCAACG 

GGCACCTGTGCCAGTATGATGTGGATGAGTGTGCCAGCACACCATGCAAGAACGGTGCCAAGTGCCTGGATGGGCCC»ACACCTATACCTGCGTGTG 

TACAGAAGGTACAAGAGGCAGTGCAAGATCTGGGGGAGGGAACTCTTGAGGCTAGATAGTGCTGAGAAGGGGACAGGAAACTGGGGAGGCCTTAGTT 

TGGGGCCATATAATCTGAGAGTCAGAAACTTAACTCCGCTTGGACTCCAGGCAGGAGTGATGAGAACATCCCTGGGTATGTGATACCCCACATCCAT 

CGATGACCAGGCAGTGATCCTGGGCCTGCTCGGTGGTTTGCCAAGGTGTGGCCAACTAAGTGTTGAGGGAGTAACTGGCCCTGAAGCTCACTAATGT

CCACCTTTCCTGCCATCTAGGTTACACAGGGACCCACTGCGAAGTGGACATTGACGAGTGTGACCCTGACCCCTGCCACTATGGTTCCTGTAAGGAT 

GGTGTGGCCACCTTTACCTGCCTGTGCCAGCCAGGCTACACAGGCCATCACTGTGAGACCAACATCAATGAGTGCCACAGCCAACCGTGCCGCCATG 

GGGGCACCTGCCAGGACCGTGACAACTCCTACCTCTGCTTATGCCTCAAGGGAACCACAGGTGACTGGCCGGAGCTGGCAGTGGGTGGAAAAGGGTG 

AGGCCAGGCTTGGAAGCTGACCACATGTCATGCTGCCCCTAGGGCCCAACTGTGAGATCAACCTGGATGACTGCGCCAGCAACCCCTGTGACTCTGG 

CACCTGTCTGGACAAGATTGATGGCTACGAATGTGCCTGTGAACCAGGCTACACAGGTGAGTGGCTGAAGGCAGAGGTCAGGGTCATCCCTAGAGTC

TGGGAGGGACTGTGGGGCTAGCCAGGGTCCAAGGGCTGGATGTCACAAGCTGGGTGTCAACTACCCAGATCTTGATCTGCCAACCAGTAGAGGTATC 

TATCATAAAGACTGGGTGGCCTCCATATGCCTCCTTTCCCCTGTCAGTGTCAGCCCTGGCCATGCCAGACTTAAAGCCCCAGTAGCCTGAAGTGGGG 

TGGCCAGAGCTCCGAGGCCAGGCTTTTGGATATGTGTTTAGATCTGATAAAATCCTTAGTTTCTCTGAGCACCGAGGACATTGTCCACCAGAGGATA 

GAAGAGGATGGGTGGTCTCAGGCTGAGGTAAAGCTGGGCTGCAGAGTGACTCAGTTGGTAAGGTGTTGACCTAGCATGTGTAAACCCTGGGTTCAGT 

CCTCCTGCCATGTTAACCAGATGCAGTGGGATGCATACGTTGTAACCCTAGCACTCTGACATTGGAAGCGAGAGGATCAGAAGTTTAGGTCATTCTC

AGCTACATGTGAGTTTGTGGCCAGACTGGATCACATGAGGCTCAGAAGCTACTGGCACTACGGTCATGTAACTCATCGATACATAAGGCACTTTACT 

ATCGCTCAAAGTTTCATAGATTAGCACAAGGGGAAACTTAGACCCAGCTAGGTCTAGCTCTAGGAGGGGTAGGCTCTGGGTGGCAGGCCTGGAATCT 

GAGCCCCTTCCCACCCACAGGAAGCATGTGTAACGTCAACATTGACGAATGTGCGGGCAGCCCCTGCCACAACGGGGGCACTTGTGAGGATGGCATC 

GCGGGCTTCACTTGCCGCTGCCCCGAGGGCTACCATGACCCCACGTGCCTGTCCGAGGTCAACGAGTGCAACAGTAACCCCTGCATCCACGGAGCTT 

GCCGGGATGGCCTCAATGGGTATGGCAGACTGACCTGGGAGGGACTAGCTGTAGCCTTGGGCAGACACAGGCTGGGTGAATGGGTGCTCCTGTGAGA 

CCAGTGATCTGAGGTCAGCCCCTAGTGTCTTCCCAGGATCCCTGGGCTGTATCCT 

GTGGCTGCTGAAACAGTTGTCTGATACTGTGGTGACAGCAGCCCTTTGTCAGTGTGTGGGGGACGAGGAGATTACTAAGGTCTCCGTTAAAACTATT 
AAAGTTTTATACAGAGGGTCAGCATTGTCAAGGTGGCATTGTTCCAGTCTGGAACCCTTGCACGGTCTTGGGCCAGGTGGGTACCTCCAGCCCAGTG 
GATGATGTGGCCTTGTCCTCAGACTGGAACTGTAGTTTAGCAGGAGAGCTCTGAGCCCAGGCTACAGCTCAGGCCTCTCCGTAGGTACAAGTGTGAC 
TGTGCCCCTGGGTGGAGTGGAACAAACTGTGACATCAACAACAACGAGTGTGAGTCCAACCCTTGTGTCAACGGTGGCACCTGCAAGGACATGACCA 
GTGGTTACGTATGCACCTGCCGAGAAGGCTTCAGTGGTAAGTGTGGCGGGTGACACAGCAGAATGGGTCCTCCCTTTGGTCTCTTTCTCGGTCCACC
TGATGAGCTAGCATTTCTCAGGCAGAGATCCTAGTTCTCTTGAGGAAAGGCATAGGGCAGAGGCTCTGGCCCACCTCCTCTTGCTCAACTTCCTTAC 
AGGCCCTAATTGCCAGACCAACATCAACGAATGTGCCTCCAACCCCTGCCTGAACCAGGGGACCTGCATTGATGATGTCGCTGGATACAAGTGCAAC 
TGTCCTCTGCCATATACAGGTACGAACAGCCAGGGACTGGAATTGGCTCTGGTGGTAGCTAGTGGAGCTAAGAGTATATGGGCTTACCAGGCACTGA
AGAAGGTTCTGTTCCTCGCTGAGGGTCTTAGGTCCTCAGACTGGAACTGTAGTTAAGCAGGAAGGCTCTGAGCCCAGGCTACAGCCCAGGCCTCCTG 
GGCCTGGGGATAGTCACCAGATGGGCAAACACAGAGGATGCCTAGGAGGAAGCTCCAGGCCGTGTGTCCGGGTGTTTATCTGGTATGTCTGGAGTGA 
GGTGAGCACTGGACTCCTCCAGAGAAAGTCTGGGAGGCTGACTGGTAGGTACTGCATGGATCAGGGTCCACTTTGCCTGTGCCTCTGGGGGCCCTGG 
CTGCCCTCCTCTGCTGGCTGGGCCTGGGATTCCTTTGATGGCAGCTTCTTTGTGTGAGGAGACAAGGAGC^ 

CTGCCATGGGCACTGCCTGAGGAATGGCGAGAAATAGACCCCCTGCAGGAGAGATGTGCATTGAACATGAAAGCCGTGACCACATGGCCCCCAACCC
TGCAGCTCCCAATACTGCCACAGAGTACTGGTGGCTTCATGCTGCTGCTGCTAGGTTCTGGACATAGCAAGTGGGTCTGTGAAGGTGGAGGCCCTTC 
CAGTGTTCCTATGATGCCTGGCAGGATGCTGTTTTCTTTGACAAAGTTAAAGGGCCTGGATTTTGTCTCCTGTGTTGGTCAGTGAGGAAACTGTCAG 
CAGCCAGGCACACCCCATGTTCCTTTTGGTCTTGCTGCTATCGGGGCTAGCGGGAGGAG^ 

AATTGTCGGAAACTCTGTCAAAGTTCCTGTCTGCCCAGGTCACCGTGGAAACCGATAGAGAGAGAGGCCCTGGAGAGTCTCACCAGTGGGCTGGGGA 
CATGGTGGGAGACAAGGAAGATAAGGTTTCCTTGCTTAGCTCCTGGAGCCAACCATGGCCATAA 

ACCTGGGTTACTCGTGGCTCCCCCAAGCGGTTATCAGATTTCACTGCCCTAGTGTGAGGCTGCACCCCATGTTCCCCAAGTAGGATATGCTTGGGGT 

AGCCACAGTAGCTCTGTCTCAGGAGTGAGCCTGCTGTGACCTAAAGCTTCAGAATCAGCTCAAGGATTTTCCTGGAATGGGGAGCAGGCCAGGTTCC 

TCCAGGCTCAACCACCTGTTATAGTAGGGTCTTGTTTTGAGGTTTTCTGGTTGGGCAAGGCCCAGGCTTCATCGTGGAGCATTTGGGAGATGAGAGA 

TGGCTTCCGGGACTACATAGATGGCTCATGGAGAAAGAGTGCTTGTGGAAACATGAAGCCTGTTTCATGAAAACAGGGTACCTCTAAATGCCCATAA 

CCCCAACATTTGGGAGCAGAGACATCTTGAGAGCTCTCTGGCTGGCCAGCCTAGCTAAGATGGCAAAGCATCTGATTCAGTGAGAC 

GGAATAAGATGCGGGGTTGGTAGAGGACACCTGATATCCGTCTCTCCTTCTGCATACACTCTTACACACCTGCATGCAATATACTATGCTCGTTAAA 

AAAAAAAAAAAGTGACTTAAGAGAGAAACTTGATTCTGCTTTATCTCTAACCTCCTGTCAGTAATCGTCATTCCTCCCGGGATCAGAGTCTGGCTGG 

CCAATCATACAGCATCTCCTATACCTACCCCCCGCCCCCCTTTATGGAAGACCCTATGGGCCAGGCTCACTGGGTTCTATCTTCTACAAATCTTTCT 

TTCTAGGAGCCACGTGTGAGGTGGTGTTGGCCCCATGTGCTACCAGCCCCTGCAAAAACAGCGGGGTATGCAAGGAGTCTGAAGACTATGAGAGTTT 

TTCCTGTGTCTGTCCCACAGGCTGGCAAGGTGAGGCTTGCCTGGTCCCACCAGGCTGGGACAAGGATGGTGGATGGGAAGTCAGGGGCCCTGGCCAG 

CCTGTGGGGCAGTCTCGAGAAAGGAGCCCATCAATGCTGAGCTACCTGCTCTGGGAAGCTCTGGGAAGCTCTGGGAAGCTCTGGGAGCAGGAGATGG 

TCCTTCCCATTCCAACTCCATATCCTTCTGTTGCCCAGAGGTCAGGGGAGAAGCTCTGGAGATAAACAGACCAGCTAGGACAGAGGCTCAGAGGGTG 

GGGGCAGGTAACCAAATGAGTTTTTTGTCTTCACAGGTCAAACCTGCGAGGTTGACATCAATGAGTGTGTGAAAAGCCCATGTCGCCATGGGGCCTC 

CTGCCAGAACACCAATGGCAGCTACCGCTGCCTCTGCCAGGCCGGCTATACAGGTCGCAACTGT^ 

GTGGTACCCAGCTCTGTCCATCCTATGTTTGGGTGCCTCTGTTCCTAAGCACACATGACAGGGAGCTGCTGAGAGTCTGAGGTGGAAAATAGCCCTG 
GTGCCCGGCAGTCATGAAGGACTGTGGATAGCTGTGCCCTCAGCTTAGTCCACTTGCCTGTGCCTTAAAAAGAGACAGGCCAGAGGCTAGACAGGCC 
AGGAGGAGGCTGTTGAGCATCAGTGTGTGCTTCTCACGGGCTTTGCTGCTAGATTTCCCAGCCCAGTTGAGTGATATGGAGCAGAGGCACCAGGCAC 
TTCAAAATGATGGCATGGGGCCTAGTTGTATTCAACCTTGACACTCGGCCCTTGTCAGGACCCATTGAGTTGTGTAGGGCTTTAGAGATGCCCTGCA 
TCCTGAGTCAAAGCTAAAGGCAGAGAGAGATTGAGGGAGGTTCTGCCTGGCACCATCAGTGTGTGTGTAGGGGAATGGGCTTCCCTATGACTGCCTT
GGGCCTGAGGGAAAAAGTGTATGCGGAGCAGATTCCAGGCTCAGTGCTGGGATGGGCCCCTGTTGACTACTGTCATCAAGGCTTGCTGTGAGCACTT
GCCACCCCATACCACCTTCTCAGCATCAGGTCCCCGAAGC^ 
GGGGTTCCTGCACCGATGGCATCAA(^CAGCCTTCTC^ 

C^TCCCTGCCAAAATGGTGCC^TTGCACTGACTGTGTGGACAGCTACACATGTACCTGCCCCGTGGGCTTCAATGGCATCCACTGCGAGAACAAC 
ACACCTGACTGTACTGAGAGGTGGGTCCCAGCCTCAGCTGAGACAGGGAAAGAAATGGACTGTGGAAGACACAAAGTCTTGACAGCATGGCATCTCA
AGCTGGTGTCTTGCCCAGACCCAGTGGGCATACAGCACTAATTGAACTGAATCCAGCGTTGTTCATTATGTTGCAGTATCCCTCCTAAAGGGGCCTT 
TGTGACCTCCAGTGACTTAGAACAAGGAAGTTTGTGCAGAGGAATTGGGGAA 

CTTTTGCCCTGGTTGCCAGGAAGCTTAGAGTTAAATCAATCTCAAGCCTGGTTTGTGATGCGCTTCCTATAGTGTTCCTATTTTGCCCTTGATTTTG 

TTTTTCTCIAACCACCACGTGTCTATATGCTATAAAATCTCTGCTCGTGATTTGG 

ACCTTAGGGAACTCCTCACTCTTCCCTGCTTCTCATCTGCCTCTAGC^^ 

TCTGTGTCCACCTGGCTTCACGGGCAGCTACTGTCAGTATGATGTCAATGAGTGTGATTCACGGCCCTGTCTGCACGGTGGTACCTGCCAAGACAGC 

TATGGTACTTATAAGTGTACCTGCCCACAGGGCTACACTGGTCTCAACTGCCAGGTGAGCTGAGGTTACAGGGTATGACCCAGAAGGCCTAGGGCTC 

TCCAGTGCCCATTGCCCAGCCCTGGGCCACACTACCTTCAGAGGGCCAGGGACACAGAAGCAGAGAGGCTCCCTGGTGTCATCATTGGTTGATGAGC 

CAGAATGAACCCACATCCTCTGGGAAAATGCCAAGTTGGGTGCTTCTGAGGGAGACAGGTGGGCATGGGTGCCAGCCAGGCTCTTCTGTGGAGTGCA

TTATGTGGCCATCTGTCTGAAGCAGCCTGGGGCAGTGCTCACTTCTGGTGGAGGATGTCCAAGGGGAGGTAGCCAGTGGGTTGTGGAGAGCCTTGAA 

AGAGAGCAAGGAAGACGTGGAGCCCAGGCCTATGTTGTTCGGAGGGCGGGCAGTGTTTGTGAGTCCTGTGCCTGGGATTTCTTAGGTGATATTCTTT 

CTGAGATGGTGGGGGTGGGGGTGGGGAAGTCAGAGCATAAAATGCCTTATTCCAGTGGCGCTCA^ 

ATCAGATATCCTGCATATTTTTGCTACATGACAGTAGCAAAATTACAGTT^ 

GAGGAACTATATTAAAAGACTGCAGCATTAGGAAGGCTGAGAACTGCTGTGCTTCTCAGAGCCAGCCAGGACCAGGGCAGGAGGGTGCCCTAGCAGT 

CAGAGTTGGTCAGGGAGAAGCTAACCAGCGCCTGTCTCCTCAGAACCTTGTGCGCTGGTGCGACTCGGCTCCCTGCAAGAATGGTGGCAGGTGCTGG 

CAGACCAACACGCAGTACCACTGTGAGTGCCGCAGCGGCTGGACTGGCGTCAACTGCGACGTGCTCAGTGTGTCCTGTGAGGTGGCTGCACAGAAGC 

GAGGTAACCAACCGGCCGGGCCCACCTGTCCTTGCTCAGGCTCAGCTTAGGCCAGGAGTGCCCCGGTGCCTGCCAGGTCAGGTGACAATGGGCAGTA 

GTCCCGAGAATCGGAGCCCAGGCCTCAGCTCCTTCCCGTCTTGTCCCCACAGGCATTGACGTCACTCTCCTGTGCCAGCATGGAGGGCTCTGTGTGG 

ATGAGGGAGATAAACATTACTGCCACTGCCAGGCAGGCTACACGGGCAGCTACTGTGAGGACGAGGTGGACGAGTGCTCACCTAACCCCTGCCAGAA 

TGGAGCTACCTGCACTGACTATCTCGGCGGCTTTTCCTGCAAGGTGTGGGCCGTCATGGGGTGGCGGGGAGGGTATCAGCGTGAGTGTTGCTGCAGG 

GGAGGGCTTTTGACTGGGAGCTAGTGCATCATGGCAGACTGAGCACATACCTGTGCAGGATCCGGTCACATTACAGATGTGCTGAAGGAGGAGAACT 

ATGCCAAGCAGTGGCTTAGGGCACCCCTTTCCTATATGGTTTCATGGTACCTAATACCTGCTCCAAGACCTCATCCCTGGGGGACCAGGTGTTGCCC 

CACACACACACACCTTGGACACTTATTTTAACTTACTGGGGTCTACTGGGTGGAGCCCTCGCTGCAGGGTGGGACAGTGTAGTCTAACTGGGTATAT 

GGCCTGCATGGCAAGTGGAGGACATGATACCACAACCCCAGCTAAGCTGTGACCTGTGCCATTTCTACAGTGTGTGGCTGGCTACCATGGGTCTAAC 

TGCTCCGAGGAGATCAACGAGTGCCTGTCCCAGCCCTGCCAGAATGGGGGTACCTGCATTGATCTGACCAACTCCTACAAGTGTTCCTGCCCCCGGG 

GGACACAGGGTAAGGGTAGGGACCTGGATGATGGGCAGGATCCAGCCAGAGATCACTCAGGTGATTGAGAGCTGACCACGGAATAGGAATCTCAGAA

GGGAGGTGGGCTCCAGTGGCACTGTGCTGCCTCCCCCGGGCCCCAGGTCCTAACTGATTTCTGGACTGGGCTGCACAGGACAGAGCACAGTCAGCAT 

CATCCGACGGTGTCTCTGGTCACTCTGCAGGTGTACACTGTGAGATCAATGTTGATGACTGCCATCCCCCCCTTGACCCTGCCTCCCGAAGCCCCAA 

GTGCTTCAACAATGGCACCTGTGTGGACCAGGTGGGTGGCTATACCTGCACCTGCCCACCAGGCTTCGTCGGGGAGCGGTGTGAGGGTGATGTCAAT 

GAATGTCTCrCCAACCCCTGTGACCCACGTGGCACCCAGAACTGTGTGCAGCGTGTTAATGACTTCCACTGCGAGTGCCGGGCTGGCCACACTGGTG 

CGTGCCTTGGGCAGGTGGGTCGAACGGGTGGGTGGGCAGGGTCAACAGGGATGGCTCCTTACTGTCCCTTCTTCACACAGGACGCCGCTGTGAGTCA 

GTCATCAATGGCTGCAGGGGCAAACCTTGCAAGAATGGGGGTGTCTGTGCCGTGGCCTCCAACACCGCCCGTGGATTCATCTGTAGGTGCCCTGCGG 

TAGGTGTCCCTGGCCCTGTGCGGCTGCAAGGGTGCCAGGTCTGAGGGTGCCTGAACCCTGACCGACTCTTATGGTGGATCTGTCCTCCACCCACCCT 

CACTGTGAATAGTCAGCCTGAAGACCTCTAGCAGAGGCTAGGGTCCAAGCCACATGCTTCCTCGTGTCCATTCCCCAGCTCGTTTATAAAACCCGAG 

CGGGCACGATCCATACATGCCTTGCTCCCAGTGCTCTG<^GCCTCAGGCTGAAAGATTGCCTTGTATTTGAGGTACCTTAGGCTTCACAACAAGACC 

CTGTTTAAAAAGTTTATTTCCTGCTGTGCTATGTTGGGAACCTCACACATGCTGGGCAAGTGTTTTGCCACTGTGTCTTAGCCCAATTTTTTAAATA 

GATGTTTATTTTAGAGCGGTTTTAGGTTTGCATGAGAACTTAGCAAGAGGGCCAGGTCCCCACCCTGCAGGCCGTTCCCTATGTTTGCTGCCTTGCG 

TGGAAGGTGTTGAGCTGGCACTAATATGCAGTATTTGCTCTTGGGTGTTGACATCCAGGGATCTGTACACATACAATGTACCAGCATCAGCCTTGGG 

TTTCATGGCTCTAAAAACTCCTAGGGTTCTCCCCATTCACCTATCCCTCCAGTTTGAAGCCCTTGGCAACCCTGACCATCCTATGGTGAGATGTTTC 

AAGGGTCTTGTTATTGGGGCAAGGCCATAGACACCTTTCACAGGCCACAGACTGGGGCCAGGTCTTTGCTGGGGTGTGCATGGTGTCCCTGATTGAA

TGCACCATCCTCTCTCCACCACAGGGCTTCGAGGGTGCC^CATGTGAGAATGATGCCCGCACTTGTGGCAGCTTACGCTGCCTCAACGGTGGTACAT 

GCATCTCGGGCCCACGTAGTCCCACCTGCCTATGCCTGGGATCCTTCACCGGCCCTGAGTGCCAGTTCCCAGCCAGCAGCCCCTGTGTGGGTAGCAA 

CCCCTGCTACAATCAGGGCACCTGTGAGCCCACATCCGAGAACCCTTTCTACCGCTGTCTATGCCCTGCCAAATTCAACGGGCTACTGTGCCACATC 

CTGGACTACAGCTTCACAGGTGGCGCTGGGCGCGACATTCCCCCACCGCAGATTGAGGAGGCCTGTGAGCTGCCTGAGTGCCAGGTGGATGCAGGCA 

ATAAGGTCTGCAACCTGCAGTGTAATAATCAOTCATGTGGCTGGGATGGTGGCGACTGCTCCCTCAACTTCAATGACCCCTGGAAGAACTGCACGCA 

GTCTCTACAGTGCTGGAAGTATTTTAGCGACGGCCACTGTGACAGCCAGTGCAACTCGGCCGGCTGCCTCTTTGATGGCTTCGACTGCCAGCTCACC 

GAGGGACAGTGCAAGTAAGTAAGGCTGAGTTTCTTTAGAGTCCCAGGGCTCAGGATGCTACGGGAGGACCTAACCAAACACCAGGCTCCT^ 

ATGTCTATCCCTGCCCATTGCGAGTCGCCAAGCACATTTCCCAGATCTGGCCTATTCAAGGTATGGGTTTGAGGTCCACAGGCTGGGATGGGACTGA

GTGCATCCTATACCTCCTCAGCCCCCTGTATGACCAGTACTGCAAGGACCACTTCAGTGATGGCCACTGCGACCAGGGCTGTAACAGTGCCGAATGT

GAGTGGGATGGCCTAGACTGTGCTGAGCATGTACCCGAGCGGCTGGCAGCCGGCACCCTGGTGCTGGTGGTGCTGCTTCCACCCGACCAGCTACGGA 

ACAACTCCTTCCACTTTCTGCGGGAGCTCAGCCACGTGCTGCACACCAACGTGGTCTTCAAGCGTGATGCGC^AAGGCCA 

CTATGGCCACGAGGAAGAGCTGCGCAAGCACCCAATCAAGCGCTCTACAGTGGGTTGGGCCACCTC^ 

CAGCGCAGGGAGCTGGACCCCATGGACATCCGTGGGTGAGTGCTCCAGCTCCTGCTGTTGTGGGCTGTTTCCCAGTGTGTCCCTGGGTTCTCTAACC
AGCTTAACCCTGGAGGTGTAACCTCAGGGGAGGTAGTATTCACCTTATTTTATAGTCAGAAAGCACCTGGGATGAAGAGGGCATCCCTCTGATGAGG 
ACTTGGCAGGCCTCAGGGTTTGCCCAGATGTGATTTTCAGGCCCCACTGTGTACCAGGTGTTGGAGACCAAGCTGATTAAAGCCTCCAGGGTGTCCC
CTGGGGCTAAGTCATCCTAGACCAATCCTGACCCTCAGTCACTACAGCCCTTATCCCTACTACAGAGGAGGCAGTGAGGGGTACAGAAGGGCACTGT
TGCTGAGCCTGGAAGGCTGAGGAGCCGAGGGAGGCACAAATAAGAGCATCTCCTGGATGCTGCACAGAGCCATCTGGGGATGGGCAGGGAAGCACCA
GCCAGGCATTTGGAGGTTCCAGGAGCCTTCGCTGTTGGGCATCTGCCTGGTGGCACCTCAGTGTCTCCTGACCCCAGTGGTTCCTGGTCCTGGTTTT 
CTTGTCGTGGGACTGTGGTGAAGGTCAGAAATTAATTGTTTCCTGGAGGCTGTTAGAATCCTGTTTCTGAGAGGCCAGTGACTGAGAGTTGGTGACT 
TGGCTGGCATGAGAGAGCTGACTCTCTGGCATCTGAGCCTGCTGTCACCTCAAGCTACACCArAGGGAGTCACAACTGCATCCTTGCAGTTCTGTGG 
GGCCCCAGGCACTGCTCTCAAGAGGGCATTGGATGCCTAAAGACTATTCTTTGAAGTAAGGAAAAGGGGTGCTGTGCACCTAAGGGGCAAGGCTAGA 
CAGTGGAATGACCCCCGCTGAGTGCTAAACACTGGGGCAGCAGGGAGAGTGTCCCATTGCGGGGCTGTCGTCTGATAAGCCCTGTGGGTCAGGGTGA 
TGGGCTAGGGAGTCAGAGCTGGTGTGTAGGCAGTGTACTGGGGTGTATCTAATCTCACTGTGCCATGTGTGTTCTCAGCTCCATTGTCTACCTGGAG 
ATCGACAACCGGCAATGTGTGCAGTCATCCTCGCAGTGCTTCCAGAGTGCCACCGATGTGGCTGCCTTCCTAGGTGCTCTTGCGTCACTTGGCAGCC 
TCAATATTCCTTACAAGATTGAGGCCGTGAAGAGTGAGTAATCAGGGGCTGGAGGGATGGCTCGGTGGTTAAGAGCACTGGCTGCTCTTGCAGAGGA 
CAAGGGTTCAAATCCCAGCACCCACGCAGCATCTTATAACTATGTGTAACTCTAATTC^
CATGCACAGGGTCCTGACGTACATGCAGGCAAAACACCCATACACATTAC^ 

GGATGGGTTCATTGGGTGGCGGGTCACAGGCAGGCGGTCTCTCACTCCTTGTGCCAGCTGCCATTCCCAGCTCAGGAAGTGCTTTTCTGG 
TTCCAAGGCCTTCCCTCCCTGTGGCTAGTGGGAGCCGTAAGTAAAAGCTGTCCCCTAAGATGTATACATATACCAGAAAGTATGGAGGAGTCTTTCT



GAGAGAATTGATCGGTGTCATGTGAAGTGTTGAAGTTTGTATCTTGAAAATCCCCCTAAATCCTTTGTCTTAACAGCTCAATGCGAGCGCAGTGATO 
TGAAGTTCGCTAATCCTCCTTCTCGAAAGGGGAGAAGTGAGCACTGTCTCCAGACAGATCAGCTGGTGCAGGAGAGAATTTAGCGATAGTTTGCAAT 
TCTGATTAATCACGTAGAAAATGACCTTATTTTGGGGGGTGGGATGGAGGAGAGTGGGTGAGGAGGCACCGGCCGTGGAGCCAGTCCTCCGCCCCCC
GCCAGCCCACAGCATCACCACGCCrGACGAGGGGTGCTTGCCTGCCGCCCCTGCCCGCAGGTGAGCCGGTGGAGCCTCCGCTGCCCTCGCAGCTGCA 
CCTCATGTACGTGGCAGCGGCCGCCTTCGTGCTCCTGTTCTTTGTGGGCTGTGGGGTGCTGCTGTCCCGCAAGCGCCGGCGGCAGCATGGCCAGCTC 
TGGTTCCCTGAGGGTTTCAAAGTGTCAGAGGCCAGCAAGAAGAAGCGGAGAGAGCCCCTCGGCGAGGACTCAGTCGGCCTCAAGTGAGTGGACACTG
CTCCCIACTGTGTGTGGGTGAGTGAGTGGCAGGGTTGGTGGGGGTGCTTAGCTC^ 

CCTTCTGCATGTAGAGGATGCCCATATGAGCTGTGGCTTCAGGGCATCTCCTGGAGTACCTGCTCAGTCTCCCTACCCCCATACCCAGGCCCCTGAA 
GAATGCCTCAGATGGTGCTCTGATGGACGACAATCAGAACGAGTGGGGAGACGAAGACCTGGAGACCAAGAAGTTCCGGGTGAGTCCCGCAGGCTCC
CAAGCCCCCCCTGGGTGGCACCTCCTGCCTTGGCCCCAAGTGATGAGGCTCTGCT^ 

GACCTGAGTGATC^GACTGACCACAGGCAGTGGACCCAGCAGCACCTGGACGCTGCTGACCTGCGCATGTCTGCCATGGCCCCAACACCGCCTCAGG
GGGAGGTGGATGCTGACTGCATGGATGTCAATGTTCGAGGACCAGGTAAGGCCACTGGAGACACATGCACACATTCCCTAGCATAGACTTAAGAGGC
CAGTAGGACTTGAGGAGGGCTGGGTCACTGAAGGGCTCTGCCAGAGCAGCATCCTGCCCTGAGCTGATCGTCTGGCTCCTCCTGCCCACACAGTATA 
CTTTTTCATGAAATGCTGCTTTGCAATAGTCAAAGTGTTAATTTTTTTTTTTTTTTTTGGTCTCAGTTTTTAGTATGCTGGGATTATAGGTGTGTCC 
CCATGCCCAGCTAAAGTGTTGACTTTTAAAATTATATGCATATAGGTGTATATAAAGATATCTTTTATTTTATGTGCATGGTTCTGCCTACATATAT
GTCTGTGTAGTGTTTAGATACCCAGTGCCTGTGGGGAGCCTGAAGAGGCCATTGGATCCCCTGAAGCTGCAGTTACAGATGGTAGTGAGCAGCTATA 
TGGGTACTAGGGATTGAACrCAGGTCCTCTGGAAGAGAAGCCAGTGCTCTTAACCACTGAGCCATCTCTGCAGCCCTTATTTTATGTTTTAAAGCAT 
TTCTTCATTTGAGTGTAAGGGGCACACATGTGCCATAGTACACACGTGGAGGTCAGAGGACTACTTGTATGAGTGTGGGTTCTCTCCTTCTACCATG
TTGGTCCCAGAGTGCCTTCCAGGGAGGCAGCACTGCATCCCTGGGGACTTCATCCGGGGTCCCTTAAACTGCAGGACTGCCTGCCAGGGCTATGGTT
GAATGTTGACCCTCCCTGAGGCTGACACCATCCTTCCCACACCTACCCTTGTTTCTGTCCTCTCTGCAGATGGCTTCACACCCCTCATGATTGCCTC
CTGCAGTGGAGGGGGCCTTGAGACAGGCAACAGTGAAGAAGAAGAAGATGCACCTGCTGTCATCTCTGACTTCATCTACCAGGGCGCCAGCTTGCAC
AACCAGACAGACCGCACCGGGGAGACCGCCTTGCACTTGGCTGCCCGATACTCTCGTTCAGATGCTGCAAAGCGCTTGCTGGAGGCCAGTGCAGATG
CCAACATCCAGGACAACATGGGCCGTACTCCGTTACATGCAGCAGTTTCTGCAGATGCTCAGGGTGTCTTCCAGGTACGCAGCACTGCATCCTTGGA
GCCAGTGACCTCATCAGGTCTGACTGGCATGGACCCTGCAATCTCTAATTCCAAGTGTGACTGGGCAGGGAAACTTGGCACCTCCACACTGCATTTT
GAAGGAAGTCCAGGTACAAAGGGC!ATCAAGTCCCACCTGTGTTCTCTGAGCAGGCTGGCTGAAGGTGGTCTTGCGGAACCTACAGGGTGAGGCAGGG
CCTCCTTACTCTC^GTAAAACCCTCCCAATCTCAAGGCTATGGCCTTGACT^

CCATTCAGCTGTGGCTGCAGGCATAGCCCATGGCGGCAGAAAGAGAGCTTTAGAGGCTGCTGGTCCTGGCCTGCAAAGCTATCTCCTGGCTTTGCCT
AGAAGCCTGGAGCCAGCCTGGGTGGATTTAAGGATTTAGCTGCCTGGGTCTGCAGGGACCAACCCTAGTAGTTGCTCCTGCATGTTGCCAGCCAGTG
ACCCCTGCTACCAGACCTGTCCTOTAGGCGGCCCTGGAGCAAACTCCAATGGGGAATAGACCAGGCATCAAAAAGCTTGTCATTTAATCATTTCTTA 
ACTTGAACCACACTTTGTCGGTCTGTAGAAGAAATTAAACAAGACAGGGCAATGTACACCTTTTATCCCAGCACTCTAGAATCTGAGAGGATTGTGA
GCATGAGGCTAGCCTGGTCTTCATAGAAAGACCCTATTTCAAAATAGCAAAGAGAGGTTGAGACCAGCCACAGGTTATGTTGACTGCCTGGTGTACA
GCTCTCCACATGGCCTGCCTTGCTTGCATAGAC&GATGTGTGTAC^ 

ACTCCCTCCTCTCACCTTAACAGTTCCAAGTACGGCTATGGGCCATGAGCGGGATTCCTAGCCTGGTTACTCTCCCCTCTCTCCCCCATGCTATGCA 
GTTTGATTTCCTCAACCTGGAGGTAGTTCTGGCAACCATAACCAAAAGATGAGGTCCCAGTAGATCCCCCACTGTCACTAGGGACTGGGTAAGAGCC 
CCAGCTCTTGCACATGGAATGTAGCTCCTGCCTTCCTCAGGGGCTCAGCTGAGCCCAGTCCAGGCATTGGTTTCTGGAGATACAGGCrrGCATTTAG 
ATCACCCTGCCTGAACCCATCCCTGCCTTCCAGATCCTGCTCCGGAACAGGGCCACAGATCTGGATGCCCGAATGCATGATGGCACAACTCCACTGA 
TCCTGGCTGCGCGCCTGGCCGTGGAGGGCATGCTGGAGGACCTCATCAACTCACATGCTGACGTCAATGCCGTGGATGACCTAGGTAGGCCCGAGCT 
ACAGCCCAGCTACCGAGGACATCAGCCAGTGCCTGGTCTGCTCAGCCCCAACCCACTCTCTCCTCTCTTTGCTTCCTTTCCAGGCAAGTCGGCTTTG 
CATTGGGCGGCCGCGGTGAACAATGTGGATGCTGCTGTTGTGCTCCTGAAGAACGGAGCCAACAAGGACATGCAGAACAACAAGGTAAGCAGACGGG 
CTGCTTAACCGCAGGCGCATGATGGGGACTCCGGGGCACCCATACGTTGACCACTTAGAGGTTTAAGAGTGTCTGGGACAACAGCAGCCTAAAATAA
CCCGTGCTCTTGCTGGAGAGGAGCTGAGGCGTAAGCAGCTGTCCCATGTCACGGTTACTTGAGCCTTGCCTTCTGTCCCCTTCTGTCCCTGGTGGCT 
GTGGTGACCCTAGGGGTTCAGGATTGGCCAAGGGGCAGGGATGGGGTGGCCCTGTCTCTTAAGGCCTGCAAGGCTTCTGGCTTGGTCCACCACTACC 
ATGCTCCCTCCTTTGGGATAAGAAAACTGCTGAGCTGGGCTGCGTGTCTCCACCTCGCCTCCCAGTGTCTCTGTTTGTGCAGGGCTAGTTGCTTCAT 
CAGGACTCTGCTTAGTTGCTCACAGGGCAGAGTTTGTGCCTCCCTTACGAGCTCCACTGCAGGCTGGGAAATGGAGCAGGAGTCAGGTACCTGGAGT
TGCTGTTGGGCACAGGTTCTCCACAGTCCAGGGTGGGCCAGTGGTTATAGAGAATCCAAGCCTGATAGACATTCTGTTGTGGTGTATGATTATGTCT
GCTGGACAGAGCAGACCTGGAAGGAGCTGAGATGCCCTAGCCACTGGTGACCCTTTGGCATGGTAGTACCTTCCCCATGGCCATTCACTTGCCTTAA 
AGGTCCCTGAGTCCCAGGATTGCAGTACTTGTGCCTGCATTTTGGGGAGGAAAGCAAAGGGCACACAGTTACATTCTGTTTTCGGCAGTCTGACTCC 
ACACAGGGGAAGAAGCCCTATGTATTTATCCCACAGATCTCTGGGGTGTGAGGTGTTAAATCTAGGTTCTCTAAAAAAAGAACTCGGGACCAGCCAT 
GTGTGCGTGCTAGCACCGAGAAGCTGAGGCAGGACGATACTGAGTTCCAAGTACGGCTGTGGGCCCTGAGTGAGATTCCTAGCAACATAGGAAGGCC 
CCACTATAAATACTCTCTGAGCCTGGTGGTCTAGGATGCTTGGGTCTAGATAGATCTCTCCAGATAAGAGAAGAGAAAGAGGCAGCACCTGCTTCCC
TAGCCCCTCCCCCCAGGCCATCCATCATCAATGACTCAGCAAATTCTACACCTGCCCGCTCCCTCATGTACCTCCTGTCCGTGTCCTACAGGAGGAG 
ACTCCCCTGTTCCTGGCCGCCCGTGAGGGCAGCTATGAGACTGCCAAA 

GATTGCCGCGGGACATCGCACAGGAGCGTATGCACCACGATATCGTGCGGCTTTTGGATGAGTACAACCTGGTGCGCAGCCCACAGCTGCATGGCAC 
TGCCCTGGGTGGCACACCC^CTCTGTCTCCCACACTCTGCTCGCCCAATGGCTACCTGGGCAATCTCAAGTCTGCCACACAGGGCAAGAAGGCCCGC 
AAGCCCAGCACCAAAGGGCTGGCTTGTGGTAGCAAGGAAGCTAAGGACCTCAAGGC^ 

ACAGCTCGAGCATGCTGTCGCCTGTGGACTCCCTCGAGTCACCCCATGGCTACTTGTCAGATGTGGCCTCGCCACCCCTCCTCCCCTCCCCATTCCA 
GCAGTCTCCATCCATGCCTCTCAGCCACCTGCCTGGTATGCCTGACACTCACCTGGGCATCAGCCACTTGAATGTGGCAGCCAAG 
GCACTGGCTGGAGGTAGCCGGTTGGCCTTTGAGCCACCCCCGCCACGCCTCTCCCACCTGCCTGTAGCCTCCAGTGCCAGCACAGTGCTGAGTACCA 
ATGGCACGGGGGCTATGAATTTCACCGTGGGTGCACCGGCAAGCTTGAATGGCCAGTGTGAGTGGCTTCCCCGGCTCCAGAATGGCATGGTGCCCAG 
CCAGTACAACCCACTACGGCCGGGTGTGACGCCGGGCACACTGAGCACACAGGCAGCTGGCCTCCAGCATAGCATGATGGGGCCACTACACAGCAGC 
CTCTCCACCAATACCTTGTCCCCGATTATTTACCAGGGCCTGCCCAACACACGGCTGGCAACACAGCCTCACCTGGTGCAGACCCAGCAGGTGCAGC 
CACAGAACTTACAGCTCCAGCCTCAGAACCTGCAGCCACCATCACAGCCACACCTCAGTGTGAGCTCGGCAGCCAATGGGCACCTGGGCCGGAGCTT 
CTTGAGTGGGGAGCCCAGTCAGGCAGATGTACAACCGCTGGGCCCCAGCAGTCTGCCTGTGCACACCATTCTGCCCCAGGAAAGCCAGGCCCTGCCC 
ACATCACTGCCATCCTCCATGGTCCCACCCATGACCACTACCCAGTTCCTGACCCCTCCTTCCCAGCACAGTTACTCCTCCTCCCCTGTGGACAACA 
CCCCCAGCCACCAGCTGCAGGTGCCAGAGCACCCCTTCCTCACCCCATCCCCTGA 

CTCTGATTGGTCCGAGGGCATCTCCAGCCCGCCCACCACCATGCCGTCCCAGATCACCCACATTCCAGAGGCATTTAAATAAACAGAGATGTGGGAT 
GCAGGACCCCAGCTTCCGTTCCCAAGCCCTGTTGGGAGTCCTTTCCAGTGCTTCAGGATGCTGGGGCGACCAAAGGAGCCTTTTAAAAAATGTTTTT
ATACAAAATAAGAGGACAAGAATTTCCATTTTTTTTTTTTAGTATTTATTTATGTACTTTTATTTTCCACAGAAACACTGCCTTTTTATTTATATGT 
ATTGTTTTCTATGGCACTAGGGAAAAACATATCTGTTCCAAGAAAATAAAC™ 

GTGTCTGTAAAATATGAACAAGGATTCATGATTTGTAAATGCTGTTTATTTATTGATTGCTTCTTTCCAAAATCGAAAAGAAAGA 

ACAGGAGAAGGGAAGCTGGAAACTGCCATGGCCAGAATTGCCCCTCCCCCACACTCACTGCCCCTCCCCCCAGCGTCACCTGGGATTTGCAGATGTG 

TTTAGAAACACGCCCAGACCTTGAACCTTGGGTTCATGGATTAGTTTTGTATCTAAAACAGGAAACAAGTCAGATGATGTGGTTTGTACACTTTCTG

TAACCACCAGTGTGGACTTGAAGAAGTGTCCTCAGCATGTGCAGAGTCTACTACCCAGTACCAGTCGTGAGTCTGCAGGCTCCAGTGTTCTGTAGTA 

GTGTTTATGGGCCTTGGGAGTACTTCTCCCCTGCCCTGCCCCACTGTCCCCTTCCTGACAACTTGAGCCAGTAAGCCATGCAGGGTGTGGTGCCTCC 

TAGAGAAAAC&CTGCCTGGACTGTTCTGTGCATCCCTCCAAACAGCATCATCCAA^ 

CTCCTAACACCTGACTGCCAAAGGGCTCCAATGTGCATTGTGGACTCGCCAGAGTAGCCTGCATTGAGACTCCAAGAAAACAGAAGCTATGTGGCCT
CTGATCCCCAAACTGGCCTGGGTGGGGACATGCCTTGAGTGTGCTGGAATGTGGGTGGAGCCTGCTTCTGGGCCACCCCTCCTGGTTCAGGGCTGTG 
CTCACAGCAGATTCTTGCAGTATCAAGTATACGCCTGTGGCAGAATAAGTATCTGTAAATACATGTTTAAAGATGGATTTTGTTTAAAAAATCTAAA 
GGAACAAGTGTGTCGTGTGTCAAGCTGATGAGGACTGTCAGACTGTGGCTTAGCTCAGTGTGACCCAGACCTTGTGACCTGTAGCTGCCGAACCAGT 
AGCTCCTAAGAGCACAACCCAGGATGGCCCATCTGCTGCCCACCAAGTCCCTTTCC^^ 

TCCTCAGGGCAGCTCTTTCTGGCCTTTTGGGGGGCAGTGTCTGTGCCATGCCTAATAGATATGACCAGACGCATCCTAAGATGTTGATTCTTACTGT 
GTTGTATAAAATAAAGTGTAGTTTACAAAAAAAAGAAACGTAAAAAAAAAAAACTACATGCAAAACTGTAGGTAATGAAAATGATGTATTT 
TCTTTTTTTGTTAACTAATTTGCAATAAAAATGATACTGATGGTTGTCCACATTTTGCTAAACTTGGAGTGTGGTTTGGTTCACTTTTTGGGGAGTG 
TGACAGAGAAAGGGCCACCCAGCACTGAAATAACCAAATCCAAACCCTCTCTTACACAGAAATACAGAGGTTGCCATAAGGACACACCATTTTCAGA
AGCTGCACAGCGCAGGGCATCAAGGTGGGAAGACAGTTTCCAAAAGAGTTGATGCAGTCCTCTCAGGGATGTGGATCTGATTGATGCCAGCTGTATT
CCTGTGAGCAGCAGAGGAGTGAAGGCACCTTATGTTGTCATAGATGACTATCCTCTGAGCCCAGTAACCCCCAACCTCTTCAAATTCAGCCATGCCT
TGCAAAAGGTCATCCCAGCTGTGCTTGTTGACTGCTTATAGACCCCCTCAAAGAGTTGGGAAGGTCATGGGACTCAAAGGACCCTTGCCTGATGATC 

TATGTGGCTGTGAGGAAAGCCTTATCCCTGCTGGGTAAATGTTGTATTAATATTCATTTAATAAAGGCTGGGCGTGGTGGCACACGCCTTTAATCCT 
AGCACTCAGGAGGCAGAGGCAGGTGGATTTATGAGTTCGAGGCCAGCCTGGTCTACAGAGTGAGTTCCAGGACAGCCAGGGCTATACAGAGAAACCC
TGTCTCGAAAAACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATTCATTTAATAAATCCTC 

AGACACACAACCTTTCTTTATATTTTATTATGTCTTATTCAGCACAATAGCTGGGTAGCTGCCTAGCCTCCATATTGCTAAGACTCTGTCTCCCCAT 
TCTGAATTCCTGCTTACTAATTCTATGTTCTATCGTGGCTGATCTAGACCAAGTTGGGCAGCCTTATAGGCCATGCTCTCCCAGCTCCTTCAGGTGG
TAGCCTTGCCTCTCTGTCCTGCACTCTTCCCAGGCATGGCATCTCTCCTCTCCCCAACTCTCCCTTCTCCCACTGCCTCAAGTCTCAAGCCCAGGAG
TCCTAAACGGTCCACCTTGTCTGTCCTCCCCAGCTACTGGCTATTGGTATCTTTATTTACCAATCAGAACCCACTGGGGACAGGTTCCCAAAAGC

TGTGTAGACCTTCCTGTGCAGTTCGGGGGAACATTAACATTTGTAATACAAACAGCTACAAGTAAAGAGTTCCAATGTGGACTCAGTAAGTAACCCT 
TTACCTATGTTCTGCTCTTGTCAGCGCTGCAAATCCATTGGTGCCCTAGTCAGCTTTGGTGCCTAGGACCTTAGGAGGAGGGTTAAGTGTTGTCTCT
TCCTGGGAAGGAATTTTAGCAACACTCATCTAGTCCGAGAGCCCAGCAGGTACAAAGGCATCTTAGCCCTAAGAAGAAATCATAATAGCACAGGTGT
GGATCTCCCTTGGCCTGCTGTACTGGCTAGTTTTGTGTCAACTTGACACAGCTGGAGTTATCACAAAGAAAGGAGTTTCAGTTGGGGAAATGCCTCC
ACGAGATCCAGCTGTAAGGCATTTTCTCTTAGTGATCAAGGGGGGAAGGGCCCTTTGTGGGTGGTACCATCTCTGGGCTGGTAGTCTTGGGTTCTAT 
AAGAGAGCAAGCTGAGCAAGCCAGGGGAACCAAGCCAGTAAAGAACATCCCTCCATGGCCTCTGCATCAGTTCCTGCTCCCTGACCTGCTTGAGTTC
CAGTCCTGACTTCCTTGGTGATGAACAGCAGTATGGAAGTGTAAGCCGAATAAACCCTTTCCTCCCCAACTTGTTTCTTGCTCATGATGTTTGTGCA
GGAATAGAAACCCTGACTAAGACACCTGCCTAGGCCTCTCCATTTGGCCCACACCTTGGAGCACAGCAGGAGTCTCTAACCAAAGTGACAATGGGGA
ATGTGATGCCCTCTCTGAGCCTGACATCTGAGGACAAGTGTGAATCCATCAAGTTCAGATGTCTTGCCCCGGCTCAGTGTTGCACACTGTGAAAGCG
GAGGGGGCCCAGGGGCCCAGAAGGGGTGCTCACACACAGAATTGTGATAACTTGTCT
CCCGGAGAACTAGCATCTCCAAAATTCCATCCGGAATGGAAATGTCAGCTCGGAAAGGGCT^

CATCCACAATGGCATGTGCAGCCCAGGGTTCACAACTGATGGCTGGACCACTGAGAACACATCTGGACTGGGTTCGGGGTGAAAAACATTGATGTTT
GCAGATCAAAGCAAGGGCCTACCCATGTGGGCAAATCTTCTCCAGTGAGAGACCAAACAACAAAAGGTCAGGGAAGAGGGGATGGATCCTGCCTACT
GCGGCAGAACATGAATCCTCTCCCCTCCAGGCTCCCACGGAAAGATTC

ACCCCACAACCTCTCTTTGTATCTACCCTAGTAAACATTCAGTGGAGTATTTTTCAGCCATTAGAAGGAACAAAGCTCTGAGACCTACTAACATGTG 
GGTGAGCTGCAGGAGGACACAGGGCTCGTGAAAGACATCTCAAAAATGACCACATGATTTCATCTTTATGAAATGTCCTCAGTAAGTAAAGCCACAA 
AGACAGAGAACAGAGTGGCCATTCCTGGGTCCAGGGAAATGGGTATGACTGACAATGCCTATACATTTGACTTTGCATGGATAGTCCAGCACTGTGT 
ACGTCTAAGGCAGAAGGCTGCATGGCACTGTGAAGGTCCCAGACCCGTCAATGCTAGAATTCCACAATCTGTGAGTCTACTAAAACTAGCAAAAAAA
AAAAAAAAAAAAAAATCCAAGCAAAATGGGAACTTTCAGAAATTTGTCCCATGGACACAACAAGTTCTCTTTTATTTTO 

TATCCCAGGTTGGCCTCAATCTCACCATCCTCCTGCCTCAATCTCCCAAGTGTTGGGATTCTAGGTGTGTACCACGCCACCCACTTAAACCTAGTAC 
TCTTAAGGAGCATCTATCTGCCCCTAAAAGCCACTGGAAATAAATAAAACATGTAAAACAATATTTTTAAAGGGCTAAAGATGTAGCCTAGTAGTAG 
AGGGTTTGTTTACTCTGTGCAAGGCTCTGGATTTCATCCCCAGGACCAAATAAAGTAATCAATAAATTGGCTTGAGACAACCTTCAATAAATTCCTT
GTTCTATGAACCTGAGCTCCTCATTTTGAGTCATGTAAAAAAGTTGGGTATGATCATGTGCTGTAATCCCAGCATCTGTGAGATGGGGAGGTAGAGA 
TCGGTAGAGTCCTACTTAGCTTTTGTCTAATCAGTGAGTCGCAGGTCCCAGTGAGAGACCTTGTCTCAAAAACTAAAAACAAAAAGGCCGAGGACTG 
GGGAGATGGCTCAGATAGTAAAAATTCCTACTGCACTAGCACAAAGACCTGAGTTTGGAATCTCCAGCATCCACATGCTGTCCTGGAATGACCATAC 
ATCTGTAATCCTGTGCTGGGGTGGGGAGGGAGACAAGATCCTGGAGCACCACTTGCCTAGCCAATGATCAATCAAGCACTAGGTCCAGTGAGAGAGA 
CCCTGCCCCAAAAACC&GGATGGAGTCAGGCATTGTGGCACATGTTTAACCCCAAC^ 

CAAAAACAAACTAAAAAACCAAGCTGAGAATGAGAATGATACCTTCTGGCTTCCACATGCCATATATACATATACATGCTCACACGCACAGGCACAC
ACATATGCACAGTGTAAGGACTAAATTGGGCACCCAGCAGCTAGACAGCCCAGAGAGAAAATAGAGTCTTGCAGCCTCAGCCTTCCTTTGTTTAGGG
GGGATTACAGGACAGTGGGCCCTGCTGTCTCTCAACAGAAATACTCCAGGACCAAATCCCAAGCAGACAGAGGTATGGAGATGAGCTCTGTGTCATA
CACACACTGAAGATGGTCATATACAGACATGGGAGCAGAGGCAACTGAGTGGTCTTGGAGCAACTTGCCCTGTGGTACCCGGTCACAGACCTCCAGG 
ACAAGGAAATTCCTGTGAAATCCAACTGGTGTGGGGCTGTTTACACAGCAACCTGGATAGATTG 

CCGAAATCTCTCAGGGACACAGGCCAGTTCAGTTCCCTAGGTACTGCCTTTCCCCCACAATGAAATGCCATCTGTGCTGGCTAGCTGTAACACATCA 
ACTTGACACACAAACTACAGAGTTATCTGAGAGGAAGAAACCATAATTGAGAAAATGCCACTGTAAGATCTGGCTGTAAGACATTTTCTTAATTAGT
GATTGATGGAGGAGGGCCAGCTTATGGTCCTGAGTTCTATAAGAAAGTAGGCTGAGCAAGCCATGGGAAACAAGCCAATAAGCAGCACCCCTCCATG 
ACCTCTGTCTGTATCAGCTTCTGCCTCTAGGATCCTGCCCTGAGTTCCTGTCCTGACTTTCTTCAGTGATGAACAGTGCTGTCAAAGTGTAAGCCTA 
AGAACTCCTTTCCCCTCCAACTTGTTTTTTGGTCGTTGTTTCATCACAGCA^ 

CTTGTAATGTTTTTGTTTGTTTTCAATTTAGAAGAATAGTTATTATGTATATACTACTAGAGGGCAGTGAGTTAGGCAGTAGTGAGTAAGCAGGGGC 
ACACTACGTCCTCTGGCTGGCTGAACCTTGTGATCCTTCTGCCTCAGCCTCCTGAGTTCTAGGATTGCAGGTATGCCCCACCAGGCTTGACTTTAAG 
TCCTACTTTTCAACATTCAAGTATGTAAGTAGTTCCAGACAAGTATGGTAAATCC^ 

GGCCAGCCTGAGTTATATAGCTAACTTCTGTTTCAAAAATAACAAAAGCAGATATAAATAACCCGAGAACAGCCTGGTGGTGGTGTTGCACACCCTT
AATCCCAGCACTCAGGAGGCAGGCGGAGGTAAATTTCTTGAGTTCCAGGCCAGCCTGGTCTACAGAGTGAGTTTCAGGACAGCCAGGGCTACAAGGA 
GAAACCCTGTCTCAAAAAACTGAAACCAAACCAAATAG<2AACAAAAACAAACCCAAGAACTAGT^

GTCTGTGTATATCTTCAGTATGAGAGAGAAGGGCTTCCTTAGCCTCCTGGGGTCATTCAGAGAAGCCCATTCCAATAGAAATAAGACGGAAACCACA 

TAGGTCATTCAAAGTACTTCATAACCCTTAAAAAGGTCAAAGGAGCAGGTGAATTTATTTCAATACTACATCTTGGTTCGTCCAATGAAACCACAGT 

TACCACTTAACTAGCAGTCAACACAAGCGAAGTATGAAAAGGACAGAGTTGACATTCTTTGTATTATTATTTGTATTTATTTGCTTA 

TGCTAACACATGCCATACATGTAGAGGTCAGAGGACAAGTGATGAGAGCCAGTCTCTCCCTCTACCATGTGGGTCCTAGGGACTGAACCTGGGTTGT

CCTCCTTGGCAGCAGGCACCCTTACCCACTGTGCCAGCTCACCTGCATTATCACCATCTGCACTGGCCATGCTGAAGGTGCTTAGAGGCCACACATG

GTCAAAATGTCATCCCAGACAGGCTAATCTCCACCAGCACAGATAAACAACCCATGGAGCCAGATGTAGTACTGGGAGCACTGGGGGGCGAAGGGG 

GGACGGTAAAGGCAGGAGGTTCAGGAGTTCTAAGCCAGATCCTGTCTATAAAGATGTAAGCAAAGGAATAAAGAAGGTGAGGGGAGAGACTGCTATG 

CTGGTGTTGTTTCTTGTCAAAGGAGGCACCAAATGTGGCTCTGCATTGCTTTGTCCAGCACAAAGAGGGACCCAAGGGTATTAGTGCCATAAACCCA

GCCAAAGCAGCAGGGTTATGCCTACAATCAAAAGTGATGGAGTGCTGCTGTCCTTCTGAGACTCCGGGAACCAGCCAGCCATGAGTCTTGCAAAGTT 

TGCTATCAGGGACACTCTATGGGGAGATTTTTACTATGTGACTCCAGACCCCACCCTGTGGAGCTGTGGAGACCCAGGCCAATCCTCAGCTGCATAC 

GAGTTAGGAAACAGGGCTCTTTTACTACAGTATCCTCAAGCCCGACTGAGCTATGTGTACTGTGGATTTCGAAAGGTTTGTGTGGAGCTTCTCCAGA 

GAATCCCTCTGTTTTGAAGATCAAGGACAGGGATGCTGCTCTCTCCACCTGCACTTTAAAGATGATGTGGTCCTGTCCTAATTCCTGACCACCCTCA 

CGGGTGGTGTTCTCATGGACTACCTGGCAGCTGTGGGATAGAAGGTCCCTAGAGTGTCCTCCTGTGTGAGGGGTGACCTCCAAGGTCTATGTGTGGA 

TGCTTTGGCCATCTGTGTAGAACTCACTTCTGGTAAGGACCCAGAAATAGCATGGCTATTGTTATCACAATGGATTTTAAGAACATTCCTTAGCAGC

CAAGCCCAGCATACACAGTCACTGGTCACCACAAACTCATTGAGAG 

CAACTCTTTTTTTTTTAAAGATTTATTTATTATTATATGTAAATACACTGTAGCTGTCTTCAGACATTCCATAGTAGGGCATCAGATCTCATTGTGG 
ATGGTTGTGAACTACCATGTGGTTGCTGGGATTTGAACTCAGTACCTTTGGAAGAGCAGTCAGTGCTCTTACCCGCTGAGCCATCTCACTAGCCCTG 
ATCAAGGCAACTCTTATAAAAACAACATTTAATTGGGGCTGGCTTACAGGTTCTGAGGTTCAGTCCAT^ 

CAGG<^GACATGGTACAGAAGGAGCTGAGAGTTCTACAACTTCATCTGAAGGCCACTAGGAGAAGGCCACTGACTTCCAGGCAGCTAGGACAAGGGT 

ATTAAGGCCCACAACCACAGTGACACACCCACTCCAAAAAGGTCACACCTCCTAAT^ 

TGGCCAAGAATGGTTGCTTCAATGCTTTTATGTGCACATACCACCTAGAGTGTTCC^ 

AAGGCTTATTTTTAGATTCTAGTGCTTGTGAGATCCATTCTCTACAACACCATGCTATTTCTATGACTATTTCAAATACAAGACTGTGTTATGCCGG 

GTGTGGTGGTACATGCCTTTAATCCCAGCACTCAGGAGGCAGAGTCAGCCGGATTTCTGAGTTCGAGGTCAGCCTGGTCTACAAAGTGAGTTCCAGG 

ACTTCCAGGGCTACACAGAGAAACCCTGTCTCGAACCCCGAACCCCCCCTCCGCCCCCCAAAAAAGACGACTGTGTTCTGTTTCTCATGAGATTCTT 

CTGCATGGGGTGAGCATGCTTTCTACTGGAGCTGTGGACACTCTGGCACTCAGAAACAAGTCTGACCTAAGCTACTTATTTACTCCCCATCAATACA 

ATCTTCTTAAGCTGGAAAAGGGAAGGGGAGAGAAAAGGTCTCTGACCTTAGCCTCAGCAGGCCAGGTGGAAGGGGTGACACAGAACAAGGCCCTTTC 

CTCTACCTTGAAAGCAATGCCACAGGCTCCCACAAGGTTGAGGATTCTGTCTCTGACCTCAGAGCTTGGAAATCCATTGTCTTCTCTGACTCCAATG 

TGCTCAACTCAGAGAGCATTTATGCAGCACCTGCTATCTTTAGTAAGCCCACCCCCTCCAGGGAGTCCAGAAGTAAAGACTTCCCTCTGCGAGGGCT 

GCTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGGTTAGGGGGGTCTGCAGGGTATGCAACACCTGCACCTAACTCAAGTGAGATAGGATC 

TGGAAGGAGCCCTCAATGGATGGATAATTTGTAGGTAGATTTGT^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNIWNITONNNNNNN^ 

NIWNNNNNNNNNNNNNNNITO 
GTGGTGTGCGTCAACGTCCGATCCCCGCCGGCCACCCCAAGAGGCCGCCGCCGGGCTGCGGGCAGCTGGCGAGCAGGCATGCCACGGCTCCTGACGC
CCCTTCTCTGCCTAACGCTGCTGCCCGCGCGCGCCGCAAGAGGCTTGAGATGCTCCCAGCCAAGTGGGACCTGCCTGAATGGAGGTAGGTGCGAAGT
GGCCAGCGGCACTGAAGCCTGTGTTGCCAGCGGCAGCTTTGTGGGCCAACGATGCCAGGACCCC^TCCTTGCCTCAGCACACGGTGTAAGAATGCT 
GGAACGTGCTATGTTGTGGACCATGGTGGCATCGTGGACTATGCCTGCAGCTGTCCCCTGGGTTTCTCTGGGCCCCTCTGCCTGACACCTCTGGACA 
AGCCCTGCCTGGCCAACCCCTGCCGCAATGGGGGCACCTGTGACCTGCTCACTCTCACAGAGTACAAGTGCCGCTGCTCTCCAGGGTGGTCAGGAAA 
ATCATGTCAGCAGGCTGACCCCTGTGCCTCCAACCCCTGTGCCAATGGTGGCCAGTGCCTGCCCTTTGAGTCTTCATACATCTGTCGCTGCCCGCCT 
GGCTTCCATGGCCCCACCTGCAGGCAAGATGTTAATGAGTGCAGCCAGAACCCTGGGCTGTGCCGCCATGGAGGCCACTGCCACAATGAGATCGGCT
CCTATCGCTGTGCCTGCTGTGCCACCCATACTGGTCCCCACTGTGAACTGCCCTATGTGCCCTGCAGCCCCTCACCCTGCCAGAATGGAGCAACCTG 
CCGTCCTACAGGGGACACCACCCACGAGTGTGCCTGCTTGCCAGGTTTTGCTGGACAGAACTGTGAAGAAAATGTGGATGACTGTCCAGGAAACAAC 
TGCAAGAATGGGGGTGCCTGTGTGGACGGCGTGAATACCTACAATTGCCGCTGCCCACCGGAGGTGACGGGTCAGTACTGTACAGAGGATGTGGACG 
AATGTCAGCTCATGCCAAATGCCTGCCAGAATGCGGGAACCTGCCACAACACACACGGCGGCTACAACTGTGTGTGTGTCAATGGGTGGACTGGCGA 
GGACTGCAGTGAGAACATTGATGACTGTGCCAGTGCCGCCTGTTTCCAGGGTGCCACTTGCCACGACCGTGTGGCTTCCTTCTACTGCGAATGTCCG 
CATGGGCGCACAGGTCTGCTGTGCCACCTCAAGCATGCGTGCATCAGCAACCCCTGCAACGAGGGCTCCAACTGTGACACCAACCCTGT^ 
AACGAATCTGCACCTGCCCCTCGGGGTACACAGGGCCAGCCTGCAGCCAGGACGTGGATGAGTGTGATCTGGGTGCCAACCGTTGTGAGCACGCAGG 
CAAATGCCTCAACACACTGGGTTCTTTTGAGTGCCAGTGTCTACAGGGCTACACGGGACCCGGCTGTGAGATTGATGTTAATGAGTGCATCTCCAAC 
CCATGTCAGAATGACGCCACTTGCCTGGACCAGATTGGGGAGTTCCAATGCATATGTATGCCAGGTTATGAAGGTGTATACTGTGAAATCAACACGG 
ATGAGTGCGCCAGCAGCCCCTGTCTGCACAATGGCCACTGCATGGACAAGATCCATGAGTTCCAATGTCAGTGCCCCAAAGGCTTC^CGGGCACCT 
GTGCCAGTATGATGTGGATGAGTGTGCCAGCACACCATGCAAGAACGGTGCCAAGTGCCTGGATGGGCCCAACACCTATACCTGCGTGTGTACAGAA 
GGTTACAC^GGGACCC^CTGCGAAGTGGACATTGACGAGTGTGACCCTGACCCCTGCCACTATGGTTCCTGTAAGGATGGTGTGGCCA
GCCTGTGCCAGCCAGGCTACACAGGCCATCACTGTGAGACCAACATCAATGAGTGCC^^ 

TGACAACTCCTACCTCTGCTTATGCCTCAAGGGAACCACAGGGCCCAACTGTGAGATCAACCTGGATGACTGCGCCAGCAACCCCTGTGACTCTGGC 
ACCTGTCTGGACAAGATTGATGGCTACGAATGTGCCTGTGAACCAGGCTACACAG 

CCTGCCACAACGGGGGCACTTGTGAGGATGGCATCGCGGGCTTCACTTGCCGCTGCCCCGAGGGCTACCATGACCCCACGTGCCTGTCCGAGGTCAA 
CGAGTGCAACAGTAACCCCTGCATCCACGGAGCTTGCCGGGATGGCCTCAATGGG 

GACATCAACAACAACGAGTGTGAGTCC^AACCCTTGTGTCAACGGTGGCACCTGCAAGGACATGACCAGTGGCTACGTATGCACCTGCCGAGAAGGCT 
TCAGTGGCCCTAATTGCCAGACCAACATCAACGAATGTGCCTCCAACCCCTGCCTGAACCAGGGGACCTGCATTGATGATGTCGCTGGATACAAGTG 
CAACTGTCCTCTGCCATATACAGGAGCCACGTGTGAGGTGGTGTTGGCCCCATGTGCTACCAGCCCCTGCAAAAACAGCGGCGTATGCAAGGAGTCT 
GAAGACTATGAGAGTTTCTCCTGTGTCTGTCCCACAGGCTGGCAAGGTCAAACCTGCGAGGTTGACATCAATGAGTGTGTGAAAAGCCCATGTCGCC
ATGGGGCCTCCTGCCAGAACACCAATGGCAGCTACCGCTGCCTCTGCCAGGCCGGCTATACAGGTCGCAACTGTGAGAGTGACATCGATGACTGCCG 
CCCCAACCCGTGTCACAATGGGGGTTCCTGCACCGATGGCATCAACACAGCCTTCTGCGACTGCCTGCCCGGCTTCCAGGGTGCCTTCTGTGAGGAG 
GACATCAATGAATGTGCCAGCAATCCCTGCCAAAATGGAGCCAATTGCACTGACTGTGTGGACAGCTACACATGTACCTGCCCCGTGGGCTTCAATG
GCATCCACTGCGAGAACAACACACCTGACTGTACTGAGAGCTCCTGCTTCAATGGTGGTACCTGTGTGGATGGTATCAACTCCTTCACCTGTCTGTG 
TCCACCTGGCTTCACGGGCAGCTACTGTCAGTATGATGTCAATGAGTGTGATTCACGGCCCTGTCTGCACGGTGGTACCTGCCAAGACAGCTATGGT 
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ACTTATAAGTGTACCTGCCCACAGGGCTACACTGGTCTCAACTGCCAGAACCTTGTGCGCTGGTGCGACTCGGCTCCCTGCAAGAATGGTGGCAGGT 
GCTGGCAGACCAACACGCAGTACCACTGTGAGTGCCGCAGCGGCTGGACTGGCGTCAACTGCGACGTGCTCAGTGTGTCCTGTGAGGTGGCTGCACA 
GAAGCGAGGCATTGACGTCACTCTCCTGTGCCAGCATGGAGGGCTCT^ 

GGCAGCTACTGTGAGGACGAGGTGGACGAGTGCTCACCTAACCCCTGCCAGAATGGAGCTACCTGCACTGACTATCTCGGCGGCTTTTCCTGCAAGT 
GTGTGGCTGGCTACCATGGGTCTAACTGCTCCGAGGAGATCAACGAGTGCCTGTCCCAGCCCTGCCAGAATGGGGGTACCTGCATTGATCTGACCAA 
CTCCTACAAGTGTTCCTGCCCCCGGGGGACACAGGGTGTACACTGTGAGATCAATGTTGATGACTGCCATCCCCCCCTTGACCCTGCCTCCCGAAGC 
CCCAAGTGCTTCAACAATGGCACCTGTGTGGACCAGGTGGGTGGCTATACCTGCACCTGCCCACCAGGCTTCGTCGGG^AGCGGTGTGAGGGTGATG 
TCAATGAATGTCTCTCCAACCCCTGTGACCCACGTGGCACCCAGAACTGTGTGCAGCGTGTTAATGACTTCCACTGCGAGTGCCGGGCTGGCCACAC 
TGGACGCCGCTGTGAGTCAGTCATCAATGGCTGCAGGGGCAAACCTTGCAAGAATGGGGGTGTCTGTGCCGTGGCCTCCAACACCGCCCGTGGATTC 
ATCTGTAGGTGCCCTGCGGGCTTCGAGGGTGCCACATGTGAGAAT^ 

CGGGCCCACGTAGTCCCACCTGCCTATGCCTGGGATCCTTCACCGGCCCTGAGTGCCAGTTCCCAGCCAGCAGCCCCTGTGTGGGTAGCAACCCCTG 
CTACAATCAGGGCACCTGTGAGCCCACATCCGAGAACCCTTTCTACCGCTGTCTATGCCCTGCCAAATTCAACGGGCTACTGTGCCA 
TACAGCTTCACAGGTGGCGCTGGCCCGGACATTCCCCCACCGCAGATTGAGGAGGCCTGTGAGCTGCCTGAGTGCCAGGTGGATGCAGGCAATAAGG 
TCTGCAACCTGCAGTGTAATAATCACGCATGTGGCTGGGATGGTGGCGACTGCTCCCTCAACTTCAATGACCCCTGGAAGAACTGCACGCAGTCTCT 
ACAGTGCTGGAAGTATTTTAGCGACGGCCACTGTGACAGCCAGTGCAACTCGGCCGGCTGCCTCTTTGATGGCTTCGACTGCCAGCTCACCGAGGGA
CAGTGCAACCCCCTGTATGACCAGTACTGCAAGGACCACTTCAGTGATGGCCACTGCGACCAGGGCTGTAACAGTGCCGAATGTGAGTGGGATGGCC 
TAGACTGTGCTGAGCATGTACCCGAGCGGCTGGCAGCGGGCACCCTGGTCCTGGTGGTGCTGGTTCCACCCGACCAGCTACGGAACAACTCCTTCCA 
CTTTCTGCGGGAGCTCAGCCACGTGCTGCACACCAACGTGGTCTTCAAGCGTGATGCGCAAGGCCAGCAGATGATCTTCCCGTACTATGGCCACGAG
GAAGAGCTGCGCAAGCACCCAATCAAGCGCTCTACAGTGGGTTGGGCCACCTCTTCACTGCTTCCTGGTACCAGTGGTGGGCGCCAGCGCAGGGAGC 
TGGACCCCATGGACATCCGTGGCTCCATTGTCTACCTGGAGATCGACAACCGGCAATGTGTGCAGTCATCCTCGCAGTGCTTCCAGAGTGCCACCGA 
TGTGGCTGCCTTCCTAGGTGCTCTTGCGTCACTTGGCAGCCTCAATATTCCTTACAAGATTGAGGCCGTGAAGAGTGAGCCGGTGGAGCCTCCGCTG 
CCCTCGCAGCTGCACCTCATGTACGTGGCAGCGGCCGCCTTCGTGCTCCTGTTCTTTGTGGGCTGTGGGGTGCTGCTGTCCCGCAAGCGCCGGCGCC
AGCATGGCCAGCTCTGGTTCCCTGAGGGTTTCAAAGTGTCAGAGGCCAGCAAGAAGAAGCGGAGAGAGCCCCTCGGCGAGGACTCAGTCGGCCTCAA
GCCCCTGAAGAATGCCTCAGATGGTGCTCTGATGGACGACAATCAGAACGAGTGGGGAGACGAAGACCTGGAGACCAAGAAGTTCCGGTTTGAGGAG
CCAGTAGTTCTCCCTGACCTGAGTGATCAGACTGACCACAGGCAGTGGACCCAGCAGCACCTGGACGCTGCTGACCTGCGCATGTCTGCCATGGCCC
CAACACCGCCTCAGGGGGAGGTGGATGCTGACTGCATGGATGTCAATGTTCGAGGACCAGATGGCTTCACACCCCTCATGATTGCCTCCTGCAGTGG 
AGGGGGCCTTGAGACAGGCAACAGTGAAGAAGAAGAAGATGCACCTGCTGTCATCTCTGACTTCATCTACCAGGGCGCCAGCTTGCACAACCAGACA
GACCGCACCGGGGAGACCGCCTTGCACTTGGCTGCCCGATACTCTCGTTCAGATCGTCGAAAGCGCCTTGAGGCCAGTGCAGATGCCAACATCCAGG
ACAACATGGGCCGTACTCCGTTACATGCAGCAGTTTCTGCAGATGCTCAGGGTGTCTTCCAGATCCTGCTCCGGAACAGGGCCACAGATCTGGATGC
CCGAATGCATGATGGCACAACTCCACTGATCCTGGCTGCGCGCCTGGCCGTGGAGGGCATGCTGGAGGACCTCATCAACTCACATGCTGACGTCAAT
GCCGTGGATGACCTAGGCAAGTCGGCTTTGCATTGGGCGGCCGCGGTGAACAATGTGGATGCTGCTGTTGTGCTCCTGAAGAACGGAGCCAACAAGG
ACATCGAGAACAACAAGGAGGAGACTTCCCTGTTCCTGTCGATCCGCCGTGAGAGCTATGAGACTGCCAAAGTGTTGCTGGACCACTTTGCCAACCG
GGACATCACGGATCACATGGACCGATTGCCGCGGGACATCGCACAGGAGCGTATGCACCACGATATCGTGCGGCTTTTGGATGAGTACAACCTGGTG
CGGTCCCCACAGCTGCATGGCACTGCCCTGGGTGGCACACCCACTCTGTCTCCCACACTCTGCTCGCCAAATGGCTACCCTGGCAATCTCAAGTCTG
CCACACAGGGCAAGAAGGCCCGCAAGCCAAGCACCAAAGGGCTGGCTTGTGGTAGCAAGGAAGCTAAGGACCTCJVAGGCACGGAGGAAGAGTTCC
GGATGGCAAGGGCTGGCTGTTGGACAGCTCGTCGAGCATGCTGTCGCCTGTGGACTCCCTCGAGTCACCCCATGGCTACTTGTCAGATGTGGCCTCG
CACCCCCTCCTCCCCTCCCCATTCCAGCAGTCTCCATCCATGCCTCTCAGCCACCTGCCTGGTATGCCTGACACCCACCTGGGCATCAGCCACTTGA
ATGTGGCAGCCAAGCCTGAGATGGCAGCACTGGCTGGAGGTAGCCGGTTGGCCTTTGAGCACCCCCCGCCACGCCTCTCCCACCTGCCTGTAGCCTC
CAGTGCCTGCACAGTGCTGAGTACCAATGGCACCGGGGCTATGAATTTCACCGTGGGTGCACCGGCAAGCTTGAATGGCCAGTGTGAGTGGCTTCCC
CGGCTCCAGAATGGCATGGTGCCCAGCCAGTACAACCCACTACGGCCGGGTGTGACGCCGGGCACACTGAGCACACAGGCAGCTGGGCTCCAGCATA
GCATGATGGGGCCACTACACAGCAGCCTCTCCACCAATACCTTGTCCCCGATTATTTACCAGGGCCTGCCCAACACACGGCTGGCAACACAGCCTCA
CCTGGTGCAGACCCAGCAGGTGCAGCCACAGAACTTACCACTCCAGCCACAGAACTTACAGCCACCATCACAGCCACACCTCAGTGTGAGCTCGGCA
GCCAATGGGCACCTGGGGCGGAGCTTCTTGAGTGGGGAGCCCAGTCAGGCAGATGTACAACCGCTGGGCCCCAGCAGTCTGCCTGTGCACACCATTC 
TGCCCCAGGAAAGCCAGGCCCTACCAACATCACTGCCATCCTCCATGGTCCCACCCATGACCACTACCCAGTTCCTGACCCCTCCATCACAGCACAG 
TTACTCCTCCTCCCCTGTGGACAACACCCCCAGCCACCAGCTGCAGGTGCCAGAGCCC^CTTTCCT(^CCCCATCCCCTGAGTCCCCTGACCAGTGG 
TCCAGCTCCTCCCCGCATTCCAACATCTCTGATTGGTCCGAGGGCATCTCCAGCCCGCCCACCACCATGCCGTCCCAGATCACCCACATTCCAGAGG 
CATTTAAATAAACAGAGATGTGGGATGCAGGACCCCAGCTTCCGTTCCCAAGCCCTGTTGGAAGTCCTTTCCAGTGCTTCAGGATGCTGGGGCGACC 
AAAGGAGCTTTTTAAAAAATGTTTTTATACAAAATAAGAGGACAAGAATTTCATTTTTTTTTTTAGTATTTATTTATGTACTTTTATTTTCCACAGA 
AACACTGCCTTTTTATTTATATGTATTGTTTTCTATGGCACTAGGGGAAAAACATATCTGTTCCAAGAAAATAAACTAGTTCTCAGAGCCTTGATTT
TCCTGGTCAGGGTGAAGTTCCCTGTGTGTCTGTAAAATATGAACAAGGAT^ 
TCGAAAAAAAAAA 
ATGCCACGGCTCCTGACGCCCCTTCTCTGCCTAACGCTGCTGCCCGCGCGCGCCGCAAGAGGCTTGAGATGCTCCCAGCCAAGTGGGACCTGCCTGA
ATGGAGGTAGGTGCGAAGTGGCCAGCGGCACTGAAGCCTGTGTTGCCAGCGGCAGCTTTGTGGGCCAACGATGCCAGGACCCCAATCCTTGCCTCAG
CACACGGTGTAAGAATGCTGGAACGTGCTATGTTGTGGACCATGGTGGCATCGTGGACTATGCCTGCAGCTGTCCCCTGGGTTTCTCTGGGCCCCTC 
TGCCTGACACCTCTGGACAAGCCCTGCCTGGCCAACCCCTGCCGCAATGGGGGCACCT 

CTCCAGGGTGGTC^GGAAAAT^TGTCAGCAGGCTGACCCCTGTGCCTCCAACCCCTGTGCCAATGGTGGCCAGTGCCTGCCCT^ 

CATCTGTCGCTGCCCGCCTGGCTTCCATGGCCCCACCTGC^GGCAAGATGTTAATGAGTGCAGCCAGAACCCTGGGCTGTGCCGCCATGGAGGCCAC 

TGCCACAATGAGATCGGCTCCTATCGCTGTGCCTGCTGTGCCACCCATACT^ 

GCCAGAATGGAGCAACCTGCCGTCCTACAGGGGACACCACCCACGAGTGTGCCTGCTTGCC^GGTTTTGCTGGACAGAACTGTGAAGAAAA 

TGACTGTCCAGGAAACAACTGCAAGAATGGGGGTGCCTGTGTGGACGGCGTGAATACCTACMTTGCCGCTGCCCACCGGAGGTGACGGGTCAGTAC 

TGTACAGAGGATGTGGACGAATGTCAGCTCATGCCAAATGCCTGCCAGAATGCGGGAACCTGC 

TCAATGGGTGGACTGGCGAGGACTGCAGTGAGAACATTGATGACTGTGCCAGTGCCGCCTGTTTCCAGGGTGCCACTTGCa^CGACCGTGTGGCTTC 
CTTCTACTGCGAATGTCCGCATGGGCGCACAGGTCTGCTGTGCCACCTCAAGCATGCGTGCATC^GCAACCCCTGCAACGAGGGCrCCAACTGTGAC 
ACCAACCCTGTCAACGGCAAACGAATCTGCACCTGCCCCTCGGGGTACACAGGGCCAGCCTGCAGCCAGGACGTGGATGAGTGTGATCTGGGTGCCA 
ACCGTTGTGAGCACGCAGGCAAATGCCTCAAC^CACTGGGTTCTTTTGAGTGCCAGTGTCTAC^GGGCTACACGGGACCCGGCTGTGAGATTGATGT 
TAATGAGTGCATCTCCAACCCATGTCAGAATGACGCCACTTGCCTGGACCAGATTGGGGAGTTCCAATGCATATGTATGCCAGGTTATGAAGGTGTA 
TACTGTGAAATCAACACGGATGAGTGCGCCAGCAGCCCCTGTCTGCACAATGGCCACTGCATGGACAAGATCCATGAGTTCCAATGTCAGTGCCCCA 
AAGGCTTCAACGGGCACCTGTGCCAGTATGATGTGGATGAGTGTGCCAGCACACCATGCAAGAACGGTGCCAAGTGCCTGGATGGGCCCAACACCTA 
TACCTGCGTGTGTACAGAAGGTTACACAGGGACCCACTGCGAAGTGGACATTGACGAGTGTGACCCTGACCCCTGCCACTATGGTTCCTGTAAGGAT 
GGTGTGGCCACCTTTACCTGCCTGTGCCAGCCAGGCTAC^CAGGCCATCACTGTGAGACCAACATCAATGAGTGCCACAGCCAACCGTGCCGCCATG
GGGGC^CCTGCCAGGACCGTGACAACTCCTACCTCTGCTTATGCCTCA^ 

CAACCCCTGTGACTCTGGCACCTGTCTGGACAAGATTGATGGCTACGAATGTGCCTGTGAACCAGGCTACACAGGAAGCATGTGTAACGTCAACATT 
GACGAATGTGCGGGCAGCCCCTGCCACAACGGGGGCACTTGTGAGGATGGCATCGCGGGCTTCACTTGCCGCTGCCCCGAGGGCTACCATGACCCCA 
CGTGCCTGTCCGAGGTCAACGAGTGCAACAGTAACCCCTGCATCCACGGAGCTTGCCGGGATGGCCTCAATGGGTACAAGTGTGACTGTGCCCCTGG 
GTGGAGTGGAACAAACTGTGACATCAACAACAACGAG 

TGCACCTGCCGAGAAGGCTTCAGTGGCCCTAATTGCCAGACCAACATCAACGAATGTGCCTCCAACCCCTGCCTGAACCAGGGGACCTGCATTGATG 

ATGTCGCTGGATACAAGTGCAACTGTCCTCTGCCATATACAGGAGCCACGTGTGAGGTGGTGTTGGCCCCATGTGCTACCAGCCCCTGCAAAAACAG 

CGGCGTATGCAAGGAGTCTGAAGACTATGAGAGTTTCTCCTGTGTCTGTCCCACAGGCTGGCAAGGTCAAACCTGCGAGGTTGACATCAATGAGTGT

GTGAAAAGCCCATGTCGCCATGGGGCCTCCTGCCAGAACACCAATGGCAGCT^^ 

GTGACATCGATGACTGCCGCCCCAACCCGTGTCACA 

GGGTGCCTTCTGTGAGGAGGACATCAATGAATGTGCCAGCAATCCCTGCCAAAATGGAGCCAATTGCACTGACrGTG 

TGCCCCGTGGGCTTCAATGGCATCCACTGCGAGAACAACACACCTGACTGTACTGAGAGCTCCTGCTTCAATGGTGGTACCTGTGTGGATGGTATCA
ACTCCTTCACCTGTCTGTGTCCACCTGGCTTCACGGGCAGCTACTGTCAGTATGATGTCAATGAGTGTGATTCACGGCCCTGTCTGCACGGTGGTAC
CTGCCAAGACAGCTATGGTACTTATAAGTGTACCTGCCCACAGGG 

TGCAAGAATGGTGGCAGGTGCTGGCAGACCAACACGCAGTACCACTGTGAGTGCCGCAGCGGCTGGACTGGCGTCAACTGCGACGTGCTCAGTGTGT
CCTGTGAGGTGGCTGCACAGAAGCGAGGCATTGACGTCACTCTCCTGTGCCAGCATGGAGGGCTCTGTGTGGATGAGGGAGATAAACATTACTGCCA 
CTGCCAGGCAGGCTACACGGGCAGCTACTGTGAGGACGAGGTGGACGAGTGCTCACCTAACCCCTGCCAGAATGGAGCTACCTGCACTGACTATCTC 
GGCGGCTTTTCCTGCAAGTGTGTGGCTGGCTACCATGGGTCTAACTGCTCCGAGGAGATCAACGAGTGCCTGTCCCAGCCCTGCCAGAATGGGGGTA 
CCTGCATTGATCTGACCAACTCCTACAAGTGTTCCTGCCCCCGGGGGACACAGGGTGTACACTGTGAGATCAATGTTGATGACTGCCATCCCCCCCT
TGACCCTGCCTCCCGAAGCCC<^GTGCTTCAACAATGGCACCTGTGTGGACCAGGTGGGTGGCTATACCTGCACCTGCCCACCAGGCTTCGTCGGG 
GAGCGGTGTGAGGGTGATGTCAATGAATGTCTCTCCAACCCCTGTGACCCACGTGGCACCCAGAACTGTGTGCAGCGTGTTAATGACTTCCACTGCG
AGTGCCGGGCTGGCCACACTGGACGCCGCTGTGAGTCAGTCATCAATGGCTGCAGGGGCAAACCT

CAACACCGCCCGTGGATTCATCTGTAGGTGCCCTGCGGGCTTCGAGGGTGCCACATGTGAGAATGATGCCCGCACTTGTGGCAGCTTACGCTGTCTC
AACGGTGGTACATGCATCTCGGGCCCACGTAGTCCCACCTGCCTATGCCTGGGATCCTTCACCGGCCCTGAGTGCCAGTTCCCAGCCAGCAGCCCCT
GTGTGGGTAGCAACCCCTGCTACAATCAGGGCACCTGTGAGCCCACATCCGAGAACCCTTTCTACCGCTGTCTATGCCCTGCCAAATTCAACGGGCT
ACTGTGCCACATCCTGGACTACAGCTTCACAGGTGGCGCTGGCCCGGACATTCCCCCACCGCAGATTGAGGAGGCCTGTGAGCTGCCTGAGTGCCAG
GTGGATGCAGGCAATAAGGTCTGCAACCTGCAGTGTAATAATCACGCATGTGGCTGGGATGGTGGCG

AGAACTGCACGCAGTCTCTACAGTGCTGGAAGTATTTTAGCGACGGCCACTGTGACAGCCAGTGCAACTCGGCCGGCTGCCTCTTTGATGGCTTCGA
CTGCCAGCTCACCGAGGGACAGTGCAACCCCCTGTATGACCAGTACTGCAAGGACCACTTCAGTGATGGCCACTGCGACCAGGGCTGTAACAGTGCC
GAATGTGAGTGGGATGGCCTAGACTGTGCTGAGCATGTACCCGAGCGGCTGGCAGCGGGCACCCTGGTCCTGGTGGTGCTGCTTCCACCCGACCAGC 
TACGGAACAACTCCTTCCACTTTCTGCGGGAGCTCAGCCACGTGCTGCAC^CCAAC^

CCCGTACTATGGCCACGAGGAAGAGCTGCGCAAGCACCCAATCAAGCGCTCTACAGTGGGTTGGGCCACCTCTTCACTGCTTCCTGGTACCAGTGGT
GGGCGCCAGCGCAGGGAGCTGGACCCCATGGACATCCGTGGCTCCATTGTCTACCTGGAGATCGACAACCGGCAATGTGTGCAGTCATCCTCGCAGT
GCTTCCAGAGTGCCACCGATGTGGCTGCCTTCCTAGGTGCTCTTGCGTCACTTGGCAGCCTCAATATTCCTTACAAGATTGAGGCCGTGAAGAGTGA
GCCGGTGGAGCCTCCGCTGCCCTCGCAGCTGCACCTCATGTACGTGGCAGCGGCCGCCTTCGTGCTCCTGTTCTTTGTGGGCTGTGGGGTGCTGCTG
TCCCGCAAGCGCCGGCGCCAGCATGGCCAGCTCTGGTTCCCTGAGGGTTTCAAAGTGTCAGAGGCCAGCAAGAAGAAGCGGAGAGAGCCCCTCGGCG
AGGACTCAGTCGGCCTCAAGCCCCTGAAGAATGCCTCAGATGGTGCTCTGATGGACGACAATCAGAACGAGTGGGGAGACGAAGACCTGGAGACCAA
GAAGTTCCGGTTTGAGGAGCCAGTAGTTCTCCCTGACCTGAGTGATCAGACTGACCACAGGCAGTGGACCCAGCAGCACCTGGACGCTGCTGACCTG
CGCATGTCTGCCATGGCCCCAACACCGCCTCAGGGGGAGGTGGATGCTGACTGCATGGATGTCAATGTTCGAGGACCAGATGGCTTCACACCCCTCA
TGATTGCCTCCTGCAGTGGAGGGGGCCTTGAGACAGGCAACAGTGAAGAAGAAGAAGATGCACCTGCTGTCATCTCTGACTTCATCTACCAGGGCGC
CAGCTTGCACAACCAGACAGACCGCACCGGGGAGACCGCCTTGCACTTGGCTGCCCGATACTCTCGTTCAGATCGTCGAAAGCGCCTTGAGGCCAGT
GCAGATGCCAACATCCAGGACAACATGGGCCGTACTCCGTTACATGCAGCAGTTTCTGCAGATGCTCAGGGTGTCTTCCAGATCCTGCTCCGGAACA 
GGGCCACAGATCTGGATGCCCGAATGCATGATGGCACAACTCCACTGATCCTGGCTGCGCGCCTGGCCGTGGAGGGCATGCTGGAGGACCTCATCAA
CTCACATGCTGACGTCAATGCCGTGGATGACCTAGGCAAGTCGGCTTTGCATTGGGCGGCCGCGGTGAACAATGTGGATGCTGCTGTTGTGCTCCTG 
AAGAACGGAGCCAACAAGGACATCGAGAACAACAAGGAGGAGACTTCCCTGTTCCTGTCGATCCGCCGTGAGAGCTATGAGACTGCCAAAGTGTTGC
TGGACCACTTTGCCAACCGGGACATCACGGATCACATGGACCGATTGCCGCGGGACATCGCACAGGAGCGTATGCACCACGATATCGTGCGGCTTTT 
GGATGAGTACAACCTGGTGCGGTCCCCACAGCTGCATGGCACTGCCCTGGGTGGCACACCCACTCTGTCTCCCACACTCTGCTCGCCAAATGGCTAC
CCTGGCAATCTCAAGTCTGCCACACAGGGCAAGAAGGCCCGCAAGCCAAGCACCAAAGGGCTGGCTTGTGGTAGCAAGGAAGCTAAGGACCTCAAGG
CACGGAGGAAGAGTTCCCAGGATGGCAAGGGCTGGCTGTTGGACAGCTCGTCGAGCATGCTGTCGCCTGTGGACTCCCTCGAGTCACCCCATGGCTA 
CTTGTCAGATGTGGCCTCGCACCCCCTCCTCCCCTCCCCATTCCAGCAGTCTCCATCCATGCCTCTCAGCCACCTGCCTGGTATGCCTGACACCCAC 
CTGGGCATCAGCCACTTGAATGTGGCAGCC^GCCTGAGATGGCAGCACTGGCTGGAGGTAGCCGGTTGGCCTTTGAGCACCCCCCGCCACGCCTCT 
CCCACCTGCCTGTAGCCTCCAGTGCCTGCACAGTGCTGAGTACCAATGGCACCGGGGCTATGAATTTCACCGTGGGTGCACCGGCAAGCTTGAATGG 
CCAGTGTGAGTGGCTTCCCCGGCTCCAGAATGGCATGGTGCCCAGCCAGTACAACCCACTACGGCCGGGTGTGACGCCGGGCACACTGAGCACACAG 
GCAGCTGGGCTCCAGCATAGCATGATGGGGCCACTACACAGCAGCCTCTCCACCAATACCTTGTCCCCGATTATTTACCAGGGCCTGCCCAACACAC 
GGCTGGCAACACAGCCTCACCTGGTGCAGACCCAGCAGGTGCAGCCACAGAACT^ 

CCTCAGTGTGAGCTCGGCAGCCAATGGGCACCTGGGGCGGAGCTTCTTGAGTGGGGAGCCCAGTCAGGCAGATGTACAACCGCTGGGCCCCAGCAGT 
CTGCCTGTGCACACCATTCTGCCCCAGGAAAGCCAGGCCCTACCAACATCACTG 

CCCCTCCATCACAGCACAGTTACTCCTCCTCCCCTGTGGACAACACCCCCAGCCACCAGCTGCAGGTGCCAGAGCCCACTTTCCTCACCCCATCCCC 
TGAGTCCCCTGACCAGTGGTCCAGCTCCTCCCCGCATTCCAACATCTCTGATTGGTCCGAGGGCATCTCCAGCCCGCCCACCACCATGCCGTCCCAG 
ATCACCCACATTCCAGAGGCATTTAAATAA

HUMAN SEQUENCE - GENOMIC 

CAGGCCAGGGAGATAATGGGGGACAGGGAAGATAACGCAGTCCCCACX3GCT 

GCTGTGCACGTGGTTCCTACATAGCCTTTGCCAGACACCCTGCGTGGCCCTGGAGGTCTCCAGGCCGTGCCCCGTGCCCCGACACGCAGGATGTGCT 
GGGGGCCTGGGCCTGTCCCCTCCATCGCAGCCCTTGTTTGTTGGCCTCCCGTGATGGGCGTTTGCTGCCTGTGACACGGAGCCCCAGGATTAGTTTC 
AGGCCTCGGTCTCAGGCCAGGGCCTTTCCTCGTCACCCAGAGATGAGGGGGCTTGCGTCCGGGGTGGGCCCAGGTCCCTGCAGTGTCGTGGAGCCCA 
CCCGACTGAGGCCGCCTCTCCTGGTGCAGTCCTTGTTCCTGGGCACGCCTGGTGGGGTGAGGCCTAGGCTGTGCCCCGCTGCCAACCGGGAATGAGG 
GGTGGCTTCTGAGCGTGACATTTGTGCAGCTTGTCTTCAGCGTCCCCCCATCTGTGCTCCACTCCTCCCTGGGTCAGGGTCCTGGCACAGCCGGGAG 
CGCTCAGGGGTCTCGGTGCACATTTGCCTCCCGGCGGGACCAGCAGACGGCACTCTGATGGCGGAAAGACCAGCAGGCGGTGGCCGATTTGGGAGAT 
CCCTCTGGGTGAGGCTCCCGGGGTCACGTGTGTCTCCTTCCCGCAGGTGCCACGTGTGAGGTGGTGCTGGCCCCGTGTGCCCCCAGCCCCTGCAGAA 
ACGGCGGGGAGTGCAGGCAATCCGAGGACTATGAGAGCTTCTCCTGTGTCTGCCCCACGGGCTGGCAAGGTGAGGCTGGCCAGGGCCCGGTGAGGGC
TGGGATGGGAGGTCAGGATGTCTGCGGGACACAGGCAGCTCCCAGGCAGGCTAGATGAGTCTTTGAAGAGGAGCCGGTGGGTGCTGAGGAGGCCCTG
GTCGGAGAAGTTCTGGAATCAGGAATTGACCTGGGAGCACCGTTCCCAACTCCAGTTCCTGTGACCTTCTTAGGCCAAAATTAGGGGAGAGGGGATG 
GTCCTGGGGTCACGGAAGCCCACTCCTGGGTCGGGAGAGGCACTGTAGGTGGGTGGGCCAGCCTGGGAAGGGCCTGGAGGGCCAGGGGCCGCTGGTG 
ACCAACCGGCCTCCTCCTGCCCCACAGGGCAGACCTGTGAGGTCGACATCAACGAGTGCGTTCTGAGCCCGTGCCGGCACGGCGCATCC 
CACCCACGGCGGCTACCGCTGCCACTGCCAGGCCGGCTACAGTGGGCGCAACTGCGAGACCGACATCGACGACTGCCGGCCCAGTGAGTAGCGCCGC 
GGCTCTGGCCTCCTCCAGGAAGCTCTCAGGCCrCAGTTCCrc 

TGGCCGGGGAGGGTTGGGTCAGGTGTGGGGGATGTGCTGAGGGATGGCCAGGGTGGTTCAGGAGGGCCCCAGAGCCGGCTTGCTCCTCCCTGCACTT 
CGCGGAAGAGTCACGAGAGCCCCTCCCACGGCTCTTCACTGAGCCGAGGATGGCTCGGGCCCGGCCTGCACTCCCGCCTGGCTCATCGGGCCCCGCT 
GCAGGCCAGGGCTCTCAGGCCTCCCAGCCCTGCTTCACAGAGGTTGAGGTTGGCAGAAGCCGGGGGTCTTGCTGGCATCACCTGGCAGGCAGAGGCA 
AGAACTGTCTCTCCTCCCCTGCTCGGTCTGTGAAGCCTGCAAAGCTGCCCCCTGCCCTGGAGCTTGAGGACAGGAGCAGGTGGGGAGAGAGACCCCA
AGCACAGGAGACGGGTGTGACGCAGCCTGTGGGTGCTGGGGCTCCCCACCAGACACCTTTGTCACAGGGCTGCTTGCCGCAGCCTCGGCAACGA 
GCTGGGCTGTCCCCAGCCATGCAGCCTTCCCGGCCCCCTCCCGCAGGTGTGGGTTTGTGCCTGCCCCTCACACTCACCCTTCCGTCCTCTCCCAGAC 
CCGTGTCACAACGGGGGCTCCTGCACAGACGGCATCAACACGGCCTTCT^ 

ACGAGTGTGCCAGTGACCCCTGCCGCAACGGGGCCAACTGCACGGACTGCGTGGACAGCTACACGTGCACCTGCCCCGCAGGCTTCAGCGGGATCCA 
CTGTGAGAACAACACGCCTGACTGCACAGAGAGGTGTGCGGGGCTCGAGTGAGGCCGTGGAAGGGAACGGGCGGTGCGGGCCACACGCAGTAGCTGG 
CAGCCTGGCACACCATGCAGGCGTCCGCCTAGGCCGGCTGGGCACTTAGCGCTGGCTGCATTGAGTCCAGACATTGTTCATTGTACCGACCTGTCCC 
TCCTGAGAGGGCCATTGTGACCTCCTCCTGTGTCCCGCACCAGGAGCGCCGGTAGTCTGTCCTGAAGAACTGGTGCGCTTCCTTCTATTAGACAACG 
AGAACTCTGTCCGTTCTTGTTTCCTTTTTGCCGTGTTTGCCAGGAAGCTCATTTGGTAAACGTCAGTCTCGTCCCTGGTTTCTGACGTAATTACCTC 
CCGTGTTACCAGGACGGTTTCTGGTTTCTCCCTCATTTACTTTAACCACCCTATGCCCACGCGTTTGGTAAAACCACTCTCGACCTGACTTATAAAG 
AAAGCAGGGGAGGGAGGTGTGACGTGGTGTGAGAGCCGGGCCCCGGGTTCCCACTGGCCTCCCTGGGTCAGCCAGGCTGCCCTGGGCCTGTCCTGGC 
CATCAGGCCCCTAGGGTTGAGCAGAAGGGGAGGTGCTGGCGAGGCAGAATCTGCTGGAGCGGGGACCCACCAATGCCCTCCGCTCAGCCCCCGCCTG 
CCCACCCCCTTGCAGCTCCTGCTTCAACGGTGGCACCTGCGTGGACGGCATCAACTCGTTCACCTGCCTGTGTCCACCCGGCTTCACGGGCAGCTAC 
TGCCAGCACGATGTCAATGAGTGCGACTCACAGCCCTGCCTGCATGGCGGCACCTGTCAGGACGGCTGCGGCTCCTACAGGTGCACCTGCCCCCAGG 
GCTACACTGGCCCCAACTGCCAGGTGAGTGCGCCGGCCACAGAGGTGCCCGAAGGAGGGGCCCTGGGTGGGTGCCTGCCTGCGGGAGGTGGGAACGG
TCACCCCCAGGTCCCACTGTGTCGGCTCGGGGTCACCCCCACACCCCCGGGCAGAGGGTTTCTGGGGATCTGAGCAGCCGGTGAAGGAACCTGATGC 
GGAAGCAGCAGGCACCTTCGTTTCAATCCCAGGTTTCTGGAGCCGGGGCGGGAGCTCAGGAATTGGGGAATTGAGGAAAAGTGTTCCTTCTAGCAAA
GGCCGAGGGTGGTCTGGACCTGCTGAGGGCCCTGAGGGAGACCCAGCCCAGGCTGTTCCTGGTGTGGGGGTGGTGGGGAGTCCAGGGGCGGCAGTCT
CCACTTCTGTAGAATGGGTTGCAGCCTGGGTTGGAGTAGGCCCCTTGGCGGATGTGCGTTCTGAGCTACCGGGGAAATGGCCGGGGCGCGGGCACCC
AGCTGACCCCAATCTGTCCCCAGAACCTTGTGCACTGGTGTGACTCCTCGCCCTGCAAGAACGGCGGCAAATGCTGGCAGACCCACACCCAGTACCG 
CTGCGAGTGCCCCAGCGGCTGGACCGGCCTTTACTGCGACGTGCCCAGCGTGTCCTGTGAGGTGGCTGCGCAGCGACAAGGTAACCTGCTGTGCCCA
CCCGGCTCGGGTCCCAGCCCATCAAGGTCCTCTGTGGGCCTGGGCCTCACCTGTCTACCACCCCATCCCCCGCAGGTGTTGACGTTGCCCGCCTGTG
CCAGCATGGAGGGCTCTGTGTGGACGCGGGCAACACGCACCACTGCCGCTGCCAGGCGGGCTACACAGGCAGCTACTGTGAGGACCTGGTGGACGAG
TGCTCACCCAGCCCCTGCCAGAACGGGGCCACCTGCACGGACTACCTGGGCGGCTACTCCTGCAAGGTGGGGGTCCCTCCTAGGGTAAGGGTTGTGG
CCGGCACGAGTGTTGCCACACACCAGGCCCTGGCTGGGAGCTGGCCCAGTGGAGAAAACTGAACCTGATAGGCCCATGCACTGTTCAGTCTCATTAG 
GGGAGGGCTGGGGTAATCAGGGTAGACTGCCTGGAAGAGGTGGCCTGTGGAAAGCCTGAAGGAGGGTACCTATACTGAGAAGTGGGATGGGGTTTTC
CCTTCCCCTAGATTGTGTCTGGGCTTGGCCAACACCTACCCTGAGGCCCTCACCTCTATCCTATGGGACGGGGTCCACCCACCCCCAACAGGCAGTG 
ACTCCGGTCACCGAGGCCCCGCCAGGGTCTGTTGGGCTGGGTCTCTCTCCAGGTCTGACAGGAGCGAGGGGCCCGTGGCTTCGCTGGACCTGAGGGC
AGCTCATGTGGCCCTGTCAGCCCTCACAGTGGGGTGTGGGAGCACTGCATCCTGGCGCCGGCTGAGCCGAAGGGCCCCTCGTTCTGTCGCCTGCACA
GTGCGTGGCCGGCTACCACGGGGTGAACTGCTCTGAGGAGATCGACGAGTGCCTCTCCCACCCCTGCCAGAACGGGGGCACCTGCCTCGACCTCCCC 
AACACCTACAAGTGCTCCTGCCCACGGGGCACTCAGGGTAAGGGCCGCTGCACGGAGGGCTG^TGTTGGCCATCCATGGCCAGGGCAGGGGCAGGGC
AGGCACCCCGGGACCGGCCAGAGGCATCCAGGAACCAGCCAGAAACTCCCGTTTTCCTGTGTGAGGCCAAGGCCGACTCACAGCTGCCCAGGGAAAG
GGCCCAGAGCGGGGGTCCCAGTGGGAAGGGCGTCTCCACGGCACCCTTGACACCTGCCTCTCCCGAGTGTCCGTGCAGCCCCAGACCTGAGCGCTTG
TCTTCCGGGACGGACACGCGGCACGGCAGGGCCGGGGTGTGGCGGGCTTGGGCCACTGACGAAACCTGGCCCCGC^
NNNNNNNNNNNNNNNNNNNNNNNNN NITONS 
NNNNNNNNN^NIWNNNNNNNNNNNNNNNNNNNN^ 
NNNNITONNNInMNNNNNNNNNNNNNNNNNNNNNNNNNN 
N^NNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNIWNNN^ 

NNNNNNNNNNNNNNNmNNNNNNNNN^NNNNNNNNNNNNNNNNNNNNNNNNN^ 



ISTNNNlMtfNNNNNNNNNNNNlTONNNNNN 

NNNNNNNNNNNNNNNNNNNNNimNNN^ 

NI^NNNNJSTNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNI^NNNITONNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 



NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
Nl^NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NI^NNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNN^NNNNNNNNNNN^ 



NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNl^Nin^NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNtWNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN)^ 

NNNNNNNNNNNNNNNtHtfNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNKNNN^ 

NNNNNNNNNNNNl^NNNNNNNNNNNNNN NITONS 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNN^NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNN^NNNI^ 

NNNNITONNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNfWNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNN^NNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNIST^NNl^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN]^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNN^NNNIWNNNNNNNNKNNNNNNNNOTTONNNNNNI^NNNNNNNNNNNNNNNN 

NNNNNNNNNNNNIWNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNHKNNNNNNNNNNNNN^ 

OTTONNNNNNNNNNNNNNNNNNimNNNNNNN^ 

NNNNITONNNNNNNNNNNNNNNNNNNNNN^^ 

NNNN1WNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNmNNNNNNNNNNN^^ 

NNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNC CCGGGCCJAGGGGTCTCT GGGGG CTT CCT AGGGAGCTCGCTCAGCCTCACTT CT CGAC 

CCCTCACCCCCCAGGCCCCTGAAGAACGCTTCAGACGGTGCCCTCATGGACGACAACCAGAATGAGTGGGGGGACGAGGACCTGGAGACCAAGAAGT

TCCGGGTGAGTCGCGAGGCTCCCGGGCTCCTGGGCTCCCGGGCACCTGCTGCCGGGCTGCCCTGACAGGCTCTGCTCACTCCCTCTATGTAGTTCGA 

GGAGCCCGTGGTTCTGCCTGACCTGGACGACCAGACAGACCACCGGCAGTGGACTCAGCAGCACCTGGATGCCGCTGACCTGCGCATGTCTGCCATG 

GCCCCCACACCGCCCCAGGGTGAGGTTGACGCCGACTGCATGGACGTCAATGTCCGCGGGCCTGGTAAGGGTGCCAGCAGCCAGGGCTTCCCTAGCC 

CCGTGGCCCACCTGCCTCTCTCCCCTAAGCCCCGAGGCTGGGGTACAGTTGATTCTCTGGAAACTTAGAATTGGGGGTGAGAGCTTTCATCCTTGGG 

GTGTTTCCCATTCAGAGTAGACGTGGGGGGGTCCTGGAGTCCTCCTGTTCTTCCACCAACCCCTTCCTGGGGTGATACCGCAGGGCCACTCTGTCCC 

TGTAAATTACTCTTTTCTGAAACCTTCTTGAGACATGGAAAGCGTTGATTTTCTTTTTCTTTTTTTTTTTTCTTTTTTTTTTGTTTTTGAGATGGAG 

TCTCGCTCCGTCACCCAGGCTGGAGTGCAGTGGCACGATCTCGGCTCCCTGCAACCTCCACCTCTCGGGTTCAAGCAATTCTCCTGCCTCAGCCTCC 

CCAGTAGCTGGGACTACAGGCGCCTGCCACCACACCTGGCTAATTTTTGTATTTTTAGCAGCGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTCG 

AACTCCTGACCTCAGGTGATCCGCCCACCTCGGCCTCCCCCAGTGCTAGGATGACAGCGTGAGCCTCCGCATCCTGCTGAAGCATTAATTTTCTAAC 

TGCATGCTTCTGGGGTCCTCTTTTTCCTGGGTGGATTTTGGGTGCCAGGTCTTGGGCAGGTCAGGAAGCAGTTGCCCGCCAGGAGTTTAAGCTGGAT 

TCGGCTCTGTCCACTGAGCAGCCCAGGCGGACTTCAGCTGCCCCTTGGTGGCTGTGTGCACGGGGCCACCAAGTGCTGGGTGGTGCATCCACACCGT 

GGCCCCTTGAGCTTGGGGCTGCTGCCCCCCTCCCCCTGGGCTGCAGCTGGGGACCGGCCCTCCAGACTGAGCACCCGTCTCTGCCTCTGCAGATGGC 

TTCACCCCGCTCATGATCGCCTCCTGCAGCGGGGGCGGCCTGGAGACGGGCAACAGCGAGGAAGAGGAGGACGCGCCGGCCGTCATCTCCGACTTCA 

TCTACCAGGGCGCCAGCCTGCACAACCAGACAGACCGCACGGGCGAGACCGCCTTGCACCTGGCCGCCCGCTACTCACGCTCTGATGCCGCCAAGCG 

CCTGCTGGAGGCCAGCGCAGATGCCAACATCCAGGACAACATGGGCCGCACCCCGCTGCATGCGGCTGTGTCTGCCGACGCACAAGGTGTCTTCCAG 

GTAGGCAGTGGCTGCCTGTGTGCCCACCTGCCCTCCTCAGGGCCGCCTGGTGGTCTGGGGCAGTGGCCAGGCTTACGTGGCCCTGGGAGCCTGACCC 

CGAGCACAGCTGAGTCCGGGACAACTGGTGCCTCCACCTGGGACCTTCGCAGTCAGCGAGGTGCGAGGGGGAGGGCGTCGGGCCCATCTGTGTTCTC 

CAGGGAAGTCAGGCAGAGGCGGGTCTGGCAGGAGGCCTGGGGGATCTGCTGAGTGAGGCAGCACCTCCCCACCCCCAGCAAAACAGGCTCCATCAGG 

GTTGTGGGCCTTGCTCAAGGTCCAGGTTCCACTGCTGCAGCCCCTCGCAGCCCCGCCCCTCCCTCAACCTTGGCTGCCGGCGTAGCCTGTGGCAGTG 

AGAAGCAGGGTTTAGAGGCTGCCGCTCGGTGCCTGCAGACCTAGGGCTCAGCTTGCCGGTGAGCTCGTGGCAAGAATGGATTTAGGGATTTGGATGC 

CTGGGTCTCCAGGGAGTGTCCCTGGCAGGGGCTGCCTTTGCAGTCACCCCTGCTGCGAGTCCCCAGGCTCCAGGCGGCCCTGGAGCAAGCAGGTTCA 

GATGGGCACAGCCCGGGGAGTTACCACAGAGCTTCTCATTTCCTGATTTCTTAGCTCAGGTGACACATTGTCATCTTGCAGAAATAAATGTAGGTGA 

GCAGAAAGAGCGTGGAAGGGAGCCCCGTGCGGGTTTGGTGTGTGCCTCTCTCAGCTGCCTTCTTTGCACAGATGTGGAAGTTCCCAGGTGCTTTGCA 

GGAATCAGGCCAAGTTGCCTTTTGCACCCGCCAGTGAGCAGGGGCCATTCCTCCTGCCAGATGCCAAGACCCGGCTGCATTACAAGGGCTGCCCACC 

CCCTCTCTGGGCAGAGCCGGACACTAGCTCGGCGGTTCTGAGACGGCTGTAGGGCCGTGAGCCCGGCTTCCTGAGTGCCAGCTGTAACCGCCGTTGG 

GGGCAGGGACTTGACCTCTCTGTTTTGTAGGTGGTCATGGCAGCTAGGACCACAGTGAGGATTAAATGAGGCCACACGTGGTCACGATGGATCCCAC 

TGTCACCAAGGGGCCTGTCTGTGGGCAGCACAGCCCCTGCCCCCATCCGGGCCCCTCCTCAGGGGCAGCCCCATGGCGTTTCGTCTTGCCCGGCCGT 

GCCGATCTCAGGAGGGTCTCGTCTGTGTCCGGAAGACAGTGGGGCTTCCCGTCCAGGCGTTCGTTCTGGGCAGGAGGCATCGGTGTACGTCTGCCCA 

GCACCCGCCTGAGCCTCTCCCTGTTGCCCAGATCCTGATCCGGAACCGAGCCACAGACCTGGATGCCCGCATGCATGATGGCACGACGCCACTGATC 

CTGGCTGCCCGCCTGGCCGTGGAGGGCATGCTGGAGGACCTCATCAACTCACACGCCGACGTCAACGCCGTAGATGACCTGGGTGAGCCCACGGGGG 

CACGGCTGCTCTGTCGTGGGGCGGGACCGCCACAGGGACGGGTGGGACTGGGTTGCCTCCAGCTGGTCTCCAACCTACCCCATCTGCTTCTTTCACG 

CAGGCAAGTCCGCCCTGCACTGGGCCGCCGCCGTGAACAATGTGGATGCCGCAGTTGTGCTCCTGAAGAACGGGGCTAACAAAGATATGCAGAACAA 

CAGGGTGAGCGCGAGGCTGGGATGCCAGGGGAGACGTGAGGGCTGAATCCACAGAGAAGTGAGGGCCAAACCCACGGGGCGCAGGGACACAAGGGCC

TGACCCACAGGGTCTCGAGGGTCGCTGGGCCCGCATGTGGGCCCCACCCGCATGATGCGGGCACCTCGTTAGTGCTCCTGCCCTCCTTCAGCCCCTG 

TTTATGGAGCCCTTCCAAGTGCAGGTCCAGCTGTCAGGACACGGCGGCTCGCCCCTCAGACCCAGCCCTACCCTCCTGGGCGGCAGTCATGGCAGGG 

CACACAGCCTGGCGTGGGGAACGCTGCTGCTGCCACTGCTGTGTCACCGTCACCTGGACCTCACCTTCTGTCCCCAGCTGTCACCAGGCGGGGGTGG 

GGATGGGAATGTGGTGACGCAGGTGGTGCAGATTGAGCCAGAACACGCGTGGCGGCAGCTCCCTGCGGGGCGGGGCCTCTTGGTGTGTTCACCAAGG 

CCAAGGACCTCAAGGCTCAGAGGAAGAAGTCTCAGGACTATATCCAAGGGGAGCCACCCCC^ 

GCTGCTGTTGGTTTCCCCTGGGTGCTCAGGGCACAGGTGCAGA(^CCCCCACCTCCCTGCCGCCAGAACCCCCACCCCCGCCCCCAACTCCTGCTGC 
CCCCTTGGCATGTCAGGCTCAGGCGTCTCTCCCTCCTGGGTGAGGGCACACAGCGGGCCTGGGCACCGGGGCATGTTGGCCCAGGCGCTCCCCAGTC 
CGTGGCAGCATGGCCCCACAGAGACTGGGCCCCAGAGGGCATCAAGGCCTGGGAAGCCCCTTCCCACTCCCACACAGCAGCCTCACCCAGACCTGTG 
ACGTGTCCACCGCACAGAGGAGACCCCAGGAAGGAGCTTGGTGGCCACCAGGGTGGCTTGATGGCCGTCCAGATGACGAGCTCCCTCCCTGGTGCCC 
TCGCCGGTGCCTCTGAACCGCTCCAGGAATTTCCTTTTGTGCCTTATTGGGGGCAGGGAAGAGGGCCTGCAGTTGGTTAGATTTTCAGTGGGGTCTG 
TGACCCCCCCATAGAGGTGGAGCCCCGCTGATCTAGGGTAGAGGACTGCACAGATCCCCTCTCTGGGTGGGTTTCAGAAGATGTATCAAAGCCTTAA 
CATTTAACAAGAGTCAGGCTAGGTGGTTGCAGGACGCTGGGGTGGGGTCCT 

TGCTTCCTCTGGTGATGGAACCTTGGGGAGGGTCCCCACGCCTGGCCTGGCCCCCCTCACCGGCCCCCGCCCTCATCCCCCAGGAGGAGACACCNNN 
NNNNNNNNNNNNIWNNNNNNNNNNNNIWNNN^
NNNNNNNNNNNNNNNftNNNNNNNNNNNNlWNlTON^ 

GCACAGGAGCGCATGCATCACGACATCGTGAGGCTGCTGGACGAGTACAACCTGGTGCGCAGCCCGCAGCTGCACGGAGCCCCGCTGGGGGGCACX3C 

CCACCCTGTCGCCCCCGCTCTGCTCGCCCAACGGCTACCTGGGCAGCCTC^GCC^ 

CCTGGCCTGTGGAAGCAAGGAGGCCAAGGACCTCAAGGCAOSGAGGAAGAAGTCC^ 

TCGCCCGTGGACTCCCTGGAGTCACCCCATGGCTACCTGTCAGACGTGGCCTCGCCGCCACTGCTGCCCTCCCCGTTCCAGCAGTCTCCGTCCGTGC 
CCCTCAACCACCTGCCTGGGATGCCCGACACCGACCTGGGCATCGGGCACCT 

CCGGCTGGCCTTTGAGACTGGCCCACCTCGTCTCTCCCACCTGCCTGTGGCCTCTGGCACCAGCACCGTCCTGGGCTCCAGCAGCGGAGGGGCCCTG

AATTTCACTGTGGGCGGGTCCACCAGTTTGAATGGTCAATGCGAGTGGCTGTCCCGGCTGCAG 

GGGGGAGTGTGGCACCAGGCCCCCTGAGCACAC^GGCCCCCTCCCTGCAGCATGGCATC^ 

GTCCCAGATGATGAGCTACCAGGGCCTGCCCAGCACCCGGCTGGCCACCCAGCCTCACCTGGTGCAGACCCAGCAGGTGCAGCCACAAAACTTAC^ 
ATGCAGCAGCAGAACCTGCAGCCAGCAAACATCCAGCAGCAGCAAAGCCTGCAGCCGCCACCACCACCACCACAGCCGCACCTTGGCGTGAGCTCAG
CAGCCAGCGGCCACCTGGGCCGGAGCTTCCTGAGTGGAGAGCCGAGCCAGGCAGACGTGCAGCCACTGGGCCCCAGCAGCCTGGCGGTGCACACTAT 
TCTGCCCCAGGAGAGCCCCGCCCTGCCCACGTCGCTGCCATCCTCGCTGGTCCC^^ 

AGCTACTCCTCGCCTGTGGACAACACCCCCAGCCACCAGCTACAGGTGCCTGAGCACCCCTTCCTCACCCCGTCCCCTGAGTCCCCTGACCAGTGGT 

CCAGCTCGTCCCCGCATTCCAACGTCTCCGACTGGTCCGAGGGCGTCTCCAGCCCTCCCACCAGCATGCAGTCCCAGATCGCCCGCATTCCGGAGGC 

CTTCAAGTAAACGGCGCGCCCCACGAGACCCCGGCTTCCTTTCCCAAGCCTTCGGGCGTCTGTGTGCGCTCTGTGGATGCCAGGGCCGACCAGAGGA 

GCCTTTTTAAAACACATGTTTTTATACAAAATAAGAACAAGGATTTTAATTTTTT^ 

CCTTTTTATTTATATGTACTGTTTTATCTGGCCCCAGGTAGAAACTTTT^ 

GTAGGATGAAAAGATTCTTCTGTGTTTATAAAATATAAACAAAGATTCATGATTTATAAATGCC^TTTATTTATTGATTCCTTTTTTCAAAATCCAA 
AAAGAAATGATGTTGGAGAAGGGAAGTTGAACGAGCATAGTCCAAAAAGCTCCTGGGGCGTCCAGGCCGCGCCCTTTCCCCGACGCCCACCCAACCC 
CAAGCCAGCCCGGCCGCTCCACCAGCATCACCTGCCTGTTAGGAGAAGCTGCATCCAGAGGCAAACGGAGGCAAAGCTGGCTCACCTTCCGCACGCG 
GATTAATTTGCATCTGAAATAGGAAACAAGTGAAAGCATATGGGTTAGATGTTGCCATGTGTTTTAGATGGTTTCTTGCAAGCATGCTTGTGAAAAT 
GTGTTCTCGGAGTGTGTATGCCAAGAGTGCACCCATGGTACCAATCATGAATCTTTGTTTCAGGTTCAGTATTATGTAGTTGTTCGTTGGTTATACA 
AGTTCTTGGTCCCTCCAGAACCACCCCGGCCCCCTGCCCGTTCTTGAAATGTAGGCATCATGCATGTCAAACATGAGATGTGTGGACTGTGGCACTT
GCCTGGGTCACACACGGAGGCATCCTACCCTTTTCTGGGGAAAGACACTGCCTGGGCTGACCCCGGTGGCGGCCCCAGCACCTCAGCCTGCACAGTG
TCCCCCAGGTTCCGAAGAAGATGCTCCAGCAACACAGCCTGGGCCCCAGCTCGCGGGACCCGACCCCCCGTGGGCTCCCGTGTTTTGTAGGAGACTT 
GCCAGAGCCGGGCACATTGAGCTGTGCAACGCCGTGGGCTGCGTCCTTTGGTCCT^^

TGGAGGGAGGCGGGGGCTGCCCTTGGGCCACCCCTCCTAGTTTGGGAGGAGCAGATTTTTGCAATACCAAGTATAGCCTATGGCAGAAAAAATGTCT 
GTAAATATGTTTTTAAAGGTGGATTTTGTTTAAAAAATCTTAATGAATGAGTCTGTTGTGTGTCATGCCAGTGAGGGACGTCAGACTCGGCTCAGCT 
CGGGGAGCCTTAGCCGCCCATGCACTGGGGACGCTCCGCTGCCGTGCCGCCTGCACTCCTCAGGGCAGCCTCCCCCGGCTCTACGGGGGCCGCGTGG
TGCCATCCCCAGGGGGCATGACCAGATGCGTCCCAAGATGTTGATTTTTACTGTGTTTTATAAAATAGAGTGTAGTTTACAGAAAAAGACTTTAAAA 
GTGATCTACATGAGGAACTGTAGATGATGTATTTTTTTCATCTTTTTTGTTAACTGATTTGCAATAAAAATGATACTGATGGTGATCTGGCTTCCAC 
TCCCCTCTGCTCTGGCCTTTGGCTCCCTTTCTGGGAGGGAGGCAGGGCTGCTATGCTCTGAGGGAGCCAGGAGTCGAGGGCCCCTTCTGCTGGGAGA
GTGACGGTGAGGCTGCCTAGTCCTGGCCCACGGGGGTGTGGGGACCACGCTGCCTCCAGGGACTCCATGGTGTCTGCAGCCTGCCTGGTCCAGGCCC
TTTGTAGGGAGATGGACACACAGCAGCAAGGGGGTTGCAGCCCTATGGGAGGTGGGGCTCGTGCCTGGGGTGACACGGCTCCCAGGACACCATGCGT 
GAGTCTGGCCCTCCTGGCAGCTCGGGGCTCTTCTCCTCTCAGCCTCGGAGGGATGAAGGTCCCACCCAGCCATCCTGGGGACAGCCCCTCAGGGAGC 
TCTGCAGCAGGCAGGGGGCTGCATAGGGAGGGCCTTGAGGCAGTGGCATGACCCCTCTACGGGCTGGAGACCACAGGGAGGACTCTGGCCCCTATAG
GAGGGCAAAGGGAGCTGTCGAAGCCTATGAGCAGGGCAGCATGGGTCGTGGCAGAGTCTGGAAAGTTGTGTGATCCTAAGGAACTGGCTGAGCGAGG
CAGAGGCGGCCTGGTCCTGGGGCCCTGCCCCTGAATAGAGCTGCCAGCCTCACACAAGGGTGGGCCCCTTCTCTCCCCACTGCCTGGGCCTCTGCCC
AGCCCCAGACCTTCAGGGCAGGCCAGTGGCTTCAAACCAGAGCGGTGGGGAGTCTGAGATCCCTCTTTGGATTGCAAAGCACTGCCTGCCCTGGGCC
CAGTCTCTCCAAGGAGGGATGTGAGCCCGAGGCCTCTACTATGGCTGGGGGCTGCGNNNNl^
NNNNNNNNNNNNlWNlWNNNNNNNNNtmNNNN^
NNNNNNNNNl^NNNNI^NNNNNNNNNNNNNNNN

NNNNNNNNN^NmNNNNCCGGCATCACTCAGTACAGCCACAGACAGCCTC^

GGTGAACCTTAGTGTCAGCTTGGCCGGK3CCATTTCGCCAAACGGAGTGTGTCTCTGAGGTG 

GTAAGGCAGTCCGCCCTCCCCAGTGTGGGTGGGCCCCGTGCGGTCCGCTGAGGCCCGGGGAGAACAATAGGCCGAGTGCCAGGGGTCCTCCCACCCA 
ACCGCTTGCACTGGACATTGGTCTTTTCCTGCCTTCAGACTTGGACTGAAAACGTGGGCTCTTCCTGGGCCTGGAGCCCACCGGCCTTCGGACTCGC 
AACGCCATCAGGCCTCCTGCCTGCAACTGCAGACCCTGGGAACTGCAGGCCTCGAGGTCGTGTGGCCAATTCCCTGCAATAACCTCTCCAGTGGCAC 
GTCTTATGGGCTCCGCTTGCCGGAAGAACCCTGACGAATGCGCACGAGGGAATATGATTCCACCGTAGAAAAGGAGGCGCTGACACCCACCATGCGG 
ACAAGCCTTGGAAGCACGGTGCTCAGTGAAGGGTGCCGGCCGCAAAGGGCCGCAGGTGCAGTGACTCCATTTCTATGGGGTGCCTGTGGTCACCGAA 

TTTGTTTTTATACTTTAAGTTTTAGTTTAT^ 

AAGAAAATGTGGCACATATATACCATGGAATACTATGCAGCCATAAAAAATGATGAGTTCATGTCCTTTGTAGGGACGTGGATGAAGACGGCCCGGG 
GTTTTACTTTGGAGTGACGCAAATGTTCTGGGCCCAGGGAGAGGTGGTGCGTGCACAGCCCTTCGAGTGC^ 

TTTTTTTTTTTTTTTGAGACGGAGTCTCGCTCTGTCACCCAGGCTGGAATGTGGTGGCGTGATCTTGGCTCACTGCAACCTCCGCCTCTGTGTTCAA 
GTGATTCTCCTGCCTCAGCCTCCAAAGTAACTGGGATTACAGGCCCCTACCACCACACCCGGCTGATTTTTTTGTATTTTTAGTAGAGATTGGGTTT 
CACCATGTTGGCCAGGCTGGTTTCAAACTCCTGATCrCAAGTGATCCACCCGCCTCGGCCTCTTAAAGTGCTGAGGTTACAGGCGTGAGCCACTGCA 
CCTGGCCTAACTCGTACACTTTAAAGTGGTTAATTTTAGGTTCTGTGAGTCTCACCTC^ 

AGACAGGGTCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATGATAGCTTATTGCAGCCTCAACCTCCTGGGCCCAGGTGATCCTCCCACCT 
TGGTCTCTCTAGCAGCTGGGATTACAGGTGCACACCACCTCACCCACCCAATTTTTTTGTTATTTTTTGCAGAGATGGGGTCTCACTTTGTTGCCCA 
GGTTGGTCTCGAACTCCTGAGCTCATGCAGTCCTCTCCTCACCTTGGCCTCCCAGAGTGCTGGGATTACAGGTGTGGGCCArTGCGCCTGGCCACAT 
ATGCAACTTTAAATATTTCAGTAGCCACATTAAAAAGGTAAAAAGAAACAGGCAAAGTGTATATTAATACTACAGCTTAGTTAATCCAATATATCAA
AAATATTATTTTAACACAACATAAAAAGTGAATGAGTTTTTTTTTGTTGTTTTTTGTTTTTTGAGACAGAGTCTCCCTCTGTCACCCAGGCTGGAGT 
GCAGTGGTGCGATCTCAGCT^CTGCAACTTCCGCCTCCCGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACCTG 
CCACCACGCCCAGCTAATTTTTGTATTTTTAGTAGAGAGGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGATCTCTTGACCTCGTGATCGGCCCAC 
CTCGGCCTCCCAAAGTGCTGGGATTATAGGCGTAAGCCACCACGCCCGGCCGTGAATGAGATGTTTTATATTCTTTTTGTTTGTTTGTTTTTGGTAC 
TGTGTCTTTGAAACC<^GGATGTACAGTACACTCACAGCACCCTAGATTGGGCCTGGCCACACTTTAAAAGGTGTTCAGTATCTCCAGGAGGCTCAC 
GTCCAGATTCGGACTCTTCAGCAGGATGCCAAGGGGGCTCCAGACAGTTCCACCCAGCGCAGGAAAACACCAAGTGTATCTTCAGCCTGGTGACTCA 
CTCCACGCCGCTGCCGGCGTGAACCCCGAGTGTGGCGCTGCGTCCCTCTGGATAGAGCAAAGGAGGGTCCCAAGAGCCCGGAGGCACCAACTCATCA 
GGGCCCACAAATCAGGCCCCCGGCAGGCAGGGCTGCCTCCATCCAAGAGGGAAGCACGGTGCCGTAGCCCCCGGGCTGCTCCAGGGATCGACAGGCC 
TGTGCTGCCTTCATCCCAAAAGTCTCACTTGCAAAGGGTGTCCGTGGGGCTCAACTGCGTGCTCAGCGGTGGACTGATTTCGTTCTGTGTGGCTCCT
TATCTCATCGTGGACCCAAATCCACGAACCACACATGAGGAGCT^ 

CTGTTTCCCGTGGTGAGCATCCACCAGCAGCGGTTCCTGGTGGGACACCGACGTATTCGTCTGCAAAAGTAAAAGTGTTTCAAAACAAAACCGGGTC 
CAGCTTGGTCCCGGTGCGTTGAAAGTGCTGGCGTTGGCCAGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATC 

acaaggtcaggagattgagaccatcctggctaacacggtgaaaccccgtctctactaaaaaatacaaaaaa™ 

GTAGTCCTAGCTACTTGGGAGGCTGAGGCAGGAGAATGGCGTGAACCTGGGAGGCGGAGCTTGCAGTGAGCAGAGATCATGCCTCTGCACTCCAGCC 

TGGGCGACAGAGCAAGACACTCTCACAAAAAAAAAAAAAAAAAAAAAGTGCTGGCGTTAGTGGAGGTCTCTCAGAGGGGGCGTT^ 

CGTTACTCAGGCTTTGCAGACGAAGACGCAGGGTCCGTGAGTGAGCGAGCTTTGGGAAAGGCTGTTCTTCAATCTGCATCTTGAAGGTGGCTGGGTG

GCGCCTCAGTCGCCTGCCCCGACGTGACTCCCGATCTGGCTGCTTCG^^ 

AGTGTCCTCGTGGGTCAAGTGGGGACCTGGGTGCTGGCCCTGGGCCTGCGATGGTGAGCTGCTGGTGCACGCTGGGCATGGGGTGAGGGCTCAGAAA 
GGGAACAGCTCTTGTTATCGTTAGGAGGTTTTCAATCGCATTCTTTGTTCTCCACCTGCATGCCGTGTTGGCACAGTTAAGC 

GCAGGTGTCCTAACCGTCTACATTTTAAGATGGACAAGTCTAGGCTCATGGAGATTGCAGGACTGAGCTGCGAAGCTTTATTCATGCCCAACACGAC
AGGCGCCAGGTGCCCAGGGCCTCAGAGGTGGGTTTCCGTTGTTCTTCIACACAACCTTGAACTTGTGGCCCATCCCTGGACCTTACAGCACAGGACAC 
ACTCTCAGTGTTTGTTAGGAATCTATAATTTTCTGCGGAGTCACGCGTGATATTTGCTGTGGCTTTTGGTATATCTTCCACTCTATCGCACGGGATT 
TTCTGGTGAAAATGTGCCCTGGGTCCACACAAACGTGCACATTCTACCACAGCCTT^ 

TTAGAAAAGTTATAACTACGGCCAGGCGTGGTGGCTCACACCTGTCATCCCAGCACTTCGGGAGGCCGAGGCGAGAGGATCACCTGTGGTCAGGAGT 

TCCACACCAGCCTGGTCAACTGGTGAAACCCCCCTCTCCACTAAAAATACAAAAATTAGCTGGGCACCATGGTGGGCACCTAGAATCTCAGCTACTT 

GGGGGGGCTGAGGCAGGAGAATTGCTTGAATTCGGAAGGAGGAGGTTGCAGTGAGCTGAGATTGCACCACTGCACTCCAGCCTGGGGGATACAGTGA

GACTCGGTCTCAAAAAAAAAAAAGAAAAGTGTATAACTTTACTTTTTTTTCTTTTTGTGGAGATGGAATCTTGCTATGTTGCCCAGGCTC 

ACCCCTAGGTTCAAGTGATCCTCCCACTTTGGCCTCCCAAAATGCTGAGACGACAGATGGGAGCAACCTCACCTGGCAAGCGTGCACCTGTTACTTT 

GGCCGACTCAGAGGCGGCTCTCTGCCTCCCCCTGGTGGCACTGCTCTGCCAGCGCCCCACAGGTCCTCCCGAGTGACCACAGGGCAGAAATGGCCCT 

CACCCCACGCCCAGGGCCATCCCCCGCGACACTGCTATTATCCTGTTTTACACATGAGGAAGCTGGGGCTCAAAGAGGTTAGGCGACTTAGCCAAGT 

TCGCCCATTCACTGAGCCAGGGCTATGTGATTCCCGGTCCGTGTCAAACCACGAGGAGTCACCTGCCCTCAGAGGAGCAGGAAGGCCTGTAGAGTTG 

ATGCCATCCAGCCTGGTGACTTTTCAGACAGGGAGACGGCTGCTCAGAGGGGAGAAGGGGGTTGTTGGGCCACTTTGCCAGGAGTCAATACCATGGC 

CAGCCTGGCATCGGATACTTTGCCAGGAGTTAATGCCACGACTGGCCTGGCATCGAATACTGGAGCTCTTCCCAGCTCCACCCGGGCCCTTTTCTCT 

TCCCTATCTACAGTCCCCAGGCCACCAAGTCCATGTCCACCACTTAAATCATCACTCAATGCTCAGGACTGTCTCCTGCCTGCTGTGTGTGAACCCC 

GTGCTGTGCTGGGGTTGTGCCCTCGGGGACTCTCCCGGGTTCTTGGGGACTCTCCTGGGTTCAGCGCCTCTTTCTCCTAGGCCACCCAAGGCTGCCT 

CCTGGGCCCTGGCTCTCCCTAGCCCCTAGCTCGAACCCTAACCTGGAGAGCCTTTGGGGCCCAAACTGGTTGCATCGCTCCTGGCTGAAAAGCCTCT 

GGAAGCTTCCTATACTTCCAGACACCACCTGCGGGTCTCTTAGCAAGCCTACAGAGTCCACACTAGCCTACGCCCACTGGCCGCTCACCCCGCAGAA 

GCCTCATGTGGACTGTTCACTTCAGCCAGAAATGCTCCTTCTCCTCCTGCCTGTCCCCAGCAGACACTGCATGGTGCCCTCTGTATGGGCACCTCTG 

TGGCCAGCCAGCATCCATGTTCCCCGGAGGCAGAGTCTGGGCTTGTCTGTCTCACGGCCACAGCCCAGCTCCTAGCAGTGCCTGGGGTACAGCAGGG 

ACTCAGCAGACATCTGTGCATGGTTTGGGGATAGAGGACTCGTGGGCTTACCAAGACCCTGCCTCAGCTCAAGGACAGCCAGCAGTTCCTCCTGTGG 

CCCTGGGGGCCTGCGGTCCCAGGTTACTGTGACATCGACATCAGCCTTGAGCTTTCTGCTTCATTCCTAGTGCTTTTTAGCACCTTACATGGTAAGT 

ACTTGGCTCAATGATTCCAGAAAAATGTTCTCTGAGACCTGCTATGTGCCCAACATGCTTGAAGATGCTGGAGATGAGGAGCATGACGCTCCTGCAG 

TAACGAGGCTGCTGGCAATGCCGTCAATGTGCCCTGTGCCTCCGGGGAGGCCTGGTGTGGAGGATACGCTTGCCACCCTGTGAGGTCAGGTCCAGGA 

AGTCCTTGCAGGAGGCCGGGAAAGGGCAAGGCTGTCACCTTCTTTTACAGGGGAGGAGGCAGAGGCTTGGAAAGGCCAGGTAGCTTGCCCAAGGCTG 

AACACCTGGGTAGTGGCTGAGGGTCTCTGACTTCTGGAAGCTTGCAGTACAGCTGGACACACGGTAGTTAGCCCACCATTGCTGGAGCCATCTTACC 

TGTGCTCTGCTAGCCAGGTGGGCGCCCAAAAGGAGATGCCTTTGCCTGTGCTGGGAGGAGGCGCTGTGGGACCCATTCCCCAGCAGCCGAGAGCAAG 

GGCACATGTGCCAGGGCGGGGATGCACCTTCAGCCTCATGTCAGCCCTGGGAAGGCCTAGAAAGTGGACAGTGCTGTCTCGCTGCACACCCCTGGCG 

AGCACACCGGGCGAGCACTCCAGGTTAGGGTGGGGCTCAGAGGTGAGTAGAACACGGGAAGGGGGTGGCAGCGTGGTGAGCATGTTTGCGGACCCTA 

TGCTGTGTGCTTGGCAGGCCTTGTGCATGGAAGTGAGTGGGACTGAAGCTCCCTCCACACTGCCCACCATGGCCCTGGGGGTCTGGCCTTCCCCTCA 

CAGAGGTGACTTCCTCTCCTCTATGAGGGTAAGAAGAGGAAGCCTGGCCCACCCTCTGGCCAGCCTGGTGGCATGCTGACAGGGAACCCCTGCAGGG 

GCCTGATGAACAAACGCCACAGAACAAACTCTACGGACCCTGCCTCCAGATCTGCTTCCTCTCCCCTGATTCCAGCTTCCCAGCCCTATTCCCTCCT 

CCTCCAGGAAGAGCTCCGTGTCAGTGCTCTCCACCTTGGAGGCCTTCATATGTTTTTTTGCCTGGACCACCCAGCTAACCGTCTCACAGAGACTGCC 

CCTTCCTCTCTCCAGGCCAAGCCTCCAGCTTCAGCGTCCAGGACTAAGGGTCTGTGTCAGGGGCTGAGGACAGTGTCCGGCAGGGAGCAGGCCTGCC 

CAGTAAACCTCCACAAGCACCCAGACATACTCGCCACTGCTATTTTACACCAGGTAGGAAAGAGGAGCAGAAGCTCACAGTGAGGGGCCGTCCTGCG

TTCACTGAGACTGCTGCTCTGCTGGGGGCTCCCTGGAGGACCGCCACTGGCCCGCCAGTTGTCAGCAAAGAGCCAGGTCCACCCATCAGCCTCTGGC 

ATTTCAGGCCCCTATC^TAGGAGGCTATAAGCACAGGCTAGGGGTGCTGAGTGCCTGCCTTCTAGACCAACACTGTCCAGTGGAAATGTAACACCAG 

CCCCATGTGGAGTTTTACATTTCCCAGCAGCTATGTCAAAAACAAAAGAAACAGGTGAAATAAAATTAAATAGTATTTTGTTTAATCTGATACATCC 

AAGATATTGTCATTTTAATATGCAATAAAAAATGATTGATTCGATATTTGCATCCTTCACAACCTACCATGTAGTTTACACTCAGAGCCCAGCTCAG 

TCTCACAGCAAGCACCAGCGGCCCCCGTGCTGGACAGCATGGCTAATGGGAGGAGCTGACCCGTCGGAGGCACCAAGGTCACGCTGTGACCAAGGCT 

GTGGTGACGCAAAGGCAGCGGGTTTCCTGGCACAGCCTGGAGGTGCACCAGCCTGTCCTGTGTGCTCTGCATAATGTGGTACAGCAGTGGTCACAGA 

GCTTTGCACCAGATGCTGATTCCAGCCCACCTAGGGGGGTTCAGGGAAGTGGTGACACTTGTGTCCAGCCTCCAGGTGTGAGCAGGGGCAGAGACTT 

GGAGAATGGAGAAACCAGCGGATGCTGCAGGCAAAAGCAGCAAAGCCCCACCAGGAATAGTGATGGCCACCACCGGCGGGACACCTCCCAGCACTGG

GTGGAGGCCTCTGCTGGCTCGC^GTCAGTCACCTGATGTCCCCATTCGCCCCTGCCATGCTTGTTGTCCTGGCAACCCCATTTCAGCCCTGAAACGT 

TTAGGTGCTCATCTTGCTCTATCTTAGAGATGAAGACCTGATGCTCAGAGACTAAGGAAGTTGAAAA^ 

GCCTTCAACGGAAGGGTCTATGGCCCAGAGGCTGGATGGGCTCGGGTGTAGGGTGCAGGAACCCTCCACTCCACATCAGGCTACAAACTGCTGAGAG 
CTGTCTCCGTGCACTCAGGATTAGGGAGAAGAGGGGGAAGAAAGACAGCCGCGCCCCCTGCAGGTGATGTGGTGATTCACACCCCTAGGAGCCAACA
GAAGCCGCTGTCTAACATCCCTGCCCTGAGGTCCAGAGACCTTGCATTTTATGCCTAGGGCAGGGTTGGTTTCTGCAGCTATGGTCTCTCCCATAGC 
TTAGTGCCCACCCCGGTGGGACCACAGGGTTTCGGAGGACCCTAGAGGGTCGAGCCCTGGGCTCCTGGCTCTTGGGTTCTTTGAACTCGCGAGCGGC 
AAAAGGCGAGCCCCACGCCTGAGAGGCAGCATCCACCTCTGCATCCCCCTCTTTGGAAAGGGCAAACAATGACACCTCCTTCCGCGGAGGGTGCAGA 
GCGCCTCGACCACTGGGAAGAAGCAGATGGGGGCAGCGGGGCTGGGGCTGGGCCCTACCACGTGCTCCCTAACTCTGTTCAAACAGGTCAGGAATCT 
GACTCCTGAAAGGCAAATGTTAGCTGAGCAGATTGGCCCTGGTGCCACGTCAGCCGGACTGAAGCTCTCCCTGCACCGCCTGCGGCCCTGCTGGCGT 
GGCACGGATATCACCCCCTCCCTCACCAGCCGCCGGCCCAGCTGGGGACAAGGGTGGCTCCCACCCAGGGAGGACCTCAGCCGCTGCTGGGAGCTCA 
GCCTCCCGCGATGCCATCGGGCGGGCACCCCCACCACGGCGCCTGCCCTCTGGAAGGAGCAGCAGAGGAAGGCCGCGGCTCTTCCCTGCCCTTGC 

HUMAN SEQUENCE - mRNA 

AGGCCCGCGATGCTCCCAGCCCGGTGAGACCTGCCTGAATGGCGGGAAGTGTGAAGCGGCCAATGGCACGGAGGCCTGCGTCTGTGGCGGGGCCTTC 
GTGGGCCCGCGATGCCAGGACCCCAACCCGTGCCTCAGCACCCCCTGCAAGAACGCCGGGACATGCCACGTGGTGGACCGCAGAGGCGTGGCAGACT 
ATGCCTGCAGCTGTGCCCTGGGCTTCTCTGGGCCCCTCTGCCTGACACCCCTGGACAATGCCTGCCTCACCAACCCCTGCCGCAACGGGGGCACCTG 
CGACCTGCTCACGCTGACGGAGTACAAGTGCCGCTGCCCGCCCGGCTGGTCAGGGAAATCGTGCCAGCAGGCTGACCCGTGCGCCTCCAACCCCTGC 
GCCAACGGTGGCCAGTGCCTGCCCTTCGAGGCCTCCTACATCTGCCACTGCCCACCCAGCTTCCATGGCCCCACCTGCCGGCAGGATGTCAACGAGT
GTGGCCAGAAGCCCGGGCTTTGCCGCCACGGAGGCACCTGCCACAACGAGGTCGGCTCCTACCGCTGCGTCTGCGjCGCCACCCACACTGGCCCCAA 
CTGCGAGCGGCCCTACGTGCCCTGCAGCCCCTCGCCCTGCCAGAACGGGGGCACCTGCCGCCCCACGGGCGACGTCACCCACGAGTGTGCCTGCCTG 
CCAGGCTTCACCGGCCAGAACTGTGAGGAAAATATCGACGATTGTCCAGGAAACAACT 

ACAACTGCCGCTGCCCGCCAGAGTGGACAGGTCAGTACTGTACCGAGGATGTGGACGAGTGCCAGCTGATGCCAAATGCCTGCCAGAACGGCGGGAC 
CTGCCACAACACCCACGGTGGCTACAACTGCGTGTGTGTCAACGGCTGGACTGGTGAGGACTGCAGCGAGAACATTGATGACTGTGCCAGCGCCGCC
TGCTTCCACGGCGCCACCTGCCATGACCGTGTGGCCTCCTTCTACTGCGAGTGTCCCCATGGCCGCACAGGTCTGCTGTGCCACCTCAACGACGCAT 
GCATCAGCAACCCCTGTAACGAGGGCTCCAACTGCGACACCAACCCTGT 

CTGCAGCCAGGACGTGGATGAGTGCTCGCTGGGTGCCAACCCCTGCGAGCATGCGGGCAAGTGCATCAACACGCTGGGCTCCTTCGAGTGCCAGTGT 
CTGCAGGGCTACACGGGCCCCCGATGCGAGATCGACGTCAACGAGTGCGTCTCGAACCCGTGCCAGAACGACGCCACCTGCCTGGACCAGATTGGGG 
AGTTCCAGTGCATCTGCATGCCCGGCTACGAGGGTGTGCACTGCGAGGTCAACACAGACGAGTGTGCCAGCAGCCCCTGCCTGCACAATGGCCGCTG 
CCTGGACAAGATCAATGAGTTCCAGTGCGAGTGCCCCACGGGCTTCACTGGGCATCTGTGCCAGTACGATGTGGACGAGTGTGCCAGCACCCCCTGC 
AAGAATGGTGCCAAGTGCCTGGACGGACCCAACACTTACACCTG 

GCGACCCCGACCCCTGCCACTACGGCTCCTGCAAGGACGGCGTCGCCACCTTCACCTGCCTCTGCCGCCC^GGCTACACGGGCCACCACTGCGAGAC 
CAACATCAACGAGTGCTCCAGCCAGCCCTGCCGCCACGGGGGCACCTGCCAGGACCGCGACAACGCCTACCTCTGCTTCTGCCTGAAGGGGACCACA 
GGACCCAACTGCGAGATCAACCTGGATGACTGTGCCAGC&GCCCCTGCGAC^ 

AGCCGGGCTACACAGGGGAGCATGTGTAACATCAACATCGATGAGTGTGCGGGCAACCCCTGCCACAACGGGGGCACCTGCGAGGACGGCATCAATG
GCTTCACCTGCCGCTGCCCCGAGGGCTACCACGACCCCACCTGCCTGTCTGAGGTCAATGAGTGCAACAGCAACCCCTGCGTCCACGGGGCCTGCCG 
GGACAGCCTCAACGGGTACAAGTGCGACTGTGACCCTGGGTGGAGTG 

AACGGCGGCACCTGCAAAGACATGACCAGTGGCTACGTGTGCACCTGCCGGGAGGGCTTCAGCGGTCCCAACTGCCAGACCAACATCAACGAGTGTG 

CGTCCAACCCATGTCTGAACCAGGGCACGTGTATTGACGACGTTGCCGGGTACMGTGCAACTGCCTGCTGCCCTACACAGGTGCCACGTGTGAGGT 

GGTGCTGGCCCCGTGTGCCCCCAGCCCCTGCAGAAACGGCGGGGAGTGCAGGCAATCCGAGGACTATGAGAGCTTCTCCTGTGTCTGCCCCACGGGC 

TGNNNNNNNNNGGCAGACCTGTGAGGTCGACATCAACGAGTGCGTTCTGAGCCCGTGCCGGCACGGCGCATCCTGCCAGAACACCCACGGCGGCTAC 

CGCTGCCACTGCCAGGCCGGCTACAGTGGGCGCAACTGCGAGACCGACATCGACGACTGCCGGCCCAGACCCGTGTCACAACGGGGGCTCCTGCACA 

GACGGCATCAACACGGCCTTCTGCGACTGCCTGCCCGGCTTCCGGGGCACTTTCTGTGAGGAGGACATCAACGAGTGTGCCAGTGACCCCTGCCGCA 

ACGGGGCCAACTGCACGGACTGCGTGGACAGCTACACGTGCACCTGCCCC^ 

AGAGAGCTCCTGCTTC^CGGTGGCACCTGCGTGGACGGCATCAACTCGTT^ 

GATGTCAATGAGTGCGACTCACAGCCCTGCCTGCATGGCGGCACCTGTCAGGACGGCTGCGGCTCCTACAGGTGCACCTGCCCCCAGGGCTACACTG 

GCCCCAACTGCCAGAACCTTGTGCACTGGTGTGACTCCTCGCCCTGCAAGAACGGCGGCAAATGCTGGCAGACCCACACCCAGTACCGCTGCGAGTG 

CCCCAGCGGCTGGACCGGCCTTTACTGCGACGTGCCCAGCGTGTCCTGTGAGGTGGCTGCGCAGCGACAAGGTGTTGACGTTGCCCGCCTGTGCCAG

CATGGAGGGCTCTGTGTGGACGCGGGCAACACGCACCACTGCCGCTGCCAGGCGGGCTACACAGGCAGCTACTGTGAGGACCTGGTGGACGAGTGCT 

CACCCAGCCCCTGCCAGAACGGGGCCACCTGCACGGACTACCTGGGCGGCTACTCCTGCAAGGTGCGTGGCCGGCTACCACGGGGTGAACTGCTCTG 

AGGAGATCGACGAGTGCCTCTCCCACCCCTGCCAGAACGGGGGCACCTGCCTCGACCTCCCCAACACCTACAAGTGCTCCTGCCCACGGGGCACTCA 

GGGTGTGCACTGTGAGATCAACGTGGACGACTGCAATCCCCCCGTTGACCCCGTGTCCCGGAGCCCCAAGTGCTTTAACAACGGCACCTGCGTGGAC 

CAGGTGGGCGGCTACAGCTGCACCTGCCCGCCGGGCTTCGTGGGTGAGCGCTGTGAGGGGGATGTCAACGAGTGCCTGTCCAATCCCTGCGACGCCC 

GTGGCACCCAGAACTGCGTGCAGCGCGTCAATGACTTCCACTGCGAGTGCCGTGCTGGTCACACCGGGCGCCGCTGCGAGTCCGTCATCAATGGCTG 

CAAAGGCAAGCCCTGCAAGAATGGGGGCACCTGCGCCGTGGCCTCCAACACCGCCCGCGGGTTCATCTGCAAGTGCCCTGCGGGCTTCGAGGGCGCC 

ACGTGTGAGAATGACGCTCGTACCTGCGGCAGCCTGCGCTGCCTCAACGGCGGCACATGCATCTCCGGCCCGCGCAGCCCCACCTGCCTGTGCCTGG 

GCCCCTTCACGGGCCCCGAATGCCAGTTCCCGGCCAGCAGCCCCTGCCTGGGCGGCAACCCCTGCTACAACCAGGGGACCTGTGAGCCCACATCCGA 

GAGCCCCTTCTACCGTTGCCTGTGCCCCGCCAAATTCAACGGGCTCTTGTGCCACATCCTGGACTACAGCTTCGGGGGTGGGGCCGGGCGCGACA 

CCCCCGCCGCTGATCGAGGAGGCGTGCGAGCTGCCCGAGTGCCAGGAGGACGCGGGCAACAAGGTCTGCAGCCTGCAGTGCAACAACCACGCGTGCG 

GCTGGGACGGCGGTGACTGCTCCCTCAACTTCAATGACCCCTGGAAGAACTGCACGCAGTCTCTGCAGTGCTGGAAGTACTTCAGTGACGGCCACTG 

TGACAGCCAGTGCAACTCAGCCGGCTGCCTCTTCGACGGCTTTGACTGCCAGCGTGCGGAAGGCCAGTGCAACCCCCTGTACGACCAGTACTGCAAG 

GACCACTTCAGCGACGGGCACTGCGACCAGGGCTGCAACAGCGCGGAGTGCGAGTGGGACGGGCTGGACTGTGCGGAGCATGTACCCGAGAGGCTGG 

CGGCCGGCACGCTGGTGGTGGTGGTGCTGATGCCGCCGGAGCAGCTGCGCAACAGCTCCTTCCACTTCCTGCGGGAGCTCAGCCGCGTGCTGCACAC 

CAACGTGGTCTTCAAGCGTGACGCACACGGCCAGCAGATGATCTTCCCCTACTACGGCCGCGAGGAGGAGCTGCGCAAGCACCCCATCAAGCGTGCC 

GCCGAGGGCTGGGCCGCACCTGACGCCCTGCTGGGCCAGGTGAAGGCCTCGCTGCTCCCTGGTGGCAGCGAGGGTGGGCGGCGGCGGAGGGAGCTGG 

ACCCCATGGACGTCCGCGGCTCCATCGTCTACCTGGAGATTGACAACCGGCAGTGTGTGCAGGCCTCCTCGCAGTGCTTCCAGAGTGCCACCGACGT

GGCCGCATTCCTGGGAGCGCTCGCCTCGCTGGGCAGCCTCAACATCCCCTACAAGATCGAGGCCGTGCAGATGAGACCGTGGAGCCGCCCCCGCCGG 

CGCAGCTGCACTTCATGTACGTGGCGGCGGCCGCCTTTGTGCTTCTGTTCTTCGTGGGCTGCGGGGTGCTGCTGTCCCGCAAGCGCCGGCGGCAGCA 

TGGCCAGCTCTGGTTCCCTGAGGGCTTCAAAGTGTCTGAGGCCAGCAAGAAGAAGCGGCGGGAGCCCCTCGGCGAGGACTCCGTGGGCCTCAAGCCC 

CTGAAGAACGCTTCAGACGGTGCCCTCATGGACGACAACCAGAAT^

TGGTTCTGCCTGACCTGGACGACCAGACAGACCACCGGCAGTGGACTCAGCAGCACCTGGATGCCGCTGACCTGCGCATGTCTGCCATGGCCCCCAC 
ACCGCCCCAGGGTGAGGTTGACGCCGACTGCATGGACGTCAATGTCCGCGGGCCTGATGGCTTCACCCCGCTCATGATCGCCTCCTGCAGCGGGGGC 
GGCCTGGAGACGGGCAACAGCGAGGAAGAGGAGGACGCGCCGGCCGTCATCTCCGACTTCATCTACCAGGGCGCCAGCCTGCACAACCAGACAGACC 
GCACGGGCGAGACCGCCTTGCACCTGGCCGCCCGCTACTCACGCTCTGATGCCGCCAAGCGCCTGCTGGAGGCCAGCGCAGATGCCAACATCCAGGA 
CAACATGGGCCGCACCCCGCTGCATGCGGCTGTGTCTGCCGACGCACAAGGTGTCTTCCAGATCCTGATCCGGAACCGAGCCACAGACCTGGATGCC 
CGCATGCATGATGGCACGACGCCACTGATCCTGGCTGCC^ 

CCGTAGATGACCTGGGCAAGTCCGCCCTGCACTGGGCCGCCGCCGTGAACAATC 

TATGCAGAACAACAGGGAGGAGACACCCCTGTTTCTGGCCGCCCGGGAGGGCAGCTACGAGACCGCCAAGGTGCTC

GACATCACGGATCATATGGACCGCCTGCCGCGCGACATCGCACAGGAGCGCATGCATCACGACATCGTGAGGCTGCTGGACGAGTACAACCTGGTGC
GCAGCCCGCAGCTGCACGGAGCCCCGCTGGGGGGCACGCCCACCCTGTCGCCCCCGCTCTGCTCGCCCAACGGCTACCTGGGCAGCC 
CGTGCAGGGCAAGAAGGTCCGCAAGCCCAGCAGCAAAGGCCTGGCCTGTGGAAGCAAGGAGGCCAAGGACCTCAAGGCACGGAGGAAGAAGTCCCAG 
GACGGCAAGGGCTGCCTGCTGGACAGCTCCGGCATGCTCTCGCCCGTGGACTCCC^ 

CACTGCTGCCCTCCCCGTTCCAGCAGTCTCCGTCCGTGCCCCTCAACCACCTGCCTGGGATGCCCGACACCCACCTGGGCATCGGGCACCTGAACGT 
GGCGGCCAAGCCCGAGATGGCGGCGCTGGGTGGGGGCGGCCGGCTGGCCTTTGAGACTGGCCCACCTCGTCTCTCCCACCTGCCTGTGGCCTCTGGC
ACCAGCACCGTCCTGGGCTCCAGCAGCGGAGGGGCCCTGAATTTCACTGTGGGCGGGTCCACCAGTTTGAATGGTCAATGCGAGTGGCTGTCCCGGC 
TGCAGAGCGGCATGGTGCCGAACCAATACAACCCTCTGCGGGGGAGTGTC 

GGTAGGCCCGCTGCACAGTAGCCTTGCTGCCAGCGCCCTGTCCCAGATGATGAGCTACCAGGGCCTGCCCAGCACCCGGCTGGCCACCCAGCCTCAC 
CTGGTGCAGACCCAGCAGGTGCAGCCACAAAACTTACAGATGCAGCAGCAGAACCTGCAGCCAGCAAACATCCAGCAGCAGCAAAGCCTGCAGCCGC
CACCACCACCACCACAGCCGCACCTTGGCGTGAGCTCAGCAGCCAGCGGCCACCTGGGCCGGAGCTTCCTGAGTGGAGAGCCGAGCCAGGCAGACGT 
GCAGCCACTGGGCCCCAGCAGCCTGGCGGTGCACACTATTCTGCCCCAGGAGAGCCCCGCCCTGCCCACGTCGCTGCCATCCTCGCTGGTCCCACCC
GTGACCGCAGCCCAGTTCCTGACGCCCCCCTCGCAGCACAGCTACTCCTCGCCTGTGGACAACACCCCCAGCCACCAGCTACAGGTGCCTGAGCACC 
CCTTCCTCACCCCGTCCCCTGAGTCCCCTGACCAGTGGTCCAGCTCGTCCCCGCATTCCAACGTCTCCGACTGGTCCGAGGGCGTCTCCAGCCCTCC 
CACCAGCATGCAGTCCCAGATCGCCCGCATTCCGGAGGCCTTCAAGTAA 

HUMAN SEQUENCE - CODING 

ATGTACGTGGCGGCGGCCGCCTTTGTGCTTCTGTTCTTCGTGGGCTGCGGGGTGCTGCTGTCCCGCAAGCGCCGGCGGCAGCATGGCCAGCTCTGGT 

TCCCTGAGGGCTTCAAAGTGTCTGAGGCCAGCAAGAAGAAGCGGCGGGAGCCCCTCGGCGAGGACTCCGTGGGCCTCAAGCCCCTGAAGAAC 

AGACGGTGCCCTCATGGACGACAACCAGAATGAGTGGGGGGACGAGGACCTGGAGACCAAGAAGTTCCGGTTCGAGGAGCCCGTGGTTCTGCCTGAC

CTGGACGACCAGACAGACCACCGGCAGTGGACTCAGCAGCACCTGGATGCCGCTGACCTGCGCATGTCTGCCATGGCCCCCACACCGCCCCAGGGTG 

AGGTTGACGCCGACTGCATGGACGTCAATGTCCGCGGGCCTGATGGCTTCACCCCGCTCATGATCGCCTCCTGCAGCGGGGGCGGCCTGGAGACGGG 

CAACAGCGAGGAAGAGGAGGACGCGCCGGCCGTCATCTCCGACTTCATCTACCAGGGCGCCAGCCTGCACAACCAGACAGACCGCACGGGCGAGACC

GCCTTGCACCTGGCCGCCCGCTACTCACGCTCTGATGCCGCCAAGCGCCTGCTGGAGGCCAGCGCAGATGCCAACATCCAGGACAACATGGGCCGCA

CCCCGCTGCATGCGGCTGTGTCTGCCGACGCACAAGGTGTCTTCCAGATCCTGATCCGGAACCGAGCCACAGACCTGGATGCCCGCATGCATGATGG 

CACGACGCCACTGATCCTGGCTGCCCGCCTGGCCGTGGAGGGCATGCTGGAGGACCTCATCAACTCACACGCCGACGTCAACGCCGTAGATGACCTG 

GGCAAGTCCGCCCTGCACTGGGCCGCCGCCGTGAACAATGTGGATGCCGC^ 

GGGAGGAGACACCCCTGTTTCTGGCCGCCCGGGAGGGCAGCTACGAGACCGCCAAGGTGCTGCTGGACCACTTTGCCAACCGGGACATCACGGATCA 
TATGGACCGCCTGCCGCGCGACATCGCACAGGAGCGCATGCATCACGACATCGTGAGGCTGCTGGACGAGTACAACCTGGTGCGCAGCCCGCAGCTG
CACGGAGCCCCGCTGGGGGGCACGCCCACCCTGTCGCCCCCGCTCTGCTCGCCCAACGGCTACCTGGGCAGCCTCAAGCCCGGCGTGCAGGGCAAGA 
AGGTCCGCAAGCCCAGCAGCAAAGGCCTGGCCTGTGGAAGCAAGGAGGCCAAGGACCTCAAGGCAC^

CCTGCTGGACAGCTCCGGCATGCTCTCGCCCGTGGACTCCCTGGAGTCACCCC^TGGCTACCTGTCAGACGTGGCCTCGCCGCCACTGCTGCCCTCC
CCGTTCCAGCAGTCTCCGTCCGTGCCCCTCAACCACCTGCCTGGGATGCCCGACACCCACCTGGGCATCGGGCACCTGAACGTGGCGGCCAAGCCCG 
AGATGGCGGCGCTGGGTGGGGGCGGCCGGCTGGCCTTTGAGACTGGCCCACCTCGTCTCTCCCACCTGCCTGTGGCCTCTGGCACCAGCACCGTCCT 
GGGCTCCAGCAGCGGAGGGGCCCTGAATTTCACTGTGGGCGGGTCCACCAGTTTGAATGGTCAATGCGAGTGGCTGTCCCGGCTGCAGAGCGGCATG 
GTGCCGAACCAATACAACCCTCTGCGGGGGAGTGTGGCACCAGGCCCCCTGAGCACACAGGCCCCCTCCCTGCAGCATGGCATGGTAGGCCCGCTGC 
ACAGTAGCCTTGCTGCCAGCGCCCTGTCCCAGATGATGAGCTACCAGGGCCTGCCCAGCACCCGGCTGGCCACCCAGCCTCACCTGGTGCAGACCCA 
GCAGGTGCAGCCACAAAACTTACAGATGCAGCAGCAGAACCTGCAGCCAGCAAACATCCAGCAGCAG 

CAGCCGCACCTTGGCGTGAGCTCAGCAGCCAGCGGCCACCTGGGCCGGAGCTTCCTGAGTGGAGAGCCGAGCCAGGCAGACGTGCAGCCACTGGGCC 
CCAGCAGCCTGGCGGTGCACACTATTCTGCCCCAGGAGAGCCCCGCCCTGCCCACGTCGCTGCCATCCTCGCTGGTCCCACCCGTGACCGCAGCCCA 
GTTCCTGACGCCCCCCTCGCAGCACAGCTACTCCTCGCCTGTGGACAACACCCCCAGCCACCAGCTACAGGTGCCTGAGCACCCCTTCCTCACCCCG 
TCCCCTGAGTCCCCTGACCAGTGGTCCAGCTCGTCCCCGCATTCCAACGTCTCCGACTGGTCCGAGGGCGTCTCCAGCCCTCCCACCAGCATGCAGT 
CCCAGATCGCCCGCATTCCGGAGGCCTTCAAGTAA 